








Definition of some units

Angstrom: $1\ \text{Å} = 10^{-10}$ m (order of magnitude of the atomic dimensions)
Fermi: $1\ \text{F} = 10^{-15}$ m (order of magnitude of the nuclear dimensions)
Barn: $1\ \text{b} = 10^{-28}\ \text{m}^2 = (10^{-4}\ \text{Å})^2 = (10\ \text{F})^2$
Electron volt: $1\ \text{eV} = 1.602\,189(5) \times 10^{-19}$ joule


Useful orders of magnitude

Electron rest energy: $m_e c^2 \simeq 0.5$ MeV [$0.511\,003(1) \times 10^{6}$ eV]
Proton rest energy: $M_p c^2 \simeq 1\,000$ MeV [$938.280(3) \times 10^{6}$ eV]
Neutron rest energy: $M_n c^2 \simeq 1\,000$ MeV [$939.573(3) \times 10^{6}$ eV]

One electron volt corresponds to:
a frequency $\nu \simeq 2.4 \times 10^{14}$ Hz through the relation $E = h\nu$ [$2.417\,970(7) \times 10^{14}$ Hz]
a wavelength $\lambda \simeq 12\,000$ Å through the relation $\lambda = c/\nu$ [$12\,398.52(4)$ Å]
a wavenumber $1/\lambda \simeq 8\,000\ \text{cm}^{-1}$ [$8\,065.48(2)\ \text{cm}^{-1}$]
a temperature $T \simeq 12\,000$ K through the relation $E = k_B T$ [$11\,604.5(4)$ K]

In a 1 gauss magnetic field ($10^{-4}$ tesla):
the electron cyclotron frequency $\nu_c = \omega_c/2\pi = -qB/2\pi m_e$ is $\nu_c \simeq 2.8$ MHz [$2.799\,225(8) \times 10^{6}$ Hz]
the orbital Larmor frequency $\nu_L = \omega_L/2\pi = -\mu_B B/h = \nu_c/2$ is $\nu_L \simeq 1.4$ MHz [$1.399\,612(4) \times 10^{6}$ Hz]
(this corresponds by definition to a $g = 1$ Landé factor)
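
The orders of magnitude listed above can be rechecked numerically. The short Python sketch below is an added illustration, not part of the original tables: it assumes the values of $h$, $q$, $c$, $k_B$ and $m_e$ as printed in this text (not current recommended values), uses the magnitude of the electron charge, and all variable names are ours.

```python
import math

# Constants as printed in this text's tables (SI units); assumed, not current values.
H = 6.62618e-34        # Planck's constant, joule second
Q = 1.602189e-19       # magnitude of the electron charge, coulomb
C = 2.99792458e8       # speed of light in vacuum, m/s
K_B = 1.38066e-23      # Boltzmann's constant, joule/K
M_E = 9.10953e-31      # electron mass, kg

EV = Q                 # one electron volt expressed in joules

frequency_hz = EV / H                          # E = h nu
wavelength_m = C / frequency_hz                # lambda = c / nu
wavelength_angstrom = wavelength_m * 1e10
wavenumber_per_cm = 1.0 / (wavelength_m * 100.0)
temperature_k = EV / K_B                       # E = k_B T

B_TESLA = 1e-4                                 # 1 gauss
cyclotron_hz = Q * B_TESLA / (2 * math.pi * M_E)
larmor_hz = cyclotron_hz / 2                   # g = 1 Lande factor

print(f"frequency   : {frequency_hz:.6e} Hz")
print(f"wavelength  : {wavelength_angstrom:.2f} angstrom")
print(f"wavenumber  : {wavenumber_per_cm:.2f} cm^-1")
print(f"temperature : {temperature_k:.1f} K")
print(f"cyclotron   : {cyclotron_hz:.6e} Hz")
print(f"Larmor      : {larmor_hz:.6e} Hz")
```

Because the inputs are themselves rounded, the results should agree with the bracketed values only to within about one part in $10^5$.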





Some general physical constants

Planck's constant: $h = 6.626\,18(4) \times 10^{-34}$ joule second
$\hbar = h/2\pi = 1.054\,589(6) \times 10^{-34}$ joule second
Speed of light (in vacuum): $c = 2.997\,924\,58(1) \times 10^{8}$ m/s
Electron charge: $q = -1.602\,189(5) \times 10^{-19}$ coulomb
Electron mass: $m_e = 9.109\,53(5) \times 10^{-31}$ kg
Proton mass: $M_p = 1.672\,65(1) \times 10^{-27}$ kg
Neutron mass: $M_n = 1.674\,95(1) \times 10^{-27}$ kg
$M_p/m_e = 1\,836.1515(7)$

Electron Compton wavelength: $\lambda_c = h/m_e c = 2.426\,309(4) \times 10^{-2}$ Å
$\bar{\lambda}_c = \hbar/m_e c = 3.861\,591(7) \times 10^{-3}$ Å
Fine structure constant (dimensionless): $\alpha = \dfrac{q^2}{4\pi\varepsilon_0 \hbar c} = \dfrac{e^2}{\hbar c} = \dfrac{1}{137.0360(1)}$
Bohr radius: $a_0 = \bar{\lambda}_c/\alpha = 0.529\,177\,1(5)$ Å
Hydrogen atom ionization energy (without proton recoil effect): $-E_{I_\infty} = \alpha^2 m_e c^2/2 = 13.605\,80(5)$ eV
Rydberg's constant: $R_\infty = -E_{I_\infty}/hc = 1.097\,373\,18(8) \times 10^{5}\ \text{cm}^{-1}$
"Classical" electron radius: $r_e = \dfrac{q^2}{4\pi\varepsilon_0 m_e c^2} = 2.817\,938(7)$ fermi
Bohr magneton: $\mu_B = q\hbar/2m_e = -9.274\,08(4) \times 10^{-24}$ joule/tesla
Electron spin g factor: $g_e = 2 \times 1.001\,159\,657(4)$
Nuclear magneton: $\mu_n = -q\hbar/2M_p = 5.050\,82(2) \times 10^{-27}$ joule/tesla
Boltzmann's constant: $k_B = 1.380\,66(4) \times 10^{-23}$ joule/K
Avogadro's number: $N_A = 6.022\,05(3) \times 10^{23}$


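Useful Identities

$U$: scalar field; $\mathbf{A}, \mathbf{B}, \ldots$: vector fields.

$\nabla \times (\nabla U) = 0$
$\nabla \cdot (\nabla U) = \Delta U$
$\nabla \cdot (\nabla \times \mathbf{A}) = 0$
$\nabla \times (\nabla \times \mathbf{A}) = \nabla(\nabla \cdot \mathbf{A}) - \Delta \mathbf{A}$

$\mathbf{L} = \dfrac{\hbar}{i}\, \mathbf{r} \times \nabla$
$\nabla = \dfrac{\mathbf{r}}{r} \dfrac{\partial}{\partial r} - \dfrac{i}{\hbar r^2}\, \mathbf{r} \times \mathbf{L}$
$\Delta = \dfrac{1}{r} \dfrac{\partial^2}{\partial r^2}\, r - \dfrac{\mathbf{L}^2}{\hbar^2 r^2}$

$\mathbf{A} \times (\mathbf{B} \times \mathbf{C}) = (\mathbf{A} \cdot \mathbf{C})\mathbf{B} - (\mathbf{A} \cdot \mathbf{B})\mathbf{C}$
$\mathbf{A} \times (\mathbf{B} \times \mathbf{C}) + \mathbf{B} \times (\mathbf{C} \times \mathbf{A}) + \mathbf{C} \times (\mathbf{A} \times \mathbf{B}) = 0$
$(\mathbf{A} \times \mathbf{B}) \cdot (\mathbf{C} \times \mathbf{D}) = (\mathbf{A} \cdot \mathbf{C})(\mathbf{B} \cdot \mathbf{D}) - (\mathbf{A} \cdot \mathbf{D})(\mathbf{B} \cdot \mathbf{C})$
$(\mathbf{A} \times \mathbf{B}) \times (\mathbf{C} \times \mathbf{D}) = [(\mathbf{A} \times \mathbf{B}) \cdot \mathbf{D}]\mathbf{C} - [(\mathbf{A} \times \mathbf{B}) \cdot \mathbf{C}]\mathbf{D} = [(\mathbf{C} \times \mathbf{D}) \cdot \mathbf{A}]\mathbf{B} - [(\mathbf{C} \times \mathbf{D}) \cdot \mathbf{B}]\mathbf{A}$

$\nabla(UV) = U\,\nabla V + V\,\nabla U$
$\Delta(UV) = U\,\Delta V + 2(\nabla U) \cdot (\nabla V) + V\,\Delta U$
$\nabla \cdot (U\mathbf{A}) = U\,\nabla \cdot \mathbf{A} + \mathbf{A} \cdot \nabla U$
$\nabla \times (U\mathbf{A}) = U\,\nabla \times \mathbf{A} + (\nabla U) \times \mathbf{A}$
$\nabla \cdot (\mathbf{A} \times \mathbf{B}) = \mathbf{B} \cdot (\nabla \times \mathbf{A}) - \mathbf{A} \cdot (\nabla \times \mathbf{B})$
$\nabla(\mathbf{A} \cdot \mathbf{B}) = \mathbf{A} \times (\nabla \times \mathbf{B}) + \mathbf{B} \times (\nabla \times \mathbf{A}) + \mathbf{B} \cdot \nabla\, \mathbf{A} + \mathbf{A} \cdot \nabla\, \mathbf{B}$
$\nabla \times (\mathbf{A} \times \mathbf{B}) = \mathbf{A}(\nabla \cdot \mathbf{B}) - \mathbf{B}(\nabla \cdot \mathbf{A}) + \mathbf{B} \cdot \nabla\, \mathbf{A} - \mathbf{A} \cdot \nabla\, \mathbf{B}$

N.B.: $\mathbf{B} \cdot \nabla\, \mathbf{A}$ is the vector field whose components are
$(\mathbf{B} \cdot \nabla\, \mathbf{A})_i = \mathbf{B} \cdot \nabla A_i = \sum_j B_j \dfrac{\partial A_i}{\partial x_j}$ $(i = x, y, z)$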
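
The purely algebraic identities in this list (those without derivatives) can be spot-checked on random vectors. The following Python sketch is an added illustration; the helper names `dot`, `cross`, `add`, `sub`, `scale` and `close` are ours, and a numerical check on one random sample is of course not a proof.

```python
import random

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def add(a, b):
    return tuple(x + y for x, y in zip(a, b))

def sub(a, b):
    return tuple(x - y for x, y in zip(a, b))

def scale(s, a):
    return tuple(s * x for x in a)

def close(u, v, tol=1e-12):
    return all(abs(x - y) < tol for x, y in zip(u, v))

rng = random.Random(0)  # fixed seed so the check is reproducible
A, B, C, D = (tuple(rng.uniform(-1, 1) for _ in range(3)) for _ in range(4))

# A x (B x C) = (A.C) B - (A.B) C
triple_lhs = cross(A, cross(B, C))
triple_rhs = sub(scale(dot(A, C), B), scale(dot(A, B), C))

# A x (B x C) + B x (C x A) + C x (A x B) = 0
jacobi_sum = add(add(cross(A, cross(B, C)), cross(B, cross(C, A))),
                 cross(C, cross(A, B)))

# (A x B).(C x D) = (A.C)(B.D) - (A.D)(B.C)
scalar_lhs = dot(cross(A, B), cross(C, D))
scalar_rhs = dot(A, C) * dot(B, D) - dot(A, D) * dot(B, C)

# (A x B) x (C x D), written in its two equivalent forms
quad_lhs = cross(cross(A, B), cross(C, D))
quad_rhs1 = sub(scale(dot(cross(A, B), D), C), scale(dot(cross(A, B), C), D))
quad_rhs2 = sub(scale(dot(cross(C, D), A), B), scale(dot(cross(C, D), B), A))

print(close(triple_lhs, triple_rhs), close(jacobi_sum, (0.0, 0.0, 0.0)))
print(abs(scalar_lhs - scalar_rhs) < 1e-12)
print(close(quad_lhs, quad_rhs1), close(quad_lhs, quad_rhs2))
```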


Coordinate systems

[Table giving the components of $\nabla \times \mathbf{A}$ in the different coordinate systems; the formulas are illegible in this copy.]




Introduction 


Structure and level of this text 


It is hardly necessary to emphasize the importance of quantum mechanics 
in modern physics and chemistry. Current university programs naturally reflect 
this importance. In French universities, for example, an essentially qualitative 
introduction to fundamental quantum mechanical ideas is given in the second year. 
In the final year of the undergraduate physics program, basic quantum mechanics 
and its most important applications are studied in detail. 

This book is the direct result of several years of teaching quantum mechanics 
in the final year of the undergraduate program, first in two parallel courses at the 
Faculté des Sciences in Paris and then at the Universités Paris VI and Paris VII. 
We felt it to be important to mark a clear separation, in the structure of this book, 
between the two different but complementary aspects (lectures and recitations) of the 
courses given during this time. This is why we have divided this text into two 
distinct parts (see the “Directions for Use” at the beginning of the book). 
On the one hand, the chapters are based on the lectures given in the two courses, 
which we compared, discussed and expanded before writing the final version. 
On the other hand, the “complements” grew out of the recitations, exercises and 
problems given to the students, and reports that some of them prepared. Ideas 
also came from other courses given under other circumstances or at other levels 
(particularly in the graduate programs). As we pointed out in the “Directions for 
Use”, the chapters as a whole constitute, more or less, a course we would 
envisage teaching to fourth-year college students or those whose level is equivalent. 
However, the complements are not intended to be treated in a single year. The 
reader, teacher or student, must choose between them in accordance with his 
interests, tastes and goals. 

Throughout the writing of this book, our constant concern has been to 
address ourselves to students majoring in physics, like those we have taught 
over the past several years. Except in a few complements, we have not overstepped 
those limits. In addition, we have endeavored to take into account what we 
have seen of students’ difficulties in understanding and assimilating quantum 
mechanics, as well as their questions. We hope, of course, that this book will 
also be of use to other readers such as graduate students, beginning research 
workers and secondary school teachers. 

The reader is not required to be familiar with quantum physics: few of our 
students were. However, we do think that the quantum mechanics course we propose 
(see “General approach”, below) should be supplemented by a more descriptive and 
more experimentally oriented course, in atomic physics for example. 


General approach 

We feel that familiarity with quantum mechanics can best be acquired by 
using it to solve specific problems. We therefore introduce the postulates of quantum 
mechanics very early (in chapter III), so as to be able to apply them in the rest of 
the book. Our teaching experience has shown it to be preferable to introduce all 
the postulates together in the beginning rather than presenting them in several 
stages. Similarly, we have chosen to use state spaces and Dirac notation from the 
very beginning. This avoids the useless repetition which results from presenting 
the more general bra-ket formalism only after having developed wave mechanics 
uniquely in terms of wave functions. In addition, a belated change in the notation 
runs the risk of confusing the student, and casting doubts on concepts which he 
has only just acquired and not yet completely assimilated. 

After a chapter of qualitative introduction to quantum mechanical ideas, 
which uses simple optical analogies to familiarize the reader with these new concepts, 
we present, in a systematic fashion, the mathematical tools (chapter II) and 
postulates of quantum mechanics as well as a discussion of their physical content 
(chapter III). This enables the reader, from the beginning, to have an overall view 
of the physical consequences of the new postulates. Starting with the complements 
of chapter III we take up applications, beginning with the simplest ones (two-level 
systems, the harmonic oscillator, etc.) and becoming gradually more complicated 
(the hydrogen atom, approximation methods, etc.). Our intention is to provide 
illustrations of quantum mechanics by taking many examples from different fields 
such as atomic physics, molecular physics and solid state physics. In these examples 
we concentrate on the quantum mechanical aspect of the phenomena, often 
neglecting specific details which are treated in more specialized texts. Whenever 
possible, the quantum mechanical results are compared with the classical ones in 
order to help the reader develop his intuition concerning quantum mechanical 
effects. 

This essentially deductive viewpoint has led us to avoid stressing the historical 
introduction of quantum mechanical ideas, that is, the presentation and discussion 
of experimental facts which force us to reject the classical ideas. We have thus had 
to forego the inductive approach, which is nevertheless needed if physics is to be 
faithfully portrayed as a science in continual evolution, provoked by constant 
confrontation with experimental facts. Such an approach seems to us to be better 
suited to an atomic physics text or an introductory quantum physics course on a 
more elementary level. 

Similarly, we have deliberately avoided any discussion of the philosophical 
implications of quantum mechanics and of alternative interpretations that have 
been proposed. Such discussions, while very interesting (see section 5 of the 
bibliography), seem to us to belong on another level. We feel that these questions can be 
fruitfully considered only after one has mastered the “orthodox” quantum theory 
whose impressive successes in all fields of physics and chemistry compelled its 
acceptance. 

The teaching experiences out of which this text grew were group efforts, 
pursued over several years. We want to thank all the members of the various 
groups and particularly, Jacques Dupont-Roc and Serge Haroche, for their friendly 
collaboration, for the fruitful discussions we have had in our weekly meetings and 
for the ideas for problems and exercises which they have suggested. Without their 
enthusiasm and their valuable help, we would never have been able to undertake 
and carry out the writing of this book. 

Nor can we forget what we owe to the physicists who introduced us to 
research, Alfred Kastler and Jean Brossel for two of us and Maurice Levy for 
the third. It was in the context of their laboratories that we discovered the 
beauty and power of quantum mechanics. Neither have we forgotten the importance 
to us of the modern physics taught at the C.E.A. by Albert Messiah, Claude Bloch 
and Anatole Abragam, at a time when graduate studies were not yet incorporated 
into French university programs. 

We wish to express our gratitude to Ms. Aucher, Baudrit, Boy, Brodschi, 
Emo, Heyvaerts, Lemirre, Touzeau for preparation of the manuscript. 


Foreword 


This book is essentially a translation of the French edition which appeared at the end 
of 1973. 

The text has undergone a certain number of modifications. The most important one is an 
addition of a detailed bibliography, with suggestions concerning its use appearing at the end of 
each chapter or complement. 

This book was originally conceived for French students finishing their undergraduate 
studies or beginning their research work. It seems to us however that the structure of this book 
(separation into chapters and complements — see the “Directions for use”) should make it 
suitable for other groups of readers. For example, for an undergraduate elementary Quantum 
Mechanics course, we would recommend using the most important chapters with their simplest 
complements. For a more advanced course, one could add the remaining chapters and use more 
difficult complements. Finally, it is hoped that some of the more advanced complements will 
help students in the transition from a regular Quantum Mechanics course to current research 
topics in various fields of Physics. 

We wish to thank Nicole and Dan Ostrowsky, as well as Susan Hemley, for the care and 
enthusiasm which they brought to this translation. Their remarks often led to an improvement of 
the original text. In addition, we are grateful to Mrs. Audoin and Mrs. Mathieu for their aid in 
organizing the bibliography. 


C. Cohen-Tannoudji 
B. Diu 
F. Laloë 


Waves and particles. 
Introduction to the fundamental ideas 
of quantum mechanics 

a. Existence of stationary states 

b. Superposition of stationary states 

2. One-dimensional “square” potentials. Qualitative study 

a. Physical meaning of a square potential 

b. Optical analogy 

c. Examples 


CHAPTER I WAVES AND PARTICLES 


In the present state of scientific knowledge, quantum mechanics plays a fundamental 
role in the description and understanding of natural phenomena. In fact, 
phenomena which occur on a very small (atomic or subatomic) scale cannot be 
explained outside the framework of quantum physics. For example, the existence and 
the properties of atoms, the chemical bond and the propagation of an electron in 
a crystal cannot be understood in terms of classical mechanics. Even when we are 
concerned only with macroscopic physical objects (that is, whose dimensions are 
comparable to those encountered in everyday life), it is necessary, in principle, to 
begin by studying the behavior of their various constituent atoms, ions, electrons, 
in order to arrive at a complete scientific description. There are many 
phenomena which reveal, on a macroscopic scale, the quantum behavior of 
nature. It is in this sense that it can be said that quantum mechanics is the basis of 
our present understanding of all natural phenomena, including those traditionally 
treated in chemistry, biology, etc. 

From a historical point of view, quantum ideas contributed to a remarkable 
unification of the concepts of fundamental physics by treating material particles 
and radiation on the same footing. At the end of the nineteenth century, people 
distinguished between two entities in physical phenomena: matter and radiation. 
Completely different laws were used for each one. To predict the motion of material 
bodies, the laws of Newtonian mechanics (cf. appendix III) were utilized. Their success, 
though of long standing, was none the less impressive. With regard to radiation, 
the theory of electromagnetism, thanks to the introduction of Maxwell’s equations, 
had produced a unified interpretation of a set of phenomena which had previously 
been considered as belonging to different domains: electricity, magnetism and 
optics. In particular, the electromagnetic theory of radiation had been spectacularly 
confirmed experimentally by the discovery of Hertzian waves. Finally, interactions 
between radiation and matter were well explained by the Lorentz force. This set 
of laws had brought physics to a point which could be considered satisfactory, in 
view of the experimental data at the time. 

However, at the beginning of the twentieth century, physics was to be 
marked by the profound upheaval that led to the introduction of relativistic 
mechanics and quantum mechanics. The relativistic “revolution” and the quantum 
“revolution” were, to a large extent, independent, since they challenged classical 
physics on different points. Classical laws cease to be valid for material bodies 
travelling at very high speeds, comparable to that of light (relativistic domain). 
In addition, they are also found to be wanting on an atomic or subatomic scale 
(quantum domain). However, it is important to note that classical physics, in both 
cases, can be seen as an approximation of the new theories, an approximation 
which is valid for most phenomena on an everyday scale. For example, Newtonian 
mechanics enables us to predict correctly the motion of a solid body, providing 
it is non-relativistic (speeds much smaller than that of light) and macroscopic 
(dimensions much greater than atomic ones). Nevertheless, from a fundamental 
point of view, quantum theory remains indispensable. It is the only theory which 
enables us to understand the very existence of a solid body and the values of the 
macroscopic parameters (density, specific heat, elasticity, etc.) associated with it. 
At the present time, we do not yet have at our disposal a fully satisfactory theory 
unifying quantum and relativistic mechanics since difficulties have arisen in this 
domain. However, most atomic and molecular phenomena are well explained by 
the non-relativistic quantum mechanics that we intend to examine here. 

This chapter is an introduction to quantum ideas and “vocabulary”. No 
attempt is made here to be rigorous or complete. The essential goal is to awaken the 
curiosity of the reader. Phenomena will be described which unsettle ideas as firmly 
anchored in our intuition as the concept of a trajectory. We want to render the 
quantum theory “plausible” for the reader by showing simply and qualitatively 
how it enables us to solve the problems which are encountered on an atomic scale. 
We shall later return to the various ideas introduced in this chapter and go into 
further detail, either from the point of view of the mathematical formalism (chap. II) 
or from the physical point of view (chap. III). 

In the first section (§ A), we introduce the basic quantum ideas (wave-particle 
duality, the measurement process), relying on well-known optical experiments. 
Then we show (§ B) how these ideas can be extended to material particles 
(wave function, Schrödinger equation). We next study in more detail the characteristics 
of the “wave packet” associated with a particle, and we introduce the 
Heisenberg uncertainty relations (§ C). Finally, we discuss some simple cases of 
typical quantum effects (§ D). 


A. ELECTROMAGNETIC WAVES AND PHOTONS 

1. Light quanta and the Planck-Einstein relations 

Newton considered light to be a beam of particles, able, for example, to 
bounce back upon reflection from a mirror. During the first half of the nineteenth 
century, the wavelike nature of light was demonstrated (interference, diffraction). 
This later enabled optics to be integrated into electromagnetic theory. In this 
framework, the speed of light, c, is related to electric and magnetic constants and 
light polarization phenomena can be interpreted as manifestations of the vectorial 
character of the electric field. 

However, the study of blackbody radiation, which electromagnetic theory 
could not explain, led Planck to suggest the hypothesis of the quantization of 
energy (1900): for an electromagnetic wave of frequency $\nu$, the only possible 
energies are integral multiples of the quantum $h\nu$, where $h$ is a new fundamental 
constant. Generalizing this hypothesis, Einstein proposed a return to the particle 
theory (1905): light consists of a beam of photons, each possessing an energy $h\nu$. 
Einstein showed how the introduction of photons made it possible to understand, 
in a very simple way, certain as yet unexplained characteristics of the photoelectric 
effect. Twenty years had to elapse before the photon was actually shown to exist, 
as a distinct entity, by the Compton effect (1924). 

These results lead to the following conclusion: the interaction of an electromagnetic 
wave with matter occurs by means of elementary indivisible processes, 
in which the radiation appears to be composed of particles, the photons. Particle 
parameters (the energy $E$ and the momentum $\mathbf{p}$ of a photon) and wave parameters 
(the angular frequency $\omega = 2\pi\nu$ and the wave vector $\mathbf{k}$, where $|\mathbf{k}| = 2\pi/\lambda$, with $\nu$ 
the frequency and $\lambda$ the wavelength) are linked by the fundamental relations: 

$$E = h\nu = \hbar\omega, \qquad \mathbf{p} = \hbar\mathbf{k} \qquad \text{(Planck-Einstein relations)} \tag{A-1}$$

where $\hbar = h/2\pi$ is defined in terms of the Planck constant $h$: 

$$h \simeq 6.62 \times 10^{-34}\ \text{joule} \times \text{second} \tag{A-2}$$

During each elementary process, energy and total momentum must be conserved. 
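
As a numerical illustration of the Planck-Einstein relations (A-1), the Python sketch below computes the energy and the momentum of a single photon from its wavelength. It is an added example under stated assumptions: the 500 nm wavelength, the value used for $c$ and the function name `photon_parameters` are ours, and $h$ is rounded as in (A-2).

```python
import math

H = 6.62e-34              # Planck constant in joule second, rounded as in (A-2)
C = 2.998e8               # speed of light in m/s (assumed value, not given in this section)
HBAR = H / (2 * math.pi)

def photon_parameters(wavelength_m):
    """Return (energy in joules, momentum magnitude in kg m/s) for one photon."""
    nu = C / wavelength_m             # frequency
    omega = 2 * math.pi * nu          # angular frequency
    k = 2 * math.pi / wavelength_m    # magnitude of the wave vector
    energy = HBAR * omega             # E = hbar omega = h nu
    momentum = HBAR * k               # |p| = hbar |k| = h / lambda
    return energy, momentum

# Hypothetical example: a photon of visible light, wavelength 500 nm.
energy, momentum = photon_parameters(500e-9)
print(f"E = {energy:.3e} J, |p| = {momentum:.3e} kg m/s")
```

For a photon in vacuum the two results are tied together by $E = |\mathbf{p}|\,c$, which follows from $\omega = c\,|\mathbf{k}|$.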


2. Wave-particle duality 

Thus we have returned to a particle conception of light. Does this mean that 
we must abandon the wave theory? Certainly not. We shall see that typical wave 
phenomena such as interference and diffraction could not be explained in a purely 
particle framework. Analyzing Young’s well-known double-slit experiment will 
lead us to the following conclusion: a complete interpretation of the phenomena 
can be obtained only by conserving both the wave aspect and the particle aspect 
of light (although they seem a priori irreconcilable). We shall then show how this 
paradox can be resolved by the introduction of the fundamental quantum concepts. 


a. ANALYSIS OF YOUNG'S DOUBLE-SLIT EXPERIMENT 

The device used in this experiment is shown schematically in figure 1. The 
monochromatic light emitted by the source $\mathcal{S}$ falls on an opaque screen $\mathcal{P}$ pierced 
by two narrow slits $F_1$ and $F_2$, which illuminate the observation screen $\mathcal{E}$ (a photographic 
plate, for example). If we block $F_2$, we obtain on $\mathcal{E}$ a light intensity distribution 
$I_1(x)$ which is the diffraction pattern of $F_1$. In the same way, when $F_1$ is 
obstructed, the diffraction pattern of $F_2$ is described by $I_2(x)$. When the two 
slits $F_1$ and $F_2$ are open at the same time, we observe a system of interference 
fringes on the screen. In particular, we note that the corresponding intensity $I(x)$ is 
not the sum of the intensities produced by $F_1$ and $F_2$ separately: 

$$I(x) \neq I_1(x) + I_2(x) \tag{A-3}$$

How could one conceive of explaining, in terms of a particle theory (seen, in 
the preceding section, to be necessary), the experimental results just described? 
The existence of a diffraction pattern when only one of the two slits is open could, 
for example, be explained as being due to photon collisions with the edges of the 
slit. Such an explanation would, of course, have to be developed more precisely, 
and a more detailed study would show it to be insufficient. Instead, let us concentrate 
on the interference phenomenon. We could attempt to explain it by an 
interaction between the photons which pass through the slit $F_1$ and those which 
pass through the slit $F_2$. Such an explanation would lead to the following prediction: 
if the intensity of the source (the number of photons emitted per second) 
is diminished until the photons strike the screen practically one by one, the interaction 
between the photons must diminish and, eventually, vanish. The interference 
fringes should therefore disappear. 




FIGURE 1 

Diagram of Young’s double-slit light interference experiment (fig. a). Each of the slits $F_1$ and $F_2$ 
produces a diffraction pattern on the screen $\mathcal{E}$. The corresponding intensities are $I_1(x)$ and $I_2(x)$ 
(solid lines in figure b). When the two slits $F_1$ and $F_2$ are open simultaneously, the intensity $I(x)$ 
observed on the screen is not the sum $I_1(x) + I_2(x)$ (dashed lines in figures b and c), but shows oscillations 
due to the interference between the electric fields radiated by $F_1$ and $F_2$ (solid line in figure c). 


Before we indicate the answer given by experiment, recall that the wave 
theory provides a completely natural interpretation of the fringes. The light intensity 
at a point of the screen $\mathcal{E}$ is proportional to the square of the amplitude of the 
electric field at this point. If $E_1(x)$ and $E_2(x)$ represent, in complex notation, the 
electric fields produced at $x$ by slits $F_1$ and $F_2$ respectively (the slits behave like 
secondary sources), the total resultant field at this point when $F_1$ and $F_2$ are both 
open is*: 

$$E(x) = E_1(x) + E_2(x) \tag{A-4}$$

Using complex notation, we then have: 

$$I(x) \propto |E(x)|^2 = |E_1(x) + E_2(x)|^2 \tag{A-5}$$

Since the intensities $I_1(x)$ and $I_2(x)$ are proportional, respectively, to $|E_1(x)|^2$ 
and $|E_2(x)|^2$, formula (A-5) shows that $I(x)$ differs from $I_1(x) + I_2(x)$ by an 
interference term which depends on the phase difference between $E_1$ and $E_2$ and 
whose presence explains the fringes. The wave theory thus predicts that diminishing 
the intensity of the source will simply cause the fringes to diminish in intensity 
but not to vanish. 
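
The role of the interference term in formula (A-5) can be made concrete with a small numerical model. The Python sketch below is an added illustration, not the text's own calculation: it assumes two slits radiating fields of equal, constant amplitude (the single-slit diffraction envelope is ignored), a far-field small-angle phase difference, and hypothetical values for the wavelength, slit separation and screen distance. All names are ours.

```python
import cmath
import math

# Hypothetical geometry, chosen only for illustration (not taken from the text).
WAVELENGTH = 500e-9        # metres
SLIT_SEPARATION = 1e-4     # distance between F1 and F2, metres
SCREEN_DISTANCE = 1.0      # distance from the slits to the screen, metres

def slit_fields(x, amplitude=1.0):
    """Complex fields E1(x), E2(x) on the screen in a small-angle model."""
    half_phase = math.pi * SLIT_SEPARATION * x / (WAVELENGTH * SCREEN_DISTANCE)
    e1 = amplitude * cmath.exp(1j * half_phase)
    e2 = amplitude * cmath.exp(-1j * half_phase)
    return e1, e2

def intensities(x, amplitude=1.0):
    """Return (I1, I2, I, interference term), up to a common constant factor."""
    e1, e2 = slit_fields(x, amplitude)
    i1 = abs(e1) ** 2
    i2 = abs(e2) ** 2
    total = abs(e1 + e2) ** 2                    # formula (A-5)
    interference = 2 * (e1.conjugate() * e2).real
    return i1, i2, total, interference

FRINGE_SPACING = WAVELENGTH * SCREEN_DISTANCE / SLIT_SEPARATION

bright = intensities(0.0)                        # centre of a bright fringe
dark = intensities(FRINGE_SPACING / 2)           # centre of a dark fringe
dim_bright = intensities(0.0, amplitude=0.1)     # same point, weaker source
dim_dark = intensities(FRINGE_SPACING / 2, amplitude=0.1)

for label, (i1, i2, total, term) in [("bright", bright), ("dark", dark)]:
    print(f"{label}: I1 + I2 = {i1 + i2:.3f}, I = {total:.3f}, interference = {term:.3f}")
```

In this model $I = I_1 + I_2 + 2\,\mathrm{Re}(E_1^* E_2)$ at every point. Reducing the source amplitude rescales all intensities by the same factor, so the fringes become fainter but the dark fringes stay dark, as the wave theory predicts.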


* Since the experiment studied here is performed with unpolarized light, the vectorial character 
of the electric field does not play an essential role. For the sake of simplicity, we ignore it in this paragraph. 

What actually happens when $\mathcal{S}$ emits photons practically one by one? 
Neither the predictions of the wave theory nor those of the particle theory are verified. 
In fact: 

(i) If we cover the screen $\mathcal{E}$ with a photographic plate and increase the 
exposure time so as to capture a large number of photons on each photograph, we 
observe when we develop them that the fringes have not disappeared. Therefore, 
the purely corpuscular interpretation, according to which the fringes are due to an 
interaction between photons, must be rejected. 

(ii) On the other hand, we can expose the photographic plate during a time 
so short that it can only receive a few photons. We then observe that each photon 
produces a localized impact on $\mathcal{E}$ and not a very weak interference pattern. 
Therefore, the purely wave interpretation must also be rejected. 

In reality, as more and more photons strike the photographic plate, the 
following phenomenon occurs. Their individual impacts seem to be distributed in 
a random manner, and only when a great number of them have reached $\mathcal{E}$ does the 
distribution of the impacts begin to have a continuous aspect. The density of the 
impacts at each point of $\mathcal{E}$ corresponds to the interference fringes: maximum on 
a bright fringe and zero on a dark fringe. It can thus be said that the photons, as 
they arrive, build up the interference pattern. 
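
This gradual build-up can be imitated with a simple random simulation. The Python sketch below is an added illustration under explicit assumptions: each photon impact is drawn independently with a probability density proportional to a model intensity $\cos^2$ pattern (two equal slits, single-slit envelope ignored), the geometry is hypothetical, and rejection sampling is merely one convenient way to draw from such a density. All names are ours.

```python
import math
import random

# Hypothetical geometry (illustrative values, not taken from the text).
WAVELENGTH = 500e-9        # metres
SLIT_SEPARATION = 1e-4     # metres
SCREEN_DISTANCE = 1.0      # metres
HALF_WIDTH = 5e-3          # the screen is observed for -HALF_WIDTH <= x <= HALF_WIDTH
N_BINS = 20

def relative_intensity(x):
    """I(x) / I_max for two equal slits, ignoring the single-slit envelope."""
    half_phase = math.pi * SLIT_SEPARATION * x / (WAVELENGTH * SCREEN_DISTANCE)
    return math.cos(half_phase) ** 2

def sample_impacts(n_photons, rng):
    """Draw impact positions one photon at a time by rejection sampling."""
    impacts = []
    while len(impacts) < n_photons:
        x = rng.uniform(-HALF_WIDTH, HALF_WIDTH)
        if rng.random() < relative_intensity(x):
            impacts.append(x)
    return impacts

def histogram(impacts):
    """Count impacts in N_BINS equal bins covering the observed part of the screen."""
    counts = [0] * N_BINS
    for x in impacts:
        index = int((x + HALF_WIDTH) / (2 * HALF_WIDTH) * N_BINS)
        counts[min(index, N_BINS - 1)] += 1
    return counts

rng = random.Random(1)                 # fixed seed for reproducibility
few = sample_impacts(10, rng)          # short exposure: isolated, scattered impacts
many = sample_impacts(20000, rng)      # long exposure: the fringes emerge
counts = histogram(many)

print("first impacts (mm):", [round(x * 1e3, 2) for x in few])
for index, count in enumerate(counts):
    print(f"bin {index:2d}: " + "#" * (count // 50))
```

With only ten photons the impacts look random; with many, the bins next to a bright fringe (here around $x = 0$) fill up while those next to a dark fringe remain almost empty.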

The result of this experiment therefore leads, apparently, to a paradox. Within 
the framework of the particle theory, for example, it can be expressed in the 
following way. Since photon-photon interactions are excluded, each photon must 
be considered separately. But then it is not clear why the phenomena should change 
drastically according to whether only one slit or both slits are open. For a photon 
passing through one of the slits, why should the fact that the other is open or 
closed have such a critical importance ? 

Before we discuss this problem, note that in the preceding experiment we did 
not seek to determine through which slit each photon passed before it reached the 
screen. In order to obtain this information, we can imagine placing detectors 
(photomultipliers) behind $F_1$ and $F_2$. It will then be observed that, if the photons 
arrive one by one, each one passes through a well-determined slit (a signal is 
recorded either by the detector placed behind $F_1$ or by the one covering $F_2$ but not 
by both at once). But, obviously, the photons detected in this way are absorbed 
and do not reach the screen. Remove the photomultiplier which blocks $F_1$, for 
example. The one which remains behind $F_2$ tells us that, out of a large number 
of photons, about half pass through $F_2$. We conclude that the others (which can 
continue as far as the screen) pass through $F_1$. But the pattern that they gradually 
construct on the screen is not an interference pattern, since $F_2$ is blocked. It is only 
the diffraction pattern of $F_1$. 


b. QUANTUM UNIFICATION OF THE TWO ASPECTS OF LIGHT 

The preceding analysis shows that it is impossible to explain all the phenomena 
observed if only one of the two aspects of light, wave or particle, is considered. 
Now these two aspects seem to be mutually exclusive. To overcome this difficulty, 
it thus becomes indispensable to reconsider in a critical way the concepts of 
classical physics. We must accept the possibility that these concepts, although our 
everyday experience leads us to consider them well-founded, may not be valid 
in the new (“microscopic”) domain which we are entering. For example, an 
essential characteristic of this new domain appeared when we placed counters behind 
Young’s slits: when one performs a measurement on a microscopic system, one 
disturbs it in a fundamental fashion. This is a new property since, in the macroscopic 
domain, we always have the possibility of conceiving measurement devices whose 
influence on the system is practically as weak as one might wish. This critical 
revision of classical physics is imposed by experiment and must of course be 
guided by experiment. 

Let us reconsider the “paradox” stated above concerning the photon 
which passes through one slit but behaves differently depending on whether 
the other slit is open or closed. We saw that if we try to detect the photons 
when they cross the slits, we prevent them from reaching the screen. More 
generally, a detailed experimental analysis shows that it is impossible to observe 
the interference pattern and to know at the same time through which slit each 
photon has passed (cf. complement $D_I$). Thus it is necessary, in order to resolve 
the paradox, to give up the idea that a photon inevitably passes through a 
particular slit. We are then led to question the concept, which is a fundamental 
one of classical physics, of a particle’s trajectory. 

Moreover, as the photons arrive one by one, their impacts on the screen 
gradually build up the interference pattern. This implies that, for a particular 
photon, we are not certain in advance where it will strike the screen. Now these 
photons are all emitted under the same conditions. Thus another classical idea 
has been destroyed: that the initial conditions completely determine the subsequent 
motion of a particle. We can only say, when a photon is emitted, that 
the probability of its striking the screen at $x$ is proportional to the intensity $I(x)$ 
calculated using wave theory, that is, to $|E(x)|^2$. 

After many tentative efforts that we shall not describe here, the concept 
of wave-particle duality was formulated. We can summarize it schematically 
as follows*: 

(i) The particle and wave aspects of light are inseparable. Light behaves 
simultaneously like a wave and like a flux of particles, the wave enabling us 
to calculate the probability of the manifestation of a particle. 

(ii) Predictions about the behavior of a photon can only be probabilistic. 

(iii) The information about a photon at time $t$ is given by the wave 
$E(\mathbf{r}, t)$, which is a solution of Maxwell’s equations. We say that this wave 
characterizes the state of the photons at time $t$. $E(\mathbf{r}, t)$ is interpreted as the 
probability amplitude of a photon appearing, at time $t$, at the point $\mathbf{r}$. This means 
that the corresponding probability is proportional to $|E(\mathbf{r}, t)|^2$. 

* It is worth noting that this interpretation of physical phenomena, generally considered to be 
“orthodox” at the present time, is still being contested today by certain physicists. 

comments: 

(i) Since Maxwell’s equations are linear and homogeneous, we can use a 
superposition principle: if $E_1$ and $E_2$ are two solutions of these equations, 
then $E = \lambda_1 E_1 + \lambda_2 E_2$, where $\lambda_1$ and $\lambda_2$ are constants, is also a solution. 
It is this superposition principle which explains wave phenomena in classical 
optics (interference, diffraction). In quantum physics, the interpretation 
of $E(\mathbf{r}, t)$ as a probability amplitude is thus essential to the persistence 
of such phenomena. 

(ii) The theory merely allows one to calculate the probability of the occurence 
of a given event. Experimental verifications must thus be founded on the 
repetition of a large number of identical experiments. In the above 
experiment, a large number of photons, all produced in the same way, 
are emitted successively and build up the interference pattern, which is 
the manifestation of the calculated probabilities. 

(iii) We are talking here about "the photon state" so as to be able to develop
in § B an analogy between $E(\mathbf{r}, t)$ and the wave function $\psi(\mathbf{r}, t)$ which
characterizes the quantum state of a material particle. This "optical analogy"
is very fruitful. In particular, as we shall see in § D, it allows us to 
understand, simply and without recourse to calculation, various quantum 
properties of material particles. However, we should not push it too far 
and let it lead us to believe that it is rigorously correct to consider 
E(r, t) as characterizing the quantum state of a photon. 


Furthermore, we shall see that the fact that $\psi(\mathbf{r}, t)$ is complex is essential in
quantum mechanics, while the complex notation $E(\mathbf{r}, t)$ is used in optics purely for
convenience (only its real part has a physical meaning). The precise definition of the
(complex) quantum state of radiation can only be given in the framework of quantum
electrodynamics, a theory which is simultaneously quantum mechanical and relativistic.
We shall not consider these problems here (we shall touch on them in complement $K_V$).


3. The principle of spectral decomposition 

Armed with the ideas introduced in § 2, we are now going to discuss 
another simple optical experiment, whose subject is the polarization of light. 
This will permit us to introduce the fundamental concepts which concern the 
measurement of physical quantities. 

The experiment consists of directing a polarized plane monochromatic light
wave onto an analyzer A. Oz designates the direction of propagation of this
wave and $\mathbf{e}_p$ the unit vector describing its polarization (cf. fig. 2). The analyzer A
transmits light polarized parallel to Ox and absorbs light polarized parallel
to Oy.

The classical description of this experiment (a description which is valid
for a sufficiently intense light beam) is the following. The polarized plane wave
is characterized by an electric field of the form:

$$\mathbf{E}(\mathbf{r}, t) = E_0\, \mathbf{e}_p\, e^{i(kz - \omega t)} \tag{A-6}$$

where $E_0$ is a constant. The light intensity $I$ is proportional to $|E_0|^2$. After
its passage through the analyzer A, the plane wave is polarized along Ox:

$$\mathbf{E}'(\mathbf{r}, t) = E'_0\, \mathbf{e}_x\, e^{i(kz - \omega t)} \tag{A-7}$$

and its intensity $I'$, proportional to $|E'_0|^2$, is given by Malus' law:

$$I' = I \cos^2\theta \tag{A-8}$$

[$\mathbf{e}_x$ is the unit vector of the Ox axis and $\theta$ is the angle between $\mathbf{e}_x$ and $\mathbf{e}_p$].



FIGURE 2

A simple measurement experiment relating to the polarization of a light wave. A beam of light
propagates along the direction Oz and crosses successively the polarizer P and the analyzer A.
$\theta$ is the angle between Ox and the electric field of the wave transmitted by P. The vibrations
transmitted by A are parallel to Ox.


What will happen on the quantum level, that is, when $I$ is weak enough
for the photons to reach the analyzer one by one? (We then place a photon
detector behind this analyzer.) First of all, the detector never registers a "fraction
of a photon". Either the photon crosses the analyzer or it is entirely absorbed
by it. Next (except in special cases that we shall examine in a moment), we
cannot predict with certainty whether a given incident photon will pass or be
absorbed. We can only know the corresponding probabilities. Finally, if we
send out a large number N of photons one after the other, the result will
correspond to the classical law, in the sense that about $N \cos^2\theta$ photons will be
detected after the analyzer.
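
The one-by-one behavior described above can be imitated with a small Monte Carlo sketch. It assumes, as the text states, that each photon independently either passes (probability $\cos^2\theta$) or is absorbed; the function name, the seed and the chosen angle are illustrative and not part of the original.

```python
import math
import random

def count_transmitted(theta, n_photons, seed=0):
    """Send n_photons one by one onto the analyzer.

    Each photon is either fully transmitted or fully absorbed;
    it passes with probability cos^2(theta) (assumption taken from the text).
    """
    rng = random.Random(seed)
    p_pass = math.cos(theta) ** 2
    return sum(1 for _ in range(n_photons) if rng.random() < p_pass)

theta = math.radians(30.0)      # illustrative angle between e_x and e_p
n_photons = 100_000
detected = count_transmitted(theta, n_photons)
expected = n_photons * math.cos(theta) ** 2   # classical Malus value, N cos^2(theta)
print(detected, round(expected))
```

No single run reproduces $N\cos^2\theta$ exactly; the count fluctuates around it, and the relative fluctuation shrinks as N grows.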

We shall retain the following ideas from this description : 

(i) The measurement device (the analyzer, in this case) can give only certain 
privileged results, which we shall call eigen (or proper) results*. In the above 
experiment, there are only two possible results : the photon crosses the analyzer or 
it is stopped. One says that there is quantization of the result of the measurement, 
in contrast to the classical case [cf. formula (A-8)] where the transmitted
intensity $I'$ can vary continuously, according to the value of $\theta$, between 0 and $I$.


* The reason for this name will appear in chapter III.

(ii) To each of these eigen results corresponds an eigenstate. Here, the
two eigenstates are characterized by:

$$\mathbf{e}_p = \mathbf{e}_x \quad \text{or} \quad \mathbf{e}_p = \mathbf{e}_y \tag{A-9}$$

($\mathbf{e}_y$ is the unit vector of the Oy axis). If $\mathbf{e}_p = \mathbf{e}_x$, we know with certainty that
the photon will traverse the analyzer; if $\mathbf{e}_p = \mathbf{e}_y$, it will, on the contrary,
definitely be stopped. The correspondence between eigen results and eigenstates
is therefore the following. If the particle is, before the measurement, in one of
the eigenstates, the result of this measurement is certain: it can only be the
associated eigen result.

(iii) When the state before the measurement is arbitrary, only the probabilities
of obtaining the different eigen results can be predicted. To find these
probabilities, one decomposes the state of the particles into a linear combination
of the various eigenstates. Here, for an arbitrary $\mathbf{e}_p$, we write:

$$\mathbf{e}_p = \mathbf{e}_x \cos\theta + \mathbf{e}_y \sin\theta \tag{A-10}$$

The probability of obtaining a given eigen result is then proportional to the
square of the absolute value of the coefficient of the corresponding eigenstate.
The proportionality factor is determined by the condition that the sum of all
these probabilities must be equal to 1. We thus deduce from (A-10) that each
photon has a probability $\cos^2\theta$ of traversing the analyzer and a probability $\sin^2\theta$
of being absorbed by it (we know that $\cos^2\theta + \sin^2\theta = 1$). This is indeed
what was stated above. This rule is called in quantum mechanics the principle
of spectral decomposition. Note that the decomposition to be performed depends
on the type of measurement device being considered, since one must use the
eigenstates which correspond to it: in formula (A-10), the choice of the axes Ox
and Oy is fixed by the analyzer.

(iv) After passing through the analyzer, the light is completely polarized
along $\mathbf{e}_x$. If we place, after the first analyzer A, a second analyzer A', having
the same axis, all the photons which traversed A will also traverse A'. According
to what we have just seen in point (ii), this means that, after they have crossed A,
the state of the photons is the eigenstate characterized by $\mathbf{e}_x$. There has therefore
been an abrupt change in the state of the particles. Before the measurement,
this state was defined by a vector $\mathbf{E}(\mathbf{r}, t)$ which was collinear with $\mathbf{e}_p$. After
the measurement, we possess an additional piece of information (the photon
has passed) which is incorporated by describing the state by a different vector,
which is now collinear with $\mathbf{e}_x$. This expresses the fact, already pointed out
in § A-2, that the measurement disturbs the microscopic system (here, the photon)
in a fundamental fashion.


comment: 

The certain prediction of the result when $\mathbf{e}_p = \mathbf{e}_x$ or $\mathbf{e}_p = \mathbf{e}_y$ is only a special case.
The probability of one of the possible events is then indeed equal to 1. But, in order to
verify this prediction, one must perform a large number of experiments. One must be
sure that all the photons pass (or are stopped), since the fact that a particular photon
crosses the analyzer (or is absorbed) is not characteristic of $\mathbf{e}_p = \mathbf{e}_x$ (or $\mathbf{e}_p = \mathbf{e}_y$).

B. MATERIAL PARTICLES AND MATTER WAVES


1. The de Broglie relations 


Parallel to the discovery of photons, the study of atomic emission and 
absorption spectra uncovered a fundamental fact, which classical physics was 
unable to explain : these spectra are composed of narrow lines. In other words, 
a given atom emits or absorbs only photons having well-determined frequencies 
(that is, energies). This fact can be interpreted very easily if one accepts that 
the energy of the atom is quantized, that is, it can take on only certain discrete
values $E_i$ (i = 1, 2, ...); the emission or absorption of a photon is then
accompanied by a "jump" in the energy of the atom from one permitted
value $E_i$ to another $E_j$. Conservation of energy implies that the photon has
a frequency $\nu_{ij}$ such that:

$$h\nu_{ij} = |E_i - E_j| \tag{B-1}$$

Only frequencies which obey (B-1) can therefore be emitted or absorbed by
the atom.
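
As a rough numerical illustration of (B-1), the sketch below computes one emission frequency. The level scheme $E_n = -E_I/n^2$ for hydrogen is an assumption brought in for the example (it is not derived at this point of the text), and the constants are rounded values of h, of the electron volt and of the hydrogen ionization energy.

```python
H = 6.62618e-34        # Planck constant, joule second
EV = 1.602189e-19      # one electron volt in joules
E_I = 13.6             # hydrogen ionization energy in eV (rounded)

def level(n):
    """Assumed hydrogen level E_n = -E_I / n^2, in eV (illustrative model)."""
    return -E_I / n ** 2

def bohr_frequency(e_i, e_j):
    """Frequency nu_ij = |E_i - E_j| / h, with the energies given in eV."""
    return abs(e_i - e_j) * EV / H

nu_21 = bohr_frequency(level(2), level(1))
print(f"{nu_21:.3e} Hz")   # about 2.5e15 Hz, in the ultraviolet
```

The same function gives the same frequency for emission (2 to 1) and absorption (1 to 2), since only the absolute value of the energy difference enters (B-1).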

The existence of such discrete energy levels was confirmed independently 
by the Franck-Hertz experiment. Bohr interpreted this in terms of privileged 
electronic orbits and stated, with Sommerfeld, an empirical rule which permitted 
the calculation of these orbits for the case of the hydrogen atom. But the 
fundamental origin of these quantization rules remained mysterious. 

In 1923, however, de Broglie put forth the following hypothesis: material 
particles, just like photons, can have a wavelike aspect. He then derived the 
Bohr-Sommerfeld quantization rules as a consequence of this hypothesis, the 
various permitted energy levels appearing as analogues of the normal modes of a 
vibrating string. Electron diffraction experiments (Davisson and Germer, 1927) 
strikingly confirmed the existence of a wavelike aspect of matter by showing that 
interference patterns could be obtained with material particles such as electrons. 

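One therefore associates with a material particle of energy E and momentum $\mathbf{p}$
a wave whose angular frequency $\omega = 2\pi\nu$ and wave vector $\mathbf{k}$ are given by
the same relations as for photons (cf. § A-1):

$$E = h\nu = \hbar\omega, \qquad \mathbf{p} = \hbar\mathbf{k} \tag{B-2}$$

In other words, the corresponding wavelength is:

$$\lambda = \frac{2\pi}{|\mathbf{k}|} = \frac{h}{|\mathbf{p}|} \quad \text{(de Broglie relation)} \tag{B-3}$$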
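
To get a feeling for the orders of magnitude implied by (B-3), here is a minimal sketch. The two cases (an electron of kinetic energy 100 eV and a dust grain of mass $10^{-15}$ kg moving at 1 mm/s) are illustrative choices, and the momentum is taken to be non-relativistic.

```python
import math

H = 6.62618e-34        # Planck constant, joule second
M_E = 9.10953e-31      # electron mass, kg
EV = 1.602189e-19      # one electron volt in joules

def de_broglie_wavelength(momentum):
    """lambda = h / |p|, relation (B-3)."""
    return H / abs(momentum)

# Electron with kinetic energy 100 eV: p = sqrt(2 m E), non-relativistic
p_electron = math.sqrt(2 * M_E * 100 * EV)
lambda_electron = de_broglie_wavelength(p_electron)

# Hypothetical dust grain: mass 1e-15 kg, speed 1e-3 m/s
lambda_dust = de_broglie_wavelength(1e-15 * 1e-3)

print(f"electron: {lambda_electron:.3e} m, dust grain: {lambda_dust:.3e} m")
```

The electron wavelength comes out close to 1 Å, comparable to interatomic distances in a crystal, while the dust grain wavelength is many orders of magnitude smaller than the grain itself.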


comment : 

The very small value of the Planck constant h explains why the wavelike nature
of matter is very difficult to demonstrate on a macroscopic scale. Complement $A_I$ of this
chapter discusses the orders of magnitude of the de Broglie wavelengths associated with
various material particles.

2. Wave functions. Schrödinger equation

In accordance with de Broglie’s hypothesis, we shall apply the ideas 
introduced in § A for the case of the photon to all material particles. Recalling 
the conclusions of this paragraph, we are led to the following formulation : 

(i) For the classical concept of a trajectory, we must substitute the concept
of a time-varying state. The quantum state of a particle such as the electron* is
characterized by a wave function $\psi(\mathbf{r}, t)$, which contains all the information it is
possible to obtain about the particle.

(ii) $\psi(\mathbf{r}, t)$ is interpreted as a probability amplitude of the particle's presence.
Since the possible positions of the particle form a continuum, the probability
$d\mathcal{P}(\mathbf{r}, t)$ of the particle being, at time t, in a volume element $d^3r = dx\,dy\,dz$
situated at the point $\mathbf{r}$ must be proportional to $d^3r$ and therefore infinitesimal.
$|\psi(\mathbf{r}, t)|^2$ is then interpreted as the corresponding probability density, with:

$$d\mathcal{P}(\mathbf{r}, t) = C\, |\psi(\mathbf{r}, t)|^2\, d^3r \tag{B-4}$$

where C is a normalization constant [see comment (i) at the end of § B-2].

(iii) The principle of spectral decomposition applies to the measurement
of an arbitrary physical quantity:

- The result found must belong to a set of eigen results {a}.

- With each eigenvalue a is associated an eigenstate, that is, an eigenfunction
$\psi_a(\mathbf{r})$. This function is such that, if $\psi(\mathbf{r}, t_0) = \psi_a(\mathbf{r})$ (where $t_0$ is the
time at which the measurement is performed), the measurement will always
yield a.

- For any $\psi(\mathbf{r}, t)$, the probability $\mathcal{P}_a$ of finding the eigenvalue a for a
measurement at time $t_0$ is found by decomposing $\psi(\mathbf{r}, t_0)$ in terms of the functions
$\psi_a(\mathbf{r})$:

$$\psi(\mathbf{r}, t_0) = \sum_a c_a\, \psi_a(\mathbf{r}) \tag{B-5}$$

Then:

$$\mathcal{P}_a = \frac{|c_a|^2}{\sum_{a'} |c_{a'}|^2} \tag{B-6}$$

(the presence of the denominator ensures that the total probability is equal to 1:
$\sum_a \mathcal{P}_a = 1$).

- If the measurement indeed yields a, the wave function of the particle
immediately after the measurement is:

$$\psi'(\mathbf{r}, t_0) = \psi_a(\mathbf{r}) \tag{B-7}$$

(iv) The equation describing the evolution of the function $\psi(\mathbf{r}, t)$ remains
to be written. It is possible to introduce it in a very natural way, using the
Planck and de Broglie relations. Nevertheless, we have no intention of proving
this fundamental equation, which is called the Schrödinger equation. We shall
simply assume it. Later, we shall discuss some of its consequences (whose
experimental verification will prove its validity). Besides, we shall consider this
equation in much more detail in chapter III.

* We shall not take into account here the existence of electron spin (cf. chap. IX).
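
The probability rule of point (iii), in which each eigen result is weighted by the squared modulus of its coefficient and the sum is normalized to 1, can be sketched for a discrete set of eigenvalues. The eigenvalue labels and the coefficients below are hypothetical and chosen only to show the normalization at work.

```python
def eigen_probabilities(coefficients):
    """Return P_a = |c_a|^2 / sum_a' |c_a'|^2 for a dict {eigenvalue: coefficient}."""
    norm = sum(abs(c) ** 2 for c in coefficients.values())
    return {a: abs(c) ** 2 / norm for a, c in coefficients.items()}

# Hypothetical, unnormalized decomposition over three eigenfunctions
c = {"a1": 1.0, "a2": 1j, "a3": 1.0 + 1.0j}
probs = eigen_probabilities(c)
print(probs)   # the complex phases of the coefficients do not affect the probabilities
```

Dividing by the sum of the squared moduli is what allows unnormalized coefficients to be used directly.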

When the particle (of mass m) is subjected to the influence of a potential*
$V(\mathbf{r}, t)$, the Schrödinger equation takes on the form:

$$i\hbar \frac{\partial}{\partial t} \psi(\mathbf{r}, t) = -\frac{\hbar^2}{2m} \Delta\psi(\mathbf{r}, t) + V(\mathbf{r}, t)\, \psi(\mathbf{r}, t) \tag{B-8}$$

where $\Delta$ is the Laplacian operator $\partial^2/\partial x^2 + \partial^2/\partial y^2 + \partial^2/\partial z^2$.

We notice immediately that this equation is linear and homogeneous in $\psi$.
Consequently, for material particles, there exists a superposition principle which,
combined with the interpretation of $\psi$ as a probability amplitude, is the source
of wavelike effects. Note, moreover, that the differential equation (B-8) is first-order
with respect to time. This condition is necessary if the state of the particle
at a time $t_0$, characterized by $\psi(\mathbf{r}, t_0)$, is to determine its subsequent state.

Thus there exists a fundamental analogy between matter and radiation : 
in both cases, a correct description of the phenomena necessitates the introduction 
of quantum concepts, and, in particular, the idea of wave-particle duality. 


comments : 


(i) For a system composed of only one particle, the total probability of
finding the particle anywhere in space, at time t, is equal to 1:

$$\int d\mathcal{P}(\mathbf{r}, t) = 1 \tag{B-9}$$

Since $d\mathcal{P}(\mathbf{r}, t)$ is given by formula (B-4), we conclude that the wave function
$\psi(\mathbf{r}, t)$ must be square-integrable:

$$\int |\psi(\mathbf{r}, t)|^2\, d^3r \ \text{ is finite} \tag{B-10}$$

The normalization constant C which appears in (B-4) is then given by the
relation:

$$\frac{1}{C} = \int |\psi(\mathbf{r}, t)|^2\, d^3r \tag{B-11}$$

(we shall later see that the form of the Schrödinger equation implies that C
is time-independent). One often uses wave functions which are normalized,
such that:

$$\int |\psi(\mathbf{r}, t)|^2\, d^3r = 1 \tag{B-12}$$

The constant C is then equal to 1.

* $V(\mathbf{r}, t)$ designates a potential energy here. For example, it may be the product of an electric
potential and the particle's charge. In quantum mechanics, $V(\mathbf{r}, t)$ is commonly called a potential.

(ii) Note the important difference between the concepts of classical states and
quantum states. The classical state of a particle is determined at time t by
the specification of six parameters characterizing its position and its velocity
at time t: $x, y, z$; $v_x, v_y, v_z$. The quantum state of a particle is determined by an
infinite number of parameters: the values at the various points in space of the
wave function $\psi(\mathbf{r}, t)$ which is associated with it. For the classical idea of a
trajectory (the succession in time of the various states of the classical particle),
we must substitute the idea of the propagation of the wave associated with
the particle. Consider, for example, Young’s double-slit experiment, previously 
described for the case of photons, but which in principle can also be performed 
with material particles such as electrons. When the interference pattern is 
observed, it makes no sense to ask through which slit each particle has passed, 
since the wave associated with it passed through both. 

(iii) It is worth noting that, unlike photons, which can be emitted or absorbed
during an experiment, material particles can neither be created nor destroyed. 
The electrons emitted by a heated filament for example already existed in the 
filament. In the same way, an electron absorbed by a counter does not 
disappear; it becomes part of an atom or an electric current. Actually, the 
theory of relativity shows that it is possible to create and annihilate material 
particles: for example, a photon having sufficient energy, passing near an 
atom, can materialize into an electron-positron pair. Inversely, the positron, 
when it collides with an electron, annihilates with it, emitting photons. 
However, we pointed out in the beginning of this chapter that we would 
limit ourselves here to the non-relativistic quantum domain, and we have 
indeed treated time and space coordinates asymmetrically. In the framework 
of non-relativistic quantum mechanics, material particles can neither be created 
nor annihilated. This conservation law, as we shall see, plays a role of primary 
importance. The need to abandon it is one of the important difficulties encountered
when one tries to construct a relativistic quantum mechanics.
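
Comment (i) above, on the normalization constant C, can be illustrated numerically. The sketch below is restricted to one dimension for simplicity (the text works in three dimensions), uses an arbitrary Gaussian as the unnormalized wave function, and estimates the integral with a midpoint rule; all names are illustrative.

```python
import math

def normalization_constant(psi, x_min, x_max, n=20000):
    """Return C such that 1/C = integral of |psi(x)|^2 dx (midpoint rule, 1D)."""
    dx = (x_max - x_min) / n
    integral = sum(abs(psi(x_min + (i + 0.5) * dx)) ** 2 for i in range(n)) * dx
    return 1.0 / integral

def psi(x):
    """Unnormalized, square-integrable wave function (illustrative choice)."""
    return math.exp(-x * x / 2.0)

C = normalization_constant(psi, -10.0, 10.0)

def psi_normalized(x):
    """sqrt(C) * psi has unit norm, so its own constant C equals 1."""
    return math.sqrt(C) * psi(x)

print(C, normalization_constant(psi_normalized, -10.0, 10.0))
```

For this Gaussian the integral of $|\psi|^2$ is $\sqrt{\pi}$, so C is about 0.564; after rescaling by $\sqrt{C}$ the constant becomes 1.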


C. QUANTUM DESCRIPTION OF A PARTICLE. WAVE PACKETS

In the preceding paragraph, we introduced the fundamental concepts which are 
necessary for the quantum description of a particle. In this paragraph, we are going 
to familiarize ourselves with these concepts and deduce from them several very 
important properties. Let us begin by studying a very simple special case, that of 
a free particle. 


1. Free particle 

Consider a particle whose potential energy is zero (or has a constant value)
at every point in space. The particle is thus not subjected to any force; it is said to be
free.

When $V(\mathbf{r}, t) = 0$, the Schrödinger equation becomes:

$$i\hbar \frac{\partial}{\partial t}\psi(\mathbf{r}, t) = -\frac{\hbar^2}{2m}\Delta\psi(\mathbf{r}, t) \tag{C-1}$$

This differential equation is obviously satisfied by solutions of the form:

$$\psi(\mathbf{r}, t) = A\, e^{i(\mathbf{k}\cdot\mathbf{r} - \omega t)} \tag{C-2}$$

(where A is a constant), on the condition that $\mathbf{k}$ and $\omega$ satisfy the relation:

$$\omega = \frac{\hbar \mathbf{k}^2}{2m} \tag{C-3}$$

Observe that, according to the de Broglie relations [see (B-2)], condition (C-3)
expresses the fact that the energy E and the momentum $\mathbf{p}$ of a free particle satisfy the
equation, which is well-known in classical mechanics:

$$E = \frac{\mathbf{p}^2}{2m} \tag{C-4}$$


We shall come back later (§ C-3) to the physical interpretation of a state of the
form (C-2). We already see that, since

$$|\psi(\mathbf{r}, t)|^2 = |A|^2 \tag{C-5}$$

a plane wave of this type represents a particle whose probability of presence is
uniform throughout all space (see comment below).
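
The statement that a plane wave solves the free-particle equation only when $\omega = \hbar k^2/2m$ can be checked numerically. The sketch below works in one dimension, in units where $\hbar = m = 1$ (an assumption made for convenience), and evaluates the residual of the equation with centered finite differences.

```python
import cmath

HBAR = 1.0   # units chosen so that hbar = m = 1 (illustrative assumption)
M = 1.0

def plane_wave(x, t, k, omega):
    """One-dimensional plane wave with unit amplitude."""
    return cmath.exp(1j * (k * x - omega * t))

def residual(k, omega, x=0.3, t=0.7, h=1e-3):
    """|i hbar dpsi/dt + (hbar^2 / 2m) d2psi/dx2| from centered finite differences."""
    dpsi_dt = (plane_wave(x, t + h, k, omega) - plane_wave(x, t - h, k, omega)) / (2 * h)
    d2psi_dx2 = (plane_wave(x + h, t, k, omega)
                 - 2 * plane_wave(x, t, k, omega)
                 + plane_wave(x - h, t, k, omega)) / h ** 2
    return abs(1j * HBAR * dpsi_dt + HBAR ** 2 / (2 * M) * d2psi_dx2)

k = 2.0
omega_ok = HBAR * k ** 2 / (2 * M)        # dispersion relation of the free particle
print(residual(k, omega_ok))              # close to zero (finite-difference error only)
print(residual(k, 2 * omega_ok))          # clearly non-zero: not a solution
```

The modulus of the plane wave is 1 at every point and every time, which is the uniform probability of presence noted in the text.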

The principle of superposition tells us that every linear combination of plane
waves satisfying (C-3) will also be a solution of equation (C-1). Such a superposition
can be written:

$$\psi(\mathbf{r}, t) = \frac{1}{(2\pi)^{3/2}} \int g(\mathbf{k})\, e^{i[\mathbf{k}\cdot\mathbf{r} - \omega(\mathbf{k}) t]}\, d^3k \tag{C-6}$$

($d^3k$ represents, by definition, the infinitesimal volume element in k-space:
$dk_x\, dk_y\, dk_z$). $g(\mathbf{k})$, which can be complex, must be sufficiently regular to allow
differentiation inside the integral. It can be shown, moreover, that any square-integrable
solution can be written in the form (C-6).

A wave function such as (C-6), a superposition of plane waves, is called
a three-dimensional "wave packet". For the sake of simplicity, we shall often be led
to study the case of a one-dimensional wave packet*, obtained from the superposition
of plane waves all propagating parallel to Ox. The wave function then depends
only on x and t:

$$\psi(x, t) = \frac{1}{\sqrt{2\pi}} \int g(k)\, e^{i[kx - \omega(k) t]}\, dk \tag{C-7}$$

* A simple model of a two-dimensional wave packet is presented in complement $E_I$. Some general
properties of three-dimensional wave packets are studied in complement $F_I$, which also shows how,
in certain cases, a three-dimensional problem can be reduced to several one-dimensional problems.

In the following paragraph, we shall be interested in the form of the wave packet
at a given instant. If we choose this instant as the time origin, the wave function
is written:

$$\psi(x, 0) = \frac{1}{\sqrt{2\pi}} \int g(k)\, e^{ikx}\, dk \tag{C-8}$$

We see that $g(k)$ is simply the Fourier transform (cf. appendix I) of $\psi(x, 0)$:

$$g(k) = \frac{1}{\sqrt{2\pi}} \int \psi(x, 0)\, e^{-ikx}\, dx \tag{C-9}$$

Consequently, the validity of formula (C-8) is not limited to the case of the free
particle: whatever the potential, $\psi(x, 0)$ can always be written in this form. The
consequences that we shall derive from this in §§ 2 and 3 below are thus perfectly
general. It is not until § 4 that we shall return explicitly to the free particle.

comment : 

A plane wave of type (C-2), whose modulus is constant throughout all 
space [cf. (C-5)], is not square-integrable. Therefore, rigorously, it cannot 
represent a physical state of the particle (in the same way as, in optics, 
a plane monochromatic wave is not physically realizable). On the other hand, 
a superposition of plane waves like (C-7) can be square-integrable. 
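
The Fourier pair formed by $\psi(x, 0)$ and $g(k)$ can be checked numerically with a round trip: build $\psi(x, 0)$ from a chosen $g(k)$, then recover $g(k)$ with the inverse transform, which carries $e^{-ikx}$. The Gaussian profile, the integration windows and the midpoint rule are assumptions of this sketch.

```python
import cmath
import math

K0 = 5.0   # center of the assumed profile g(k)

def integrate(f, a, b, n=400):
    """Midpoint-rule estimate of the integral of f over [a, b]."""
    step = (b - a) / n
    return sum(f(a + (i + 0.5) * step) for i in range(n)) * step

def g(k):
    """Assumed Gaussian profile peaked at k = K0 (illustrative choice)."""
    return math.exp(-(k - K0) ** 2 / 2.0)

def psi0(x):
    """psi(x, 0) built from g(k) as a superposition of waves e^{ikx}."""
    return integrate(lambda k: g(k) * cmath.exp(1j * k * x),
                     K0 - 10.0, K0 + 10.0) / math.sqrt(2 * math.pi)

def g_back(k):
    """g(k) recovered from psi(x, 0) by the inverse transform (note e^{-ikx})."""
    return integrate(lambda x: psi0(x) * cmath.exp(-1j * k * x),
                     -10.0, 10.0) / math.sqrt(2 * math.pi)

print(abs(psi0(0.0)), abs(g_back(K0) - g(K0)))
```

With this profile the packet is itself a Gaussian in x, multiplied by $e^{iK_0 x}$, so it is square-integrable, unlike a single plane wave.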


2. Form of the wave packet at a given time 

The form of the wave packet is given by the x-dependence of $\psi(x, 0)$ defined
by equation (C-8). Imagine that $|g(k)|$ has the shape depicted in figure 3; that is, it
has a pronounced peak situated at $k = k_0$ and a width (defined, for example, at
half its maximum value) of $\Delta k$.

FIGURE 3

Shape of the function $|g(k)|$ [modulus of the Fourier transform of $\psi(x, 0)$]: we assume that it
is centered at $k = k_0$, where it reaches a maximum, and has a width of $\Delta k$.


Let us begin by trying to understand qualitatively the behavior of $\psi(x, 0)$
through the study of a very simple special case. Let $\psi(x, 0)$, instead of being the
superposition of an infinite number of plane waves $e^{ikx}$ as in formula (C-8), be
the sum of only three plane waves. The wave vectors of these plane waves
are $k_0$, $k_0 - \frac{\Delta k}{2}$, $k_0 + \frac{\Delta k}{2}$, and their amplitudes are proportional, respectively, to 1,
1/2 and 1/2. We then have:

$$\psi(x) = \frac{g(k_0)}{\sqrt{2\pi}} \left[ e^{ik_0 x} + \frac{1}{2}\, e^{i(k_0 - \Delta k/2)x} + \frac{1}{2}\, e^{i(k_0 + \Delta k/2)x} \right] = \frac{g(k_0)}{\sqrt{2\pi}}\, e^{ik_0 x} \left[ 1 + \cos\left( \frac{\Delta k}{2}\, x \right) \right] \tag{C-10}$$


We see that $|\psi(x)|$ is maximum when x = 0. This result is due to the fact that, when x
takes on this value, the three waves are in phase and interfere constructively, as
shown in figure 4. As one moves away from the value x = 0, the waves become
more and more out of phase, and $|\psi(x)|$ decreases. The interference becomes
completely destructive when the phase shift between $e^{ik_0 x}$ and $e^{i(k_0 \pm \Delta k/2)x}$ is equal
to $\pm\pi$: $\psi(x)$ goes to zero when $x = \pm\frac{\Delta x}{2}$, $\Delta x$ being given by:

$$\Delta x \cdot \Delta k = 4\pi \tag{C-11}$$

This formula shows that the smaller the width $\Delta k$ of the function $|g(k)|$, the larger
the width $\Delta x$ of the function $|\psi(x)|$ (the distance between two zeros of $|\psi(x)|$).
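
The three-wave superposition discussed above is simple enough to verify directly. In this sketch the constant prefactor $g(k_0)/\sqrt{2\pi}$ is dropped, and the values of $k_0$ and $\Delta k$ are arbitrary illustrative numbers.

```python
import cmath
import math

K0 = 10.0   # central wave vector (illustrative value)
DK = 1.0    # spacing parameter Delta k (illustrative value)

def psi_three_waves(x):
    """Waves at K0, K0 - DK/2, K0 + DK/2 with weights 1, 1/2, 1/2 (prefactor dropped)."""
    return (cmath.exp(1j * K0 * x)
            + 0.5 * cmath.exp(1j * (K0 - DK / 2) * x)
            + 0.5 * cmath.exp(1j * (K0 + DK / 2) * x))

def envelope(x):
    """Closed form of the modulus: 1 + cos(DK * x / 2)."""
    return 1.0 + math.cos(DK * x / 2)

DX = 4 * math.pi / DK   # distance between the two zeros nearest to x = 0

for x in (0.0, 1.0, DX / 4, DX / 2):
    print(f"x = {x:6.3f}  |psi| = {abs(psi_three_waves(x)):.6f}  envelope = {envelope(x):.6f}")
```

Halving DK doubles DX, which is the reciprocal behavior of the two widths noted in the text.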



FIGURE 4

The real parts of the three waves whose sum gives the function $\psi(x)$ of (C-10). At x = 0, the three
waves are in phase and interfere constructively. As one moves away from x = 0, they go out of phase
and interfere destructively for $x = \pm\Delta x/2$.

In the lower part of the figure, Re{$\psi(x)$} is shown. The dashed-line curve corresponds
to the function $\left[1 + \cos\left(\frac{\Delta k}{2} x\right)\right]$, which, according to (C-10), gives $|\psi(x)|$ (and therefore, the form
of the wave packet).

comment:

Formula (C-10) shows that $|\psi(x)|$ is periodic in x and therefore has a series of
maxima and minima. This arises from the fact that $\psi(x)$ is the superposition of a finite
number of waves (here, three). For a continuous superposition of an infinite number of
waves, as in formula (C-8), such a phenomenon does not occur, and $|\psi(x, 0)|$ can have
only one maximum.

Let us now return to the general wave packet of formula (C-8). Its form also
results from an interference phenomenon: $|\psi(x, 0)|$ is maximum when the different
plane waves interfere constructively.

Let $\alpha(k)$ be the argument of the function $g(k)$:

$$g(k) = |g(k)|\, e^{i\alpha(k)} \tag{C-12}$$

Assume that $\alpha(k)$ varies sufficiently smoothly within the interval
$\left[k_0 - \frac{\Delta k}{2},\ k_0 + \frac{\Delta k}{2}\right]$
where $|g(k)|$ is appreciable; then, when $\Delta k$ is sufficiently small, one can expand $\alpha(k)$
in the neighborhood of $k = k_0$:

$$\alpha(k) \simeq \alpha(k_0) + (k - k_0) \left[ \frac{d\alpha}{dk} \right]_{k = k_0} \tag{C-13}$$

which enables us to rewrite (C-8) in the form:

$$\psi(x, 0) \simeq \frac{e^{i[k_0 x + \alpha(k_0)]}}{\sqrt{2\pi}} \int_{-\infty}^{+\infty} |g(k)|\, e^{i(k - k_0)(x - x_0)}\, dk \tag{C-14}$$

with:

$$x_0 = -\left[ \frac{d\alpha}{dk} \right]_{k = k_0} \tag{C-15}$$


The form (C-14) is useful for studying the variations of $|\psi(x, 0)|$ in terms
of x. When $|x - x_0|$ is large, the function of k which is to be integrated oscillates
a very large number of times within the interval $\Delta k$. We then see (cf. fig. 5-a, in
which the real part of this function is depicted) that the contributions of the
successive oscillations cancel each other out, and the integral over k becomes
negligible. In other words, when x is fixed at a value far from $x_0$, the phases of the
various waves which make up $\psi(x, 0)$ vary very rapidly in the domain $\Delta k$, and these
waves destroy each other by interference. On the other hand, if $x \simeq x_0$, the
function to be integrated over k oscillates hardly at all (cf. fig. 5-b), and $|\psi(x, 0)|$ is
maximum.

The position $x_M(0)$ of the center of the wave packet is therefore:

$$x_M(0) = x_0 = -\left[ \frac{d\alpha}{dk} \right]_{k = k_0} \tag{C-16}$$

Actually the result (C-16) can be obtained very simply. An integral such as the one
appearing in (C-8) will be maximum (in absolute value) when the waves having the
largest amplitude (those with k close to $k_0$) interfere constructively. This occurs when
the k-dependent phases of these waves vary only slightly around $k = k_0$. To obtain the
center of the wave packet, one then imposes (stationary phase condition) that the
derivative with respect to k of the phase is zero for $k = k_0$. In the particular case which
we are studying, the phase of the wave corresponding to k is $kx + \alpha(k)$. Therefore,
$x_M(0)$ is that value of x for which the derivative $x + d\alpha/dk$ is zero at $k = k_0$.

FIGURE 5

Variations with respect to k of the function to be integrated over k in order to obtain $\psi(x, 0)$. In
figure (a), x is fixed at a value such that $|x - x_0| > 1/\Delta k$, and the function to be integrated oscillates
several times within the interval $\Delta k$. In figure (b), x is fixed such that $|x - x_0| < 1/\Delta k$, and the
function to be integrated hardly oscillates, so that its integral over k takes on a relatively large value.
Consequently, the center of the wave packet [point where $|\psi(x, 0)|$ is maximum] is situated at
$x = x_0$.
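
The stationary phase result, according to which the packet is centered at $x_0 = -[d\alpha/dk]_{k=k_0}$, can be tested on a case where the phase of $g(k)$ is exactly linear. The Gaussian modulus, the value of the slope and the search grid below are assumptions of this sketch.

```python
import cmath
import math

K0 = 5.0      # center of |g(k)| (illustrative)
SIGMA = 0.5   # width parameter of |g(k)| (illustrative)
SHIFT = 3.0   # alpha(k) = -SHIFT * k, so -d(alpha)/dk = SHIFT

def g(k):
    """Gaussian modulus around K0 with a linear phase alpha(k) = -SHIFT * k."""
    return math.exp(-(k - K0) ** 2 / (2 * SIGMA ** 2)) * cmath.exp(-1j * SHIFT * k)

def psi0_modulus(x, n=400):
    """|psi(x, 0)| from a midpoint-rule estimate of the integral over k."""
    k_min, k_max = K0 - 6 * SIGMA, K0 + 6 * SIGMA
    dk = (k_max - k_min) / n
    total = 0j
    for i in range(n):
        k = k_min + (i + 0.5) * dk
        total += g(k) * cmath.exp(1j * k * x)
    return abs(total * dk) / math.sqrt(2 * math.pi)

xs = [i * 0.1 for i in range(-50, 101)]   # grid from x = -5 to x = 10
x_peak = max(xs, key=psi0_modulus)
print(x_peak)   # the maximum of |psi(x, 0)| sits at the grid point nearest SHIFT
```

Changing SHIFT moves the packet rigidly along Ox without changing its shape, since only the phase of $g(k)$ is modified.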


When x moves away from the value $x_0$, $|\psi(x, 0)|$ decreases. This decrease
becomes appreciable if $e^{i(k - k_0)(x - x_0)}$ oscillates approximately once when k traverses
the domain $\Delta k$, that is, when:

$$\Delta k \cdot (x - x_0) \simeq 1 \tag{C-17}$$

If $\Delta x$ is the approximate width of the wave packet, we therefore have:

$$\Delta k \cdot \Delta x \gtrsim 1 \tag{C-18}$$

We are thus brought back to a classical relation between the widths of two
functions which are Fourier transforms of each other. The important fact is
that the product $\Delta x \cdot \Delta k$ has a lower bound; the exact value of this bound clearly
depends on the precise definition of the widths $\Delta x$ and $\Delta k$.

A wave packet such as (C-7) thus represents the state of a particle whose
probability of presence, at the time t = 0, is practically zero outside an interval
of approximate width $\Delta x$ centered at the value $x_0$.
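
The reciprocal relation between the two widths can be observed numerically. The sketch below adopts one particular definition of the widths, the root-mean-square deviations of $|g(k)|^2$ and $|\psi(x, 0)|^2$, which is a choice made here and not the one fixed by the text; with that definition a Gaussian packet gives a product of 1/2 whatever its width.

```python
import cmath
import math

def rms_width(values, weights):
    """Root-mean-square deviation of `values` weighted by `weights`."""
    total = sum(weights)
    mean = sum(v * w for v, w in zip(values, weights)) / total
    variance = sum((v - mean) ** 2 * w for v, w in zip(values, weights)) / total
    return math.sqrt(variance)

def width_product(sigma_k, n=400):
    """Delta x * Delta k (rms definition) for a Gaussian g(k) of parameter sigma_k."""
    dk = 16 * sigma_k / n
    ks = [(i + 0.5 - n / 2) * dk for i in range(n)]            # window of +/- 8 sigma_k
    g = [math.exp(-k ** 2 / (4 * sigma_k ** 2)) for k in ks]   # |g|^2 has rms width sigma_k
    half_window = 4.0 / sigma_k                                 # x window wide enough for the packet
    dx = 2 * half_window / n
    xs = [(i + 0.5 - n / 2) * dx for i in range(n)]
    psi = [sum(gk * cmath.exp(1j * k * x) for gk, k in zip(g, ks)) * dk for x in xs]
    delta_k = rms_width(ks, [gk ** 2 for gk in g])
    delta_x = rms_width(xs, [abs(p) ** 2 for p in psi])
    return delta_x * delta_k

for sigma_k in (0.5, 1.0, 2.0):
    print(sigma_k, round(width_product(sigma_k), 6))
```

A narrower $g(k)$ gives a wider packet and vice versa, while the product stays fixed; other definitions of the widths change the numerical value of the product but not this behavior.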


comment : 

The preceding argument could lead one to believe that the product $\Delta x \cdot \Delta k$
is always of the order of 1 [cf. (C-17)]. Let us stress the fact that this is a lower
limit. Although it is impossible to construct wave packets for which the
product $\Delta x \cdot \Delta k$ is negligible compared to 1, it is perfectly possible to construct
packets for which this product is as large as desired [see, for example,
complement $G_I$, especially comment (ii) of § 3-c]. This is why (C-18) is written
in the form of an inequality.

3. Heisenberg uncertainty relation

In quantum mechanics, inequality (C-18) has extremely important physical 
consequences. We intend to discuss these now (we shall stay, for simplicity, within 
the framework of a one-dimensional model). 

We have seen that a plane wave $e^{i(k_0 x - \omega_0 t)}$ corresponds to a constant probability
density for the particle's presence along the Ox axis, for all values of t.
This result can be roughly expressed by saying that the corresponding value
of $\Delta x$ is infinite. On the other hand, only one angular frequency $\omega_0$ and one wave
vector $k_0$ are involved. According to the de Broglie relations, this means that the
energy and momentum of the particle are well-defined: $E = \hbar\omega_0$ and $p = \hbar k_0$.
Such a plane wave can, moreover, be considered to be a special case of (C-7), for
which $g(k)$ is a "delta function" (appendix II):

$$g(k) = \delta(k - k_0) \tag{C-19}$$

The corresponding value of $\Delta k$ is then zero.

But this property can also be interpreted in the following manner, using the
principle of spectral decomposition (cf. §§ A-3 and B-2). To say that a particle,
described at t = 0 by the wave function $\psi(x, 0) = A\, e^{ikx}$, has a well-determined
momentum, is to say that a measurement of the momentum at this time will
definitely yield $p = \hbar k$. From this we deduce that $e^{ikx}$ characterizes the eigenstate
corresponding to $p = \hbar k$. Since there exists a plane wave for every real value of k,
the eigenvalues which one can expect to find in a measurement of the momentum
on an arbitrary state include all real values. In this case, there is no quantization
of the possible results: as in classical mechanics, all values of the momentum are
allowed.

Now consider formula (C-8). In this formula, $\psi(x, 0)$ appears as a linear
superposition of the momentum eigenfunctions in which the coefficient of $e^{ikx}$
is $g(k)$. We are thus led to interpret $|g(k)|^2$ (to within a constant factor) as the probability
of finding $p = \hbar k$ if one measures, at t = 0, the momentum of a particle whose
state is described by $\psi(x, t)$. In reality, the possible values of p, like those of x, form
a continuous set, and $|g(k)|^2$ is proportional to a probability density: the
probability $d\bar{\mathcal{P}}(k)$ of obtaining a value between $\hbar k$ and $\hbar(k + dk)$ is, to within a
constant factor, $|g(k)|^2\, dk$. More precisely, if we rewrite formula (C-8) in the form:

$$\psi(x, 0) = \frac{1}{\sqrt{2\pi\hbar}} \int \bar{\psi}(p)\, e^{ipx/\hbar}\, dp \tag{C-20}$$

we know that $\bar{\psi}(p)$ and $\psi(x, 0)$ satisfy the Bessel-Parseval relation (appendix I):

$$\int |\psi(x, 0)|^2\, dx = \int |\bar{\psi}(p)|^2\, dp \tag{C-21}$$

If the common value of these integrals is C, $d\mathcal{P}(x) = \frac{1}{C} |\psi(x, 0)|^2\, dx$ is the probability
of the particle being found, at t = 0, between x and x + dx. In the same way:

$$d\bar{\mathcal{P}}(p) = \frac{1}{C}\, |\bar{\psi}(p)|^2\, dp \tag{C-22}$$

is the probability that the measurement of the momentum will yield a result included
between p and p + dp [relation (C-21) then ensures that the total probability of
finding any value is indeed equal to 1].

Now let us go back to the inequality (C-18). We can write it as : 

Ax. Ap ;> h (C-23) 

(Ap = TiAk is the width of the curve representing |i//(p)|). Consider a particle whose 
state is defined by the wave packet (C-20). We know that its position probability 
at t = 0, is appreciable only within a region of width Ax about x 0 : its position is 
known within an uncertainty Ax. If one measures the momentum of this particle at 

the same time, one will find a value between p 0 + 4^ and p 0 — 4p, since \*P(p)\ 2 

is practically zero outside this interval : the uncertainty in the momentum is 
therefore Ap. The interpretation of relation (C-23) is then the following : it is 
impossible to define at a given time both the position of the particle and its momen¬ 
tum to an arbitrary degree of accuracy. When the lower limit imposed by (C-23) 
is reached, increasing the accuracy in the position (decreasing Ax) implies that the 
accuracy in the momentum diminishes (Ap increases), and vice versa. This relation 
is called the Heisenberg uncertainty relation. 

We know of nothing like this in classical mechanics. The limitation expressed 
by (C-23) arises from the fact that h is not zero. It is the very small value of h on the 
macroscopic scale which renders this limitation totally negligible in classical 
mechanics (an example is discussed in detail in complement B_I).

comment : 

The inequality (C-18) with which we started is not an inherently quantum 
mechanical principle. It merely expresses a general property of Fourier transforms, 
numerous applications of which can be found in classical physics. For example, it is 
well known from electromagnetic theory that there exists no train of electromagnetic 
waves for which one can define the position and the wavelength with infinite accuracy 
at the same time. Quantum mechanics enters in when one associates a wave with a 
material particle and requires that the wavelength and the momentum satisfy de Broglie’s 
relation. 


4. Time evolution of a free wave packet 


Until now, we have been concerned only with the form of a wave packet 
at a given instant; in this paragraph, we are going to study its time evolution. Let 
us return, therefore, to the case of a free particle whose state is described by the 
one-dimensional wave packet (C-7). 

A given plane wave $e^{i(kx - \omega t)}$ propagates along the $Ox$ axis with the velocity:

$$V_\varphi(k) = \frac{\omega}{k} \tag{C-24}$$

since it depends on $x$ and $t$ only through $\left(x - \frac{\omega}{k}\,t\right)$; $V_\varphi(k)$ is called the phase velocity of the plane wave.

We know that in the case of an electromagnetic wave propagating in a vacuum, $V_\varphi$ is independent of $k$ and equal to the speed of light $c$. All the waves which make up a wave packet move at the same velocity, so that the packet as a whole also moves with the same velocity, without changing in shape. On the other hand, we know that this is not true in a dispersive medium, where the phase velocity is given by:

$$V_\varphi(k) = \frac{c}{n(k)} \tag{C-25}$$

$n(k)$ being the index of the medium, which varies with the wavelength.

The case that we are considering here corresponds to a dispersive medium, since the phase velocity is equal to [cf. equation (C-3)]:

$$V_\varphi(k) = \frac{\omega}{k} = \frac{\hbar k}{2m} \tag{C-26}$$

We shall see that when the different waves thus have unequal phase velocities, the velocity of the maximum $x_M$ of the wave packet is not the average phase velocity $\frac{\omega_0}{k_0} = \frac{\hbar k_0}{2m}$, contrary to what one might expect.

As we did before, we shall begin by trying to understand qualitatively what 
happens, before taking a more general point of view. Therefore, let us return to 
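the superposition of three waves considered in § C-2. For arbitrary $t$, $\psi(x, t)$ is given by:

$$\psi(x, t) = \frac{g(k_0)}{\sqrt{2\pi}} \left\{ e^{i[k_0 x - \omega_0 t]} + \frac{1}{2}\, e^{i\left[\left(k_0 - \frac{\Delta k}{2}\right)x - \left(\omega_0 - \frac{\Delta\omega}{2}\right)t\right]} + \frac{1}{2}\, e^{i\left[\left(k_0 + \frac{\Delta k}{2}\right)x - \left(\omega_0 + \frac{\Delta\omega}{2}\right)t\right]} \right\}$$

$$\phantom{\psi(x, t)} = \frac{g(k_0)}{\sqrt{2\pi}}\, e^{i(k_0 x - \omega_0 t)} \left[ 1 + \cos\left( \frac{\Delta k}{2}\,x - \frac{\Delta\omega}{2}\,t \right) \right] \tag{C-27}$$

We see, therefore, that the maximum of $|\psi(x, t)|$, which was at $x = 0$ at $t = 0$, is now at the point:

$$x_M(t) = \frac{\Delta\omega}{\Delta k}\, t \tag{C-28}$$

and not at the point $x = \frac{\omega_0}{k_0}\, t$. The physical origin of this result appears in figure 6.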

Part a) of this figure represents the position at time $t = 0$ of three adjacent maxima (1), (2), (3), for the real parts of each of the three waves. Since the maxima denoted by the index (2) coincide at $x = 0$, there is constructive interference at this point, which thus corresponds to the position of the maximum of $|\psi(x, 0)|$. Since the phase velocity increases with $k$ [formula (C-26)], the maximum (3) of the wave $\left(k_0 + \frac{\Delta k}{2}\right)$ will gradually catch up with that of the wave $(k_0)$, which will in turn catch up with that of the wave $\left(k_0 - \frac{\Delta k}{2}\right)$. After a certain time, we shall thus have the situation shown in figure 6-b: it will be the maxima (3) which coincide and thus determine the position of the maximum $x_M(t)$ of $|\psi(x, t)|$. We clearly see in the figure that $x_M(t)$ is not equal to $\frac{\omega_0}{k_0}\, t$, and a simple calculation again yields (C-28).


FIGURE 6

Positions of the maxima of the three waves of figure 4 at time $t = 0$ (fig. a) and at a subsequent time $t$ (fig. b). At time $t = 0$, it is the maxima (2), situated at $x = 0$, which interfere constructively: the position of the center of the wave packet is $x_M(0) = 0$. At time $t$, the three waves have advanced with different phase velocities $V_\varphi$. It is then the maxima (3) which interfere constructively and the center of the wave packet is situated at $x = x_M(t)$. We thus see that the velocity of the center of the wave packet (group velocity) is different from the phase velocities of the three waves.
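The displacement of the maximum of the three-wave superposition can also be located numerically. The following sketch is an illustration, not part of the original argument: it assumes units in which $\hbar = m = 1$, uses the free-particle dispersion relation $\omega = \hbar k^2/2m$, and searches for the maximum of $|\psi(x, t)|$ on a grid one spatial period wide around the origin. The function names and the numerical values of `k0`, `dk` and `t` are arbitrary choices made here.

```python
import cmath
import math

HBAR = 1.0  # assumed units: hbar = 1
M = 1.0     # assumed units: m = 1


def omega(k):
    """Free-particle dispersion relation omega = hbar k^2 / 2m."""
    return HBAR * k * k / (2 * M)


def three_wave_psi(x, t, k0, dk):
    """Superposition of the waves k0, k0 - dk/2, k0 + dk/2 with weights 1, 1/2, 1/2."""
    total = 0j
    for k, weight in ((k0, 1.0), (k0 - dk / 2, 0.5), (k0 + dk / 2, 0.5)):
        total += weight * cmath.exp(1j * (k * x - omega(k) * t))
    return total


def packet_maximum(t, k0, dk, n=20001):
    """Grid search for the maximum of |psi| over one period centered on x = 0."""
    half_period = 2 * math.pi / dk
    best_x, best_value = 0.0, -1.0
    for i in range(n):
        x = -half_period + 2 * half_period * i / (n - 1)
        value = abs(three_wave_psi(x, t, k0, dk))
        if value > best_value:
            best_x, best_value = x, value
    return best_x


if __name__ == "__main__":
    k0, dk, t = 10.0, 1.0, 0.5
    print("maximum found at x =", packet_maximum(t, k0, dk))
    print("group-velocity prediction  :", HBAR * k0 / M * t)
    print("phase-velocity prediction  :", omega(k0) / k0 * t)
```

The search window is deliberately limited to one period of the envelope, because the three-wave superposition is periodic in $x$ and has infinitely many equivalent maxima.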


The shift of the center of the wave packet (C-7) can be found in an analogous fashion, by applying the "stationary phase" method. It can be seen from the form (C-7) of the free wave packet that, in order to go from $\psi(x, 0)$ to $\psi(x, t)$, all we need to do is change $g(k)$ to $g(k)\, e^{-i\omega(k)t}$. The reasoning of § C-2 thus remains valid, on the condition that we replace the argument $\alpha(k)$ of $g(k)$ by:

$$\alpha(k) - \omega(k)\,t \tag{C-29}$$

The condition (C-16) then gives:

$$x_M(t) = \left[\frac{d\omega}{dk}\right]_{k = k_0} t - \left[\frac{d\alpha}{dk}\right]_{k = k_0} \tag{C-30}$$

We are thus brought back to result (C-28): the velocity of the maximum of the wave packet is:

$$V_G(k_0) = \left[\frac{d\omega}{dk}\right]_{k = k_0} \tag{C-31}$$


$V_G(k_0)$ is called the group velocity of the wave packet. With the dispersion relation given in (C-3), we obtain:

$$V_G(k_0) = \frac{\hbar k_0}{m} = 2\,V_\varphi(k_0) \tag{C-32}$$




This result is important, for it enables us to retrieve the classical description of the free particle, for the cases where this description is valid. For example, when one is dealing with a macroscopic particle (and the example of the dust particle discussed in complement B_I shows how small it can be), the uncertainty relation does not introduce an observable limit on the accuracy with which its position and momentum are known. This means that we can construct, in order to describe such a particle in a quantum mechanical way, a wave packet whose characteristic widths $\Delta x$ and $\Delta p$ are negligible. We would then speak, in classical terms, of the position $x_M(t)$ and the momentum $p_0$ of the particle. But then its velocity must be $v = \frac{p_0}{m}$. This is indeed what is implied by formula (C-32), obtained in the quantum description: in the cases where $\Delta x$ and $\Delta p$ can both be made negligible, the maximum of the wave packet moves like a particle which obeys the laws of classical mechanics.
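As a small numerical check of (C-31) and (C-32), the group velocity can be obtained by differentiating the dispersion relation numerically and compared with $p_0/m$. This is a sketch under assumptions made here: the particle is taken to be an electron, the value of `k0` is arbitrary, and the function names are hypothetical. The constants are those listed in the table at the front of the book.

```python
HBAR = 1.054589e-34  # joule second
M_E = 9.10953e-31    # electron mass, kg


def omega(k, mass):
    """Free-particle dispersion relation omega = hbar k^2 / 2m."""
    return HBAR * k * k / (2 * mass)


def phase_velocity(k, mass):
    """V_phi = omega / k."""
    return omega(k, mass) / k


def group_velocity(k, mass, rel_step=1e-6):
    """V_G = d(omega)/dk, estimated by a central finite difference."""
    h = k * rel_step
    return (omega(k + h, mass) - omega(k - h, mass)) / (2 * h)


if __name__ == "__main__":
    k0 = 1.0e10  # wave number in 1/m, an arbitrary illustrative value
    print("group velocity :", group_velocity(k0, M_E), "m/s")
    print("p0 / m         :", HBAR * k0 / M_E, "m/s")
    print("phase velocity :", phase_velocity(k0, M_E), "m/s")
```

The group velocity coincides with the classical velocity $p_0/m$, while the phase velocity is half of it.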

comment : 

We have stressed here the motion of the center of the free wave packet. 
It is also possible to study the way in which its form evolves in time. It is then 
easy to show that, if the width Ap is a constant of the motion, Ax varies 
over time and, for sufficiently long times, increases without limit (spreading 
of the wave packet). The discussion of this phenomenon is given in
complement G_I, where the special case of a Gaussian wave packet is treated.


D. PARTICLE IN A TIME-INDEPENDENT SCALAR POTENTIAL 

We have seen, in § C, how the quantum mechanical description of a particle reduces to the classical description when Planck's constant h can be considered to be negligible. In the classical approximation, the wavelike character does not appear because the wavelength $\lambda = \frac{h}{p}$ associated with the particle is much smaller than the characteristic lengths of its motion. This situation is analogous to the one encountered in optics. Geometrical optics, which ignores the wavelike properties of light, constitutes a good approximation when the corresponding wavelength can be neglected compared to the lengths with which one is concerned. Classical mechanics thus plays, with respect to quantum mechanics, the same role played by geometrical optics with respect to wave optics.

In this paragraph, we are going to be concerned with a particle in a time- 
independent potential. What we have just said implies that typically quantum 
effects (that is, those of wave origin) should arise when the potential varies 
appreciably over distances shorter than the wavelength, which cannot then be 
neglected. This is why we are going to study the behavior of a quantum particle 
placed in various "square potentials", that is, "step potentials", as shown in
figure 7-a. Such a potential, which is discontinuous, clearly varies considerably over 
intervals of the order of the wavelength, however small it is : quantum effects must 
therefore always appear. Before beginning this investigation, we shall discuss some 
important properties of the Schrödinger equation when the potential is not time-dependent.

1. Separation of variables. Stationary states

The wave function of a particle whose potential energy $V(\mathbf{r})$ is not time-dependent must satisfy the Schrödinger equation:

$$i\hbar\, \frac{\partial}{\partial t}\, \psi(\mathbf{r}, t) = -\frac{\hbar^2}{2m}\, \Delta\psi(\mathbf{r}, t) + V(\mathbf{r})\, \psi(\mathbf{r}, t) \tag{D-1}$$

a. EXISTENCE OF STATIONARY STATES 

Let us see if there exist solutions of this equation of the form:

$$\psi(\mathbf{r}, t) = \varphi(\mathbf{r})\, \chi(t) \tag{D-2}$$

Substituting (D-2) into (D-1), we obtain:

$$i\hbar\, \varphi(\mathbf{r})\, \frac{d\chi(t)}{dt} = \chi(t) \left[ -\frac{\hbar^2}{2m}\, \Delta\varphi(\mathbf{r}) \right] + \chi(t)\, V(\mathbf{r})\, \varphi(\mathbf{r}) \tag{D-3}$$

If we divide both sides by the product $\varphi(\mathbf{r})\, \chi(t)$, we find:

$$\frac{i\hbar}{\chi(t)}\, \frac{d\chi(t)}{dt} = \frac{1}{\varphi(\mathbf{r})} \left[ -\frac{\hbar^2}{2m}\, \Delta\varphi(\mathbf{r}) \right] + V(\mathbf{r}) \tag{D-4}$$

This equation equates a function of $t$ only (left-hand side) and a function of $\mathbf{r}$ only (right-hand side). This equality is only possible if each of these functions is in fact a constant, which we shall set equal to $\hbar\omega$, where $\omega$ has the dimensions of an angular frequency.

Setting the left-hand side equal to $\hbar\omega$, we obtain for $\chi(t)$ a differential equation which can easily be integrated to give:

$$\chi(t) = A\, e^{-i\omega t} \tag{D-5}$$

In the same way, $\varphi(\mathbf{r})$ must satisfy the equation:

$$-\frac{\hbar^2}{2m}\, \Delta\varphi(\mathbf{r}) + V(\mathbf{r})\, \varphi(\mathbf{r}) = \hbar\omega\, \varphi(\mathbf{r}) \tag{D-6}$$

If we set $A = 1$ in equation (D-5) [which is possible if we incorporate, for example, the constant $A$ in $\varphi(\mathbf{r})$], we achieve the following result: the function

$$\psi(\mathbf{r}, t) = \varphi(\mathbf{r})\, e^{-i\omega t} \tag{D-7}$$

is a solution of the Schrödinger equation, on the condition that $\varphi(\mathbf{r})$ is a solution of (D-6). The time and space variables are said to have been separated.

A wave function of the form (D-7) is called a stationary solution of the Schrödinger equation: it leads to a time-independent probability density $|\psi(\mathbf{r}, t)|^2 = |\varphi(\mathbf{r})|^2$. In a stationary function, only one angular frequency $\omega$ appears; according to the Planck-Einstein relations, a stationary state is a state with a well-defined energy $E = \hbar\omega$ (energy eigenstate). In classical mechanics, when the potential energy is time-independent, the total energy is a constant of the motion; in quantum mechanics, there exist well-determined energy states.

Equation (D-6) can therefore be written:

$$\left[ -\frac{\hbar^2}{2m}\, \Delta + V(\mathbf{r}) \right] \varphi(\mathbf{r}) = E\, \varphi(\mathbf{r}) \tag{D-8}$$

or:

$$H\varphi(\mathbf{r}) = E\, \varphi(\mathbf{r}) \tag{D-9}$$

where $H$ is the differential operator:

$$H = -\frac{\hbar^2}{2m}\, \Delta + V(\mathbf{r}) \tag{D-10}$$

$H$ is a linear operator since, if $\lambda_1$ and $\lambda_2$ are constants, we have:

$$H\left[ \lambda_1 \varphi_1(\mathbf{r}) + \lambda_2 \varphi_2(\mathbf{r}) \right] = \lambda_1 H\varphi_1(\mathbf{r}) + \lambda_2 H\varphi_2(\mathbf{r}) \tag{D-11}$$

Equation (D-9) is thus the eigenvalue equation of the linear operator H : the 
application of H to the « eigenfunction » (p(r) yields the same function, multiplied 
by the corresponding « eigenvalue » E. The allowed energies are therefore the 
eigenvalues of the operator H. We shall see later that equation (D-9) has square-integrable solutions $\varphi(\mathbf{r})$ only for certain values of E (cf. § D-2-c and § 2-c of complement H_I): this is the origin of energy quantization.

comment: 

Equation (D-8) [or (D-9)] is sometimes called the “time-independent Schrödinger
equation”, as opposed to the “time-dependent Schrödinger equation” (D-1). We stress
their essential difference : equation (D-1) is a general equation which gives the evolution
of the wave function, whatever the state of the particle; on the other hand, the eigenvalue 
equation (D-9) enables us to find, amongst all the possible states of the particle, those 
which are stationary. 


b. SUPERPOSITION OF STATIONARY STATES 

In order to distinguish between the various possible values of the energy $E$ (and the corresponding eigenfunctions $\varphi(\mathbf{r})$), we label them with an index $n$. Thus we have:

$$H\varphi_n(\mathbf{r}) = E_n\, \varphi_n(\mathbf{r}) \tag{D-12}$$

and the stationary states of the particle have as wave functions:

$$\psi_n(\mathbf{r}, t) = \varphi_n(\mathbf{r})\, e^{-iE_n t/\hbar} \tag{D-13}$$

$\psi_n(\mathbf{r}, t)$ is a solution of the Schrödinger equation (D-1). Since this equation is linear, it has a whole series of other solutions of the form

$$\psi(\mathbf{r}, t) = \sum_n c_n\, \varphi_n(\mathbf{r})\, e^{-iE_n t/\hbar} \tag{D-14}$$

where the coefficients $c_n$ are arbitrary complex constants. In particular, we have:

$$\psi(\mathbf{r}, 0) = \sum_n c_n\, \varphi_n(\mathbf{r}) \tag{D-15}$$

Inversely, assume that we know $\psi(\mathbf{r}, 0)$, that is, the state of the particle at $t = 0$. We shall see later that any function $\psi(\mathbf{r}, 0)$ can always be decomposed in terms of eigenfunctions of $H$, as in (D-15). The coefficients $c_n$ are therefore determined by $\psi(\mathbf{r}, 0)$. The corresponding solution $\psi(\mathbf{r}, t)$ of the Schrödinger equation is then given by (D-14). All we need to do to obtain it is to multiply each term of (D-15) by the factor $e^{-iE_n t/\hbar}$, where $E_n$ is the eigenvalue associated with $\varphi_n(\mathbf{r})$. We stress the fact that these phase factors differ from one term to another. It is only in the case of stationary states that the $t$-dependence involves only one exponential [formula (D-13)].
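The difference between a stationary state (D-13) and a superposition (D-14) can be made concrete with a short numerical sketch. The example rests on an assumption that goes beyond this section: it uses the well-known eigenfunctions and energies of a particle confined to an interval of length `L` by infinitely high walls, chosen only because they are simple to write down. Units with $\hbar = m = L = 1$ and all function names are choices made here.

```python
import cmath
import math

HBAR = 1.0  # assumed units
M = 1.0
L = 1.0     # width of the (assumed) infinite well


def energy(n):
    """Energy E_n of the n-th level of the infinite well (assumed model)."""
    return (n * math.pi * HBAR / L) ** 2 / (2 * M)


def phi(n, x):
    """Normalized eigenfunction phi_n(x) of the infinite well (assumed model)."""
    return math.sqrt(2 / L) * math.sin(n * math.pi * x / L)


def psi(x, t, coeffs):
    """Formula (D-14): each term of (D-15) multiplied by exp(-i E_n t / hbar)."""
    return sum(c * phi(n, x) * cmath.exp(-1j * energy(n) * t / HBAR)
               for n, c in coeffs.items())


def density(x, t, coeffs):
    """Probability density |psi(x, t)|^2."""
    return abs(psi(x, t, coeffs)) ** 2


if __name__ == "__main__":
    stationary = {1: 1.0}
    mixed = {1: 1 / math.sqrt(2), 2: 1 / math.sqrt(2)}
    period = 2 * math.pi * HBAR / (energy(2) - energy(1))
    for t in (0.0, period / 2, period):
        print(t, density(0.3, t, stationary), density(0.3, t, mixed))
```

For the single eigenstate the density does not depend on $t$; for the two-term superposition it oscillates at the angular frequency $(E_2 - E_1)/\hbar$, because the two phase factors differ.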


2. One-dimensional "square" potentials. Qualitative study

We said at the beginning of § D that in order to display quantum effects we 
were going to consider potentials which varied considerably over small distances. 
We shall limit ourselves here to a qualitative study, so as to concentrate on the 
simple physical ideas. A more detailed study is presented in the complements of
this chapter (complement H_I). To simplify the problem, we shall consider a
one-dimensional model, in which the potential energy depends only on x (the
justification for such a model is given in complement F_I).

a. PHYSICAL MEANING OF A SQUARE POTENTIAL 

We shall consider a one-dimensional problem with a potential of the type 
shown in figure 7-a. The Ox axis is divided into a certain number of constant- 
potential regions. At the border of two adjacent regions the potential makes an 
abrupt jump (discontinuity). Actually, such a function cannot really represent a physical potential, which must be continuous. We shall use it to represent schematically a potential energy $V(x)$ which actually has the shape shown in figure 7-b: there are no discontinuities, but $V(x)$ varies very rapidly in the neighborhood of certain values of $x$. When the intervals over which these variations occur are much smaller than all other distances involved in the problem (in particular, the wavelength associated with the particle), we can replace the true potential by the square potential of figure 7-a. This is an approximation, which would cease to be valid, for example, for a particle having too high an energy, whose wavelength would be very short.

The predictions of classical mechanics concerning the behavior of a particle in a potential such as that of figure 7 are easy to determine. For example, imagine that $V(x)$ is the gravitational potential energy. Figure 7-b then represents the real profile of the terrain on which the particle moves: the corresponding discontinuities are sharp slopes, separated by horizontal plateaus. Notice that, if we fix the total energy $E$ of the particle, the domains of the $Ox$ axis where $V > E$ are forbidden to it (its kinetic energy $E_k = E - V$ must be positive).

FIGURE 7

Square potential (fig. a) which schematically represents a real potential (fig. b) for which the force has the shape shown in figure c.




COMMENT : 

The force exerted on the particle is $F(x) = -\frac{dV(x)}{dx}$. In figure 7-c, we have depicted this force, obtained from the potential $V(x)$ of figure 7-b. It can be seen that this particle, in all the regions where the potential is constant, is not subject to any force. Its velocity is then constant. It is only in the frontier zones between these plateaus that a force acts on the particle and, depending on the case, accelerates it or slows it down.

b. OPTICAL ANALOGY 

We are going to consider the stationary states (§ D-1) of a particle in a one-dimensional “square” potential.

In a region where the potential has a constant value $V$, the eigenvalue equation (D-9) is written:

$$\left[ -\frac{\hbar^2}{2m}\, \frac{d^2}{dx^2} + V \right] \varphi(x) = E\, \varphi(x) \tag{D-16}$$

or:

$$\left[ \frac{d^2}{dx^2} + \frac{2m}{\hbar^2}\, (E - V) \right] \varphi(x) = 0 \tag{D-17}$$

Now, in optics, there exists a completely analogous equation. Consider a transparent medium whose index $n$ depends neither on $\mathbf{r}$ nor on time. In this medium, there can be electromagnetic waves whose electric field $\mathbf{E}(\mathbf{r}, t)$ is independent of $y$ and $z$ and has the form:

$$\mathbf{E}(\mathbf{r}, t) = \mathbf{e}\, E(x)\, e^{-i\Omega t} \tag{D-18}$$




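where $\mathbf{e}$ is a unit vector perpendicular to $Ox$. $E(x)$ must then satisfy:

$$\left[ \frac{d^2}{dx^2} + \frac{n^2 \Omega^2}{c^2} \right] E(x) = 0 \tag{D-19}$$

We see that equations (D-17) and (D-19) become identical if we set:

$$\frac{2m}{\hbar^2}\, (E - V) = \frac{n^2 \Omega^2}{c^2} \tag{D-20}$$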


Moreover, at a point $x$ where the potential energy $V$ [and, consequently, the index $n$ given by (D-20)] is discontinuous, the boundary conditions for $\varphi(x)$ and $E(x)$ are the same: these two functions, as well as their first derivatives, must remain continuous (cf. complement H_I, § 1-b). The structural analogy between the two equations (D-17) and (D-19) thus enables us to associate with a quantum mechanical problem, corresponding to the potential of figure 7-a, an optical problem: the propagation of an electromagnetic wave of angular frequency $\Omega$ in a medium whose index $n$ has discontinuities of the same type. According to (D-20), the relation between the optical and mechanical parameters is:

$$n(\Omega) = \frac{1}{\hbar\Omega}\, \sqrt{2mc^2\, (E - V)} \tag{D-21}$$


For the light wave, a region where $E > V$ corresponds to a transparent medium whose index is real. The wave is then of the form $e^{ikx}$.

What happens when $V > E$? Formula (D-20) gives a pure imaginary index. In (D-19), $n^2$ is negative and the solution is of the form $e^{-\rho x}$: it is the analogue of an “evanescent wave”. Certain aspects of the situation recall the propagation of an electromagnetic wave in a metallic medium*.

Thus we can transpose the well-known results of wave optics to the problems 
which we are studying here. It is important, however, to realize that this is merely 
an analogy. The interpretation that we give for the wave function is fundamentally 
different from that which classical wave optics attributes to the electromagnetic 
wave. 
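Relation (D-21) is simple enough to be evaluated directly. The sketch below is a minimal illustration, assuming units with $\hbar = c = 1$ by default; the function names are hypothetical. A complex square root is used so that the same expression covers both the transparent case $E > V$ (real index) and the case $V > E$ (pure imaginary index), for which the decay constant of the evanescent solution $e^{-\rho x}$ follows from (D-19) as $\rho = |n|\,\Omega/c$.

```python
import cmath


def effective_index(E, V, m, Omega, hbar=1.0, c=1.0):
    """Index n(Omega) of formula (D-21); real if E > V, pure imaginary if V > E."""
    return cmath.sqrt(2 * m * c * c * (E - V)) / (hbar * Omega)


def decay_constant(E, V, m, Omega, hbar=1.0, c=1.0):
    """rho = |n| Omega / c for V > E, i.e. sqrt(2m(V - E)) / hbar."""
    n = effective_index(E, V, m, Omega, hbar, c)
    return abs(n.imag) * Omega / c


if __name__ == "__main__":
    print("E > V :", effective_index(E=2.0, V=1.0, m=1.0, Omega=1.0))
    print("V > E :", effective_index(E=1.0, V=2.0, m=1.0, Omega=1.0))
    print("rho   :", decay_constant(E=1.0, V=2.0, m=1.0, Omega=1.0))
```

Note that $\Omega$ drops out of $\rho$: the optical frequency is only a bookkeeping parameter of the analogy, while the decay length depends on $m$ and $V - E$ alone.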


* This analogy should not be pushed too far, since the index $n$ of a metallic medium has both a real and an imaginary part (in a metal, an optical wave continues to oscillate as it damps out).


c. EXAMPLES

α. Potential step and barrier

Consider a particle of energy $E$ which, coming from the region of negative $x$, arrives at the potential “step” of height $V_0$ shown in figure 8.

If $E > V_0$ (the case in which the classical particle clears the potential step and continues towards the right with a smaller velocity), the optical analogy is the following: a light wave propagates from left to right in a medium of index $n_1$:

$$n_1 = \frac{c}{\hbar\Omega}\, \sqrt{2mE} \tag{D-22}$$

at $x = x_1$, there is a discontinuity, and the index, for $x > x_1$, is:

$$n_2 = \frac{c}{\hbar\Omega}\, \sqrt{2m\,(E - V_0)} \tag{D-23}$$

FIGURE 8

Potential step.

We know that the incident wave coming from the left splits into a reflected wave and a transmitted wave. Let us transpose this result to quantum mechanics: the particle has a certain probability $\mathcal{P}$ of being reflected, and only the probability $1 - \mathcal{P}$ of pursuing its course towards the right. This result is contrary to what is predicted by classical mechanics.

When $E < V_0$, the index $n_2$, which corresponds to the region $x > x_1$, becomes pure imaginary, and the incident light wave is totally reflected. The quantum prediction therefore coincides at this point with that of classical mechanics. Nevertheless, the existence, for $x > x_1$, of an evanescent wave, shows that the quantum particle has a non-zero probability of being found in this region.
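The reflection probability at the step can be estimated from the continuity conditions alone. The following sketch is an illustration under assumptions made here: the step is placed at $x = 0$ rather than $x_1$ (a shift of origin), the stationary state is written as $e^{ik_1x} + r\,e^{-ik_1x}$ on the left and $t\,e^{ik_2x}$ on the right, and continuity of $\varphi$ and $\varphi'$ at the step gives $r = (k_1 - k_2)/(k_1 + k_2)$. Units with $\hbar = m = 1$ and the function name are hypothetical choices. The quantitative treatment is the subject of complement H_I.

```python
import cmath
import math


def step_reflection(E, V0, m=1.0, hbar=1.0):
    """Reflection probability |r|^2 at a potential step of height V0, for E > 0."""
    k1 = math.sqrt(2 * m * E) / hbar
    # For E < V0 the square root is pure imaginary: k2 = i*rho, so that
    # exp(i*k2*x) = exp(-rho*x) is the evanescent wave in the step region.
    k2 = cmath.sqrt(2 * m * (E - V0)) / hbar
    r = (k1 - k2) / (k1 + k2)
    return abs(r) ** 2


if __name__ == "__main__":
    print("E = 2 V0  :", step_reflection(E=2.0, V0=1.0))
    print("E = V0/2  :", step_reflection(E=0.5, V0=1.0))
```

For $E > V_0$ the result lies strictly between 0 and 1, in contrast with the classical prediction; for $E < V_0$ the modulus of $r$ is 1 and the reflection is total.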

The role of this evanescent wave is more striking in the case of a potential barrier (fig. 9). For $E < V_0$, a classical particle would always turn back. But, in the corresponding optical problem, we would have a layer of finite thickness, with an imaginary index, surrounded by a transparent medium. If this thickness is not much greater than the range $1/\rho$ of the evanescent wave, part of the incident wave is transmitted into the region $x > x_2$. Therefore, even for $E < V_0$, we find a non-zero probability of the particle crossing the barrier. This is called the “tunnel effect”.
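To get a feeling for the orders of magnitude involved, the range $1/\rho$ can be evaluated for an electron. The sketch below is only indicative: it uses $\rho = \sqrt{2m(V_0 - E)}/\hbar$, which follows from (D-17) for $V > E$, and the factor $e^{-2\rho l}$ by which $|\varphi|^2$ of a pure evanescent wave decreases over a thickness $l$. This factor is not the exact transmission probability of the barrier (that calculation is left to complement H_I); the function names and the chosen energies are assumptions made here.

```python
import math

HBAR = 1.054589e-34  # joule second
M_E = 9.10953e-31    # electron mass, kg
EV = 1.602189e-19    # one electron volt in joules


def decay_constant(barrier_minus_energy_ev, mass=M_E):
    """rho = sqrt(2 m (V0 - E)) / hbar, with V0 - E given in eV."""
    return math.sqrt(2 * mass * barrier_minus_energy_ev * EV) / HBAR


def evanescent_range(barrier_minus_energy_ev, mass=M_E):
    """Range 1/rho of the evanescent wave, in meters."""
    return 1.0 / decay_constant(barrier_minus_energy_ev, mass)


def attenuation(thickness_m, barrier_minus_energy_ev, mass=M_E):
    """Order-of-magnitude factor exp(-2 rho l) for |phi|^2 across the barrier."""
    return math.exp(-2 * decay_constant(barrier_minus_energy_ev, mass) * thickness_m)


if __name__ == "__main__":
    print("range for V0 - E = 1 eV :", evanescent_range(1.0), "m")
    for thickness in (1e-10, 1e-9):
        print("thickness", thickness, "m ->", attenuation(thickness, 1.0))
```

For an electron with $V_0 - E = 1$ eV the range is about 2 Å, so the attenuation is moderate across an atomic-size barrier and becomes very strong once the thickness reaches a few nanometers.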


FIGURE 9

Potential barrier.


β. Potential well

The function $V(x)$ now has the form shown in figure 10. The predictions of classical mechanics are the following: when the particle has a negative energy $E$ (but greater than $-V_0$), it can only oscillate between $x_1$ and $x_2$, with kinetic energy $E_k = E + V_0$; when the particle has a positive energy and arrives from the left, it undergoes an abrupt acceleration at $x_1$, then an equal deceleration at $x_2$, and then continues towards the right.

In the optical analogue of the case $-V_0 < E < 0$, the indices $n_1$ and $n_3$, which correspond to the regions $x < x_1$ and $x > x_2$, are imaginary, while the index $n_2$, which characterizes the interval $[x_1, x_2]$, is real. Thus we have the equivalent of a layer of air, for example, between two reflecting media. The different waves reflected successively at $x_1$ and $x_2$ destroy each other through interference, except for certain well-determined frequencies (“normal modes”) which allow stable stationary waves to be established. From the quantum point of view, this implies that the negative energies are quantized *, while, classically, all values included between $-V_0$ and 0 are possible.

FIGURE 10

Potential well.

For $E > 0$, the indices $n_1$, $n_2$ and $n_3$ are real:

$$n_1 = n_3 = \frac{c}{\hbar\Omega}\, \sqrt{2mE} \tag{D-24}$$

$$n_2 = \frac{c}{\hbar\Omega}\, \sqrt{2m\,(E + V_0)} \tag{D-25}$$

Since $n_2$ is greater than $n_1$ and $n_3$, the situation is analogous to that of a layer of glass in air. In order to obtain the reflected wave for $x < x_1$, or the transmitted wave in the region $x > x_2$, it is necessary to superpose an infinite number of waves which arise from successive reflections at $x_1$ and $x_2$ (multiple wave interferometer analogous to a Fabry-Perot). We then find that, for certain incident frequencies, the wave is entirely transmitted. From the quantum point of view, the particle thus has, in general, a certain probability of being reflected. However, there exist energy values, called resonance energies, for which the probability of transmission is 1 and, consequently, the probability of reflection is 0.
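The existence of resonance energies can be checked with a few lines of code that use only the continuity of $\varphi$ and $\varphi'$ at the two edges of the well. This is a sketch under assumptions made here: the well occupies $[0, a]$ instead of $[x_1, x_2]$, units are such that $\hbar = m = 1$, the transmitted wave is given unit amplitude and the solution is propagated backwards to find the incident amplitude. The function name and the numerical values are hypothetical.

```python
import cmath
import math


def well_transmission(E, V0, a, m=1.0, hbar=1.0):
    """Transmission probability for E > 0 across a well of depth V0 and width a."""
    k = math.sqrt(2 * m * E) / hbar          # wave number outside the well
    K = math.sqrt(2 * m * (E + V0)) / hbar   # wave number inside the well

    # Region x > a: transmitted wave exp(ikx) with unit amplitude.
    psi_a = cmath.exp(1j * k * a)
    dpsi_a = 1j * k * psi_a

    # Region 0 < x < a: A exp(iKx) + B exp(-iKx), matched at x = a.
    A = 0.5 * (psi_a + dpsi_a / (1j * K)) * cmath.exp(-1j * K * a)
    B = 0.5 * (psi_a - dpsi_a / (1j * K)) * cmath.exp(1j * K * a)

    # Values at x = 0, matched to I exp(ikx) + R exp(-ikx) on the left.
    psi_0 = A + B
    dpsi_0 = 1j * K * (A - B)
    incident = 0.5 * (psi_0 + dpsi_0 / (1j * k))

    # Same wave number on both sides, so T = |transmitted|^2 / |incident|^2.
    return 1.0 / abs(incident) ** 2


if __name__ == "__main__":
    V0, a = 10.0, 1.0
    resonance = (2 * math.pi / a) ** 2 / 2 - V0  # K a = 2 pi
    print("off resonance, E = 5 :", well_transmission(5.0, V0, a))
    print("at resonance         :", well_transmission(resonance, V0, a))
```

In this model the transmission reaches 1 whenever the well width is a whole number of half-wavelengths of the interior wave ($Ka$ a multiple of $\pi$), which is the Fabry-Perot condition mentioned above.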

These few examples show how much the predictions of quantum mechanics 
can differ from those of classical mechanics. They also clearly stress the primordial 
role of potential discontinuities (which represent, schematically, rapid variations). 


* The allowed energy values are not given by the well-known condition: $x_2 - x_1 = k\lambda_2/2$, for it is necessary to take into account the existence of evanescent waves, which introduce a phase shift upon reflection at $x = x_1$ and $x = x_2$ (cf. complement H_I, § 2-c).

CONCLUSION

In this chapter, we have introduced and discussed, in a qualitative and intuitive manner, certain fundamental ideas of quantum mechanics. We shall later return to these ideas (chap. III) so as to present them in a more precise and systematic way. Nevertheless, it is already clear that the quantum description of physical systems differs radically from the one given by classical mechanics (although the latter constitutes, in numerous cases, an excellent approximation). We have limited ourselves in this chapter to the case of physical systems composed of only one particle. The description of these systems at a given time is, in classical mechanics, founded on the specification of six parameters, which are the components of the position $\mathbf{r}(t)$ and the velocity $\mathbf{v}(t)$ of the particle. All the dynamical variables (energy, linear momentum, angular momentum) are determined by the specification of $\mathbf{r}(t)$ and $\mathbf{v}(t)$. Newton's laws enable us to calculate $\mathbf{r}(t)$ through the solution of second-order differential equations with respect to time. Consequently, they fix the values of $\mathbf{r}(t)$ and $\mathbf{v}(t)$ for any time $t$ when they are known for the initial time.

Quantum mechanics uses a more complicated description of phenomena. The dynamic state of a particle, at a given time, is characterized by a wave function. It no longer depends on only six parameters, but on an infinite number [the values of $\psi(\mathbf{r}, t)$ at all points $\mathbf{r}$ of space]. Moreover, the predictions of the measurement results are now only probabilistic (they yield only the probability of obtaining a given result in the measurement of a dynamical variable). The wave function is a solution of the Schrödinger equation, which enables us to calculate $\psi(\mathbf{r}, t)$ from $\psi(\mathbf{r}, 0)$. This equation implies a principle of superposition which leads to wave effects.

This upheaval in our conception of mechanics was imposed by experiment. The structure and behavior of matter on an atomic level are incomprehensible in the framework of classical mechanics. The theory has thereby lost some of its simplicity, but it has gained a great deal of unity, since matter and radiation are described in terms of the same general scheme (wave-particle duality). We stress the fact that this general scheme, although it runs counter to our ideas and habits drawn from the study of the macroscopic domain, is perfectly coherent. No one has ever succeeded in imagining an experiment which could violate the uncertainty principle (cf. complement D_I of this chapter). In general, no observation has, to date, contradicted the fundamental principles of quantum mechanics. Nevertheless, at present, there is no global theory of relativistic and quantum phenomena, and nothing, of course, prevents the possibility of a new upheaval.


References and suggestions for further reading:

Description of physical phenomena which demonstrate the necessity of introducing quantum mechanical concepts: see the subsection “Introductory work - quantum physics” of section 1 of the bibliography; in particular, Wichmann (1.1) and Feynman III (1.2), chaps. 1 and 2.

History of the development of quantum mechanical concepts: references of section 4 of the bibliography, in particular, Jammer (4.8); also see references (5.11) and (5.12), which contain numerous references to the original articles.

Fundamental experiments: references to the original articles can be found in section 3 of the bibliography.

The problem of interpretation in quantum mechanics: section 5 of the bibliography; in particular, the “Resource Letter” (5.11), which contains many classified references.

Analogies and differences between matter waves and electromagnetic waves: Bohm (5.1), chap. 4; in particular, the table “Summary on Probabilities” at the end of the chapter.

See also the articles by Schrödinger (1.25), Gamow (1.26), Born and Biem (1.28), Scully and Sargent (1.30).

COMPLEMENTS OF CHAPTER I


A_I: ORDER OF MAGNITUDE OF THE WAVELENGTHS ASSOCIATED WITH MATERIAL PARTICLES

B_I: CONSTRAINTS IMPOSED BY THE UNCERTAINTY RELATIONS

C_I: THE UNCERTAINTY RELATIONS AND ATOMIC PARAMETERS

A_I, B_I, C_I: very simple but fundamental reflections on the order of magnitude of quantum parameters.

D_I: AN EXPERIMENT ILLUSTRATING THE UNCERTAINTY RELATION

D_I: discussion of a simple thought experiment which attempts to invalidate the complementarity between the particle and wave aspects of light (easy, but could be reserved for subsequent study).

E_I: A SIMPLE TREATMENT OF A TWO-DIMENSIONAL WAVE PACKET

F_I: THE RELATION BETWEEN ONE- AND THREE-DIMENSIONAL PROBLEMS

G_I: ONE-DIMENSIONAL GAUSSIAN WAVE PACKET; SPREADING OF THE WAVE PACKET

E_I, F_I, G_I: complements on wave packets (§ C of chapter I).

E_I: reveals in a simple, qualitative way the relation which exists between the lateral extension of a two-dimensional wave packet and the angular dispersion of wave vectors (easy).

F_I: generalization to three dimensions of the results of § C of chapter I; shows how the study of a particle in three-dimensional space can, in certain cases, be reduced to one-dimensional problems (a little more difficult).

G_I: treats in detail a special case of wave packets for which one can calculate exactly the properties and the evolution (some difficulties in the calculation, but conceptually simple).

H_I: STATIONARY STATES OF A PARTICLE IN ONE-DIMENSIONAL SQUARE POTENTIALS

H_I: takes up in a more quantitative way the ideas of § D-2 of chapter I. Strongly recommended, since square potentials are often used to illustrate simply the implications of quantum mechanics (numerous complements and exercises proposed later in this book rely on the results of H_I).

J_I: BEHAVIOR OF A WAVE PACKET AT A POTENTIAL STEP

J_I: more precise study, for a special case, of the quantum behavior of a particle in a square potential. Since the particle is sufficiently well-localized in space (wave packet), one can follow its “motion” (average difficulty; important for the physical interpretation of the results).

K_I: EXERCISES


Complement A_I

ORDER OF MAGNITUDE OF THE WAVELENGTHS
ASSOCIATED WITH MATERIAL PARTICLES


De Broglie's relation:

$$\lambda = \frac{h}{p} = \frac{h}{mv} \tag{1}$$

shows that, for a particle of mass $m$ and speed $v$, the smaller $m$ and $v$, the longer the corresponding wavelength.

To show that the wave properties of matter are impossible to detect in the macroscopic domain, take as an example a dust particle, of diameter 1 μ and mass $m \simeq 10^{-15}$ kg. Even for such a small mass and a speed of $v \simeq 1$ mm/s, formula (1) gives:

$$\lambda \simeq \frac{6.6 \times 10^{-34}}{10^{-15} \times 10^{-3}}\ \text{meter} = 6.6 \times 10^{-16}\ \text{meter} = 6.6 \times 10^{-6}\ \text{Å} \tag{2}$$

Such a wavelength is completely negligible on the scale of the dust particle.

Consider, on the other hand, a thermal neutron, that is, a neutron ($m_n \simeq 1.67 \times 10^{-27}$ kg) with a speed $v$ corresponding to the average thermal energy at the (absolute) temperature $T$. $v$ is given by the relation:

$$\frac{1}{2}\, m_n v^2 = \frac{p^2}{2m_n} = \frac{3}{2}\, kT \tag{3}$$

where $k$ is the Boltzmann constant ($k \simeq 1.38 \times 10^{-23}$ joule/degree). The wavelength which corresponds to such a speed is:

$$\lambda = \frac{h}{p} = \frac{h}{\sqrt{3 m_n kT}} \tag{4}$$

For $T \simeq 300$ °K, we find:

$$\lambda \simeq 1.4\ \text{Å} \tag{5}$$

that is, a wavelength which is of the order of the distance between atoms in a crystal lattice. A beam of thermal neutrons falling on a crystal therefore gives rise to diffraction phenomena analogous to those observed with X-rays.
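The two orders of magnitude just obtained are easy to reproduce. The short sketch below simply evaluates $\lambda = h/mv$ for the dust particle and $\lambda = h/\sqrt{3 m_n kT}$ for the thermal neutron, with the numerical values quoted in the text; the function names are hypothetical.

```python
import math

H = 6.62618e-34   # Planck's constant, joule second
K_B = 1.38e-23    # Boltzmann constant, joule/degree (value used in the text)
M_N = 1.67e-27    # neutron mass, kg (value used in the text)


def de_broglie_wavelength(mass, speed):
    """lambda = h / (m v), in meters."""
    return H / (mass * speed)


def thermal_wavelength(mass, temperature):
    """lambda = h / sqrt(3 m k T), from (1/2) m v^2 = (3/2) k T."""
    return H / math.sqrt(3 * mass * K_B * temperature)


if __name__ == "__main__":
    dust = de_broglie_wavelength(mass=1e-15, speed=1e-3)
    neutron = thermal_wavelength(M_N, 300.0)
    print("dust particle   :", dust, "m")
    print("thermal neutron :", neutron * 1e10, "angstrom")
```

The neutron wavelength comes out slightly above 1.4 Å, comparable to interatomic distances, whereas the dust-particle wavelength is some ten orders of magnitude smaller than the particle itself.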

Let us now examine the order of magnitude of the de Broglie wavelengths associated with electrons ($m_e \simeq 0.9 \times 10^{-30}$ kg). If one accelerates an electron beam through a potential difference $V$ (expressed in volts), one gives the electrons a kinetic energy:

$$E = qV = 1.6 \times 10^{-19}\, V\ \text{joule} \tag{6}$$

($q = 1.6 \times 10^{-19}$ coulomb is the electron charge.) Since $E = \frac{p^2}{2m_e}$, the associated wavelength is equal to:

$$\lambda = \frac{h}{p} = \frac{h}{\sqrt{2 m_e E}} \tag{7}$$

that is, numerically:

$$\lambda = \frac{6.6 \times 10^{-34}}{\sqrt{2 \times 0.9 \times 10^{-30} \times 1.6 \times 10^{-19}\, V}}\ \text{meter} \simeq \frac{12.3}{\sqrt{V}}\ \text{Å} \tag{8}$$

With potential differences of several hundreds of volts, one again obtains wavelengths comparable to those of X-rays, and electron diffraction phenomena can be observed with crystals or crystalline powders.
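The non-relativistic relation $\lambda = h/\sqrt{2 m_e q V}$ can be tabulated for a few accelerating voltages. The sketch below is a minimal illustration with a hypothetical function name; it uses the more precise constants listed at the front of the book, so the numerical coefficient comes out close to 12.3 Å for $V = 1$ volt.

```python
import math

H = 6.62618e-34    # Planck's constant, joule second
M_E = 9.10953e-31  # electron mass, kg
Q = 1.602189e-19   # electron charge, coulomb


def electron_wavelength_angstrom(volts):
    """Non-relativistic de Broglie wavelength (in angstroms) after acceleration through `volts`."""
    momentum = math.sqrt(2 * M_E * Q * volts)
    return H / momentum * 1e10


if __name__ == "__main__":
    for volts in (1.0, 100.0, 400.0, 10000.0):
        print(volts, "V ->", electron_wavelength_angstrom(volts), "angstrom")
```

Because the wavelength varies as $1/\sqrt{V}$, a hundredfold increase in voltage only shortens it by a factor of ten; a few hundred volts already give wavelengths of the order of 1 Å.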

The large accelerators which are currently available are able to impart
considerable energy to particles. This takes us out of the non-relativistic
domain to which we have thus far confined ourselves. For example, electron
beams are easily obtained for which the energy exceeds 1 GeV* = $10^9$ eV
(1 eV = 1 electron-volt = $1.6 \times 10^{-19}$ joule), while the electron rest energy is
equal to $m_e c^2 \simeq 0.5 \times 10^6$ eV. This means that the corresponding speed is very
close to the speed of light $c$. Consequently, the non-relativistic quantum mechanics
which we are studying here does not apply. However, the relations:

$$E = h\nu \tag{9-a}$$

$$\lambda = \frac{h}{p} \tag{9-b}$$

remain valid in the relativistic domain. On the other hand, relation (7) must be
modified since, relativistically, the energy $E$ of a particle of rest mass $m_0$ is
no longer $p^2/2m_0$, but instead:

$$E = \sqrt{p^2 c^2 + m_0^2 c^4} \tag{10}$$

In the example considered above (an electron of energy 1 GeV), $m_e c^2$ is
negligible compared to $E$, and we obtain:

$$\lambda \simeq \frac{hc}{E} = \frac{6.6 \times 10^{-34} \times 3 \times 10^{8}}{1.6 \times 10^{-10}}\ \text{m} \simeq 1.2 \times 10^{-15}\ \text{m} = 1.2\ \text{fermi} \tag{11}$$

(1 fermi = $10^{-15}$ m). With electrons accelerated in this way, one can explore
the structure of atomic nuclei and, in particular, the structure of the proton;
nuclear dimensions are of the order of a fermi.
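
The relativistic estimate can be compared with the wavelength obtained from the full energy-momentum relation. The sketch below assumes that the quoted energy is the total energy $E$ of relation (10) and uses rounded constants; the function names are illustrative.

```python
import math

H = 6.626e-34                    # Planck's constant, joule second
C = 2.998e8                      # speed of light, m/s
EV = 1.602e-19                   # joules per electron-volt
ELECTRON_REST_ENERGY_EV = 0.511e6


def wavelength_from_total_energy(total_energy_ev, rest_energy_ev):
    """lambda = h / p, with p c = sqrt(E**2 - (m0 c**2)**2) from the relativistic energy."""
    pc_ev = math.sqrt(total_energy_ev**2 - rest_energy_ev**2)
    return H * C / (pc_ev * EV)


def ultra_relativistic_wavelength(total_energy_ev):
    """lambda ~ h c / E, valid only when the rest energy is negligible."""
    return H * C / (total_energy_ev * EV)


exact = wavelength_from_total_energy(1.0e9, ELECTRON_REST_ENERGY_EV)
approx = ultra_relativistic_wavelength(1.0e9)
print(f"exact:  {exact / 1.0e-15:.3f} fermi")
print(f"approx: {approx / 1.0e-15:.3f} fermi")
```

At 1 GeV the two values are indistinguishable to the printed precision; at a total energy of 1 MeV, by contrast, $hc/E$ would noticeably underestimate the electron's wavelength.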


Comments:

(i) We want to point out a common error in the calculation of the wavelength
of a material particle of mass $m_0 \neq 0$, whose energy $E$ is known.
This error consists of calculating the frequency $\nu$ using (9-a) and, then,
by analogy with electromagnetic waves, of taking $c/\nu$ for the de Broglie
wavelength. Obviously, the correct reasoning consists of calculating, for
example from (10) (or, in the non-relativistic domain, from the
relation $E = \dfrac{p^2}{2m}$), the momentum $p$ associated with the energy $E$ and then
using (9-b) to find $\lambda$.

(ii) According to (9-a), the frequency $\nu$ depends on the origin chosen for the
energies. The same is true for the phase velocity $V_\varphi = \dfrac{\omega}{k} = \nu\lambda$. Note,
on the other hand, that the group velocity $V_G = \dfrac{d\omega}{dk} = 2\pi \dfrac{d\nu}{dk}$ does not
depend on the choice of the energy origin. This is important in the
physical interpretation of $V_G$.

* Translator's note: In the United States, this unit is sometimes written BeV.


References and suggestions for further reading: 

Wichmann (1.1), chap. 5; Eisberg and Resnick (1.3), § 3.1. 
Complement B_I

CONSTRAINTS IMPOSED BY THE UNCERTAINTY RELATIONS 


1. Macroscopic system 

2. Microscopic system 

We saw in § C-3 of chapter I that the position and momentum of a particle
cannot be simultaneously defined with arbitrary precision: the corresponding
uncertainties $\Delta x$ and $\Delta p$ must satisfy the uncertainty relation:

$$\Delta x \cdot \Delta p \gtrsim \hbar \tag{1}$$

Here we intend to evaluate numerically the importance of this constraint. 
We shall show that it is completely negligible in the macroscopic domain and 
that it becomes, on the other hand, crucial on the microscopic level. 


1. Macroscopic system 

Let us take up again the example of a dust particle (cf. complement A_I),
whose diameter is on the order of 1 μ and whose mass is $m \simeq 10^{-15}$ kg, having
a speed $v = 10^{-3}$ m/sec. Its momentum is then equal to:

$$p = mv \simeq 10^{-18}\ \text{joule sec/m} \tag{2}$$

If its position is measured to within 0.01 μ, for example, the uncertainty $\Delta p$ in
the momentum must satisfy:

$$\Delta p \simeq \frac{\hbar}{\Delta x} \simeq \frac{10^{-34}}{10^{-8}} = 10^{-26}\ \text{joule sec/m} \tag{3}$$

Thus the uncertainty relation introduces practically no restrictions in this
case since, in practice, a momentum measurement device is incapable of attaining
the required relative accuracy of $10^{-8}$.

In quantum terms, the dust particle is described by a wave packet whose group
velocity is $v = 10^{-3}$ m/sec and whose average momentum is $p = 10^{-18}$ joule sec/m.
But one can then choose such a small spatial extension $\Delta x$ and momentum
dispersion $\Delta p$ that they are both totally negligible. The maximum of the wave
packet then represents the position of the dust particle, and its motion is
identical to that of the classical particle.
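
The arithmetic of this estimate is short enough to be written out. The following sketch uses the values quoted in the text and a rounded value of $\hbar$; the variable names are illustrative.

```python
HBAR = 1.055e-34  # reduced Planck constant, joule second

mass = 1.0e-15     # kg
speed = 1.0e-3     # m/s
delta_x = 0.01e-6  # position known to within 0.01 micron, in meters

momentum = mass * speed
delta_p = HBAR / delta_x                # smallest momentum spread allowed by the uncertainty relation
relative_accuracy = delta_p / momentum  # ratio Delta p / p

print(f"p         = {momentum:.1e} joule sec/m")
print(f"Delta p   = {delta_p:.1e} joule sec/m")
print(f"Delta p/p = {relative_accuracy:.1e}")
```

The ratio $\Delta p / p$ is of the order of $10^{-8}$, far below what a momentum measurement on such an object could resolve.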


2. Microscopic system 

Now let us consider an atomic electron. The Bohr model describes it as a
classical particle. The allowed orbits are defined by quantization rules which
are assumed a priori: for example, the radius $r$ of a circular orbit and the
momentum $p = mv$ of the electron travelling in it must satisfy:

$$pr = n\hbar \tag{4}$$

where $n$ is an integer.

For us to be able to speak in this way of an electron trajectory in classical
terms, the uncertainties in its position and momentum must be negligible
compared to $r$ and $p$ respectively:

$$\Delta x \ll r \tag{5-a}$$

$$\Delta p \ll p \tag{5-b}$$

which would mean that:

$$\frac{\Delta x}{r} \cdot \frac{\Delta p}{p} \ll 1 \tag{6}$$

Now the uncertainty relation imposes:

$$\frac{\Delta x}{r} \cdot \frac{\Delta p}{p} \gtrsim \frac{\hbar}{rp} \tag{7}$$

If we use formula (4) to replace $rp$ by $n\hbar$ on the right-hand side, this inequality
can be written as:

$$\frac{\Delta x}{r} \cdot \frac{\Delta p}{p} \gtrsim \frac{1}{n} \tag{8}$$

We then see that (8) is incompatible with (6), unless $n \gg 1$. The uncertainty
relation thus makes us reject the semi-classical picture of the Bohr orbits (see
§ C-2 of chapter VII).


References and suggestions for further reading: 

Bohm (5.1), chap. 5, § 14. 




Complement C_I

THE UNCERTAINTY RELATIONS AND ATOMIC PARAMETERS 


The Bohr orbit has no physical reality when coupled with the uncertainty relations
(cf. complement B_I). Later (chap. VII), we shall study the quantum theory of the
hydrogen atom. We are going to show immediately, however, how the uncertainty
relations enable one to understand the stability of atoms and even to
derive simply the order of magnitude of the dimensions and the energy of the
hydrogen atom in its ground state.

Let us consider, therefore, an electron in the coulomb field of a proton,
which we shall assume to be stationary at the origin of the coordinate system.
When the two particles are separated by a distance $r$, the potential energy
of the electron is:

$$V(r) = -\frac{q^2}{4\pi\varepsilon_0}\,\frac{1}{r} \tag{1}$$

where $q$ is its charge (exactly opposite to that of the proton). We shall set:

$$\frac{q^2}{4\pi\varepsilon_0} = e^2 \tag{2}$$

Assume that the state of the electron is described by a spherically symmetric
wave function whose spatial extent is characterized by $r_0$ (this means that the
probability of presence is practically zero beyond $2r_0$ or $3r_0$). The potential
energy corresponding to this state is then on the order of:

$$V \simeq -\frac{e^2}{r_0} \tag{3}$$

For it to be as low as possible, it is necessary to take $r_0$ as small as possible.
That is, the wave function must be as concentrated as possible about the proton.

But it is also necessary to take the kinetic energy into account. This is
where the uncertainty principle comes in: if the electron is confined within
a volume of linear dimension $r_0$, the uncertainty $\Delta p$ in its momentum is at
least of the order of $\hbar/r_0$. In other words, even if the average momentum is zero,
the kinetic energy $T$ associated with the state under consideration is not zero:

$$T \gtrsim T_{\min} = \frac{1}{2m}(\Delta p)^2 \simeq \frac{\hbar^2}{2 m r_0^2} \tag{4}$$

If we take $r_0$ smaller in order to decrease the potential energy, the minimum
kinetic energy (4) increases.

The lowest total energy compatible with the uncertainty relation is thus
the minimum of the function:

$$E_{\min} = T_{\min} + V = \frac{\hbar^2}{2 m r_0^2} - \frac{e^2}{r_0} \tag{5}$$

This minimum is obtained for:

$$r_0 = a_0 = \frac{\hbar^2}{m e^2} \tag{6}$$

and is equal to:

$$E_0 = -\frac{m e^4}{2\hbar^2} \tag{7}$$

FIGURE 1

Variation with respect to $r_0$ (extension of
the wave function) of the potential energy $V$,
the kinetic energy $T$, and the total energy
$T + V$ of a hydrogen atom. The functions
$T$ and $V$ vary inversely, so the total
energy passes through a minimum value
for some value of $T$ and $V$. The corresponding
value $a_0$ of $r_0$ gives the order of magnitude
of the hydrogen atom's size.


Expression (6) is the one found in the Bohr model for the radius of the 
first orbit, and (7) gives correctly the energy of the ground state of the hydrogen 
atom (see chap. VII; the wave function of the ground state is indeed $e^{-r/a_0}$). Such
quantitative agreement can only be accidental, since we have been reasoning 
on the basis of orders of magnitude. However, the preceding calculation reveals 
an important physical idea : because of the uncertainty relation, the smaller the 
extension of the wave function, the greater the kinetic energy of the electron. 
The ground state of the atom results from a compromise between the kinetic 
energy and the potential energy. 
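
The compromise between kinetic and potential energy can be made concrete by minimizing the total energy numerically. The sketch below assumes the order-of-magnitude energy function $\hbar^2/2mr_0^2 - e^2/r_0$ discussed above, with $e^2 = q^2/4\pi\varepsilon_0$ in SI units and rounded constants; the ternary search and all names are illustrative choices, not part of the text.

```python
import math

HBAR = 1.0546e-34  # reduced Planck constant, joule second
M_E = 9.109e-31    # electron mass, kg
Q = 1.602e-19      # elementary charge, coulomb
EPS0 = 8.854e-12   # vacuum permittivity, farad/m
E2 = Q**2 / (4.0 * math.pi * EPS0)  # e**2 = q**2 / (4 pi eps0), in joule meter


def total_energy(r0):
    """Minimum kinetic energy plus potential energy for a wave function of extent r0."""
    return HBAR**2 / (2.0 * M_E * r0**2) - E2 / r0


def minimize(f, lo, hi, iterations=200):
    """Ternary search for the minimum of a function with a single minimum on [lo, hi]."""
    for _ in range(iterations):
        m1 = lo + (hi - lo) / 3.0
        m2 = hi - (hi - lo) / 3.0
        if f(m1) < f(m2):
            hi = m2
        else:
            lo = m1
    return 0.5 * (lo + hi)


r_min = minimize(total_energy, 1.0e-12, 1.0e-8)
a0 = HBAR**2 / (M_E * E2)              # closed-form position of the minimum
e0 = -M_E * E2**2 / (2.0 * HBAR**2)    # closed-form value of the minimum

print(f"numerical minimum at r0 = {r_min / 1.0e-10:.3f} angstrom")
print(f"closed form a0          = {a0 / 1.0e-10:.3f} angstrom")
print(f"minimum energy          = {total_energy(r_min) / Q:.2f} eV")
```

The numerical minimum lands at about 0.53 Å and about -13.6 eV, in agreement with the closed-form expressions.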

We stress the fact that this compromise, based on the uncertainty relation,
is totally different from what would be expected in classical mechanics. If the
electron moved in a classical circular orbit of radius $r_0$, its potential energy
would be equal to:

$$V_{cl} = -\frac{e^2}{r_0} \tag{8}$$

The corresponding kinetic energy is obtained by equating the electrostatic force
and the centrifugal force*:

$$\frac{e^2}{r_0^2} = m\frac{v^2}{r_0} \tag{9}$$

which gives:

$$T_{cl} = \frac{1}{2} m v^2 = \frac{1}{2}\frac{e^2}{r_0} \tag{10}$$

The total energy would then be equal to:

$$E_{cl} = T_{cl} + V_{cl} = -\frac{1}{2}\frac{e^2}{r_0} \tag{11}$$

The most favorable energetic situation would occur at $r_0 = 0$, which would
give an infinite binding energy. Thus, we can say that it is the uncertainty
relation which enables us to understand, as it were, the existence of atoms.

* In fact, the laws of classical electromagnetism indicate that an accelerated electron radiates,
which already forbids the existence of stable orbits.


References and suggestions for further reading: 

Feynman III (1.2), § 2-4. The same type of reasoning applied to molecules: 
Schiff (1.18), first section of § 49. 
Complement D_I

AN EXPERIMENT ILLUSTRATING THE UNCERTAINTY RELATION 


Young’s double-slit experiment, which we analyzed in § A-2 of chapter I, led us 
to the following conclusions: both wave and particle aspects of light are 
needed to explain the observed phenomena; but they seem to be mutually 
exclusive, in the sense that it is impossible to determine through which slit each 
photon has passed without destroying, by this very operation, the interference pattern. 
The wave and particle aspects are sometimes said to be complementary. 

We are going to consider Young’s double-slit experiment again to demonstrate 
how complementarity and uncertainty relations are intimately related. To try to 
cast doubt on the uncertainty relation, one can imagine more subtle devices 
than the one of chapter I, which used photomultipliers placed behind the slits. 
We shall now analyze one of these devices. 



FIGURE 1

Diagram of a device using a
movable plate $\mathcal{P}$ whose momentum
is measured before and after the
passage of the photon to determine
whether the photon passed through
$F_1$ or through $F_2$ before arriving at
point $M$ on the screen.


Assume that the plate $\mathcal{P}$, in which the slits are pierced, is mounted so
that it can move vertically in the same plane. Thus, it is possible to measure the
vertical momentum transferred to it. Consider (fig. 1) a photon which strikes
the observation screen S at point $M$ (for simplicity, we choose a source $\mathcal{S}$ at infinity).
The momentum of this photon changes when it crosses $\mathcal{P}$. Conservation of
momentum implies that the plate $\mathcal{P}$ absorbs the difference. But the momentum
thus transferred to $\mathcal{P}$ depends on the path of the photon; depending on whether it
passed through $F_1$ or $F_2$, the photon has a momentum of:

$$p_1 = -\frac{h\nu}{c}\sin\theta_1 \tag{1}$$

or:

$$p_2 = -\frac{h\nu}{c}\sin\theta_2 \tag{2}$$

($h\nu/c$ is the photon's momentum; $\theta_1$ and $\theta_2$ are the angles made by $F_1M$ and $F_2M$
with the incident direction.)

We then allow the photons to arrive one by one and gradually construct
the interference pattern on the screen S. For each one, we determine through which
slit it has passed by measuring the momentum acquired by the plate $\mathcal{P}$. It
therefore seems that interference phenomena can still be observed on S although
we know through which slit each photon has passed.

Actually, we shall see that the interference fringes are not visible with this
device. The error in the preceding argument consists of assuming that only the
photons have a quantum character. In reality, it must not be forgotten that
quantum mechanics also applies to the plate $\mathcal{P}$ (macroscopic object). If we want
to know through which hole a photon has passed, the uncertainty $\Delta p$ in the
vertical momentum of $\mathcal{P}$ must be sufficiently small for us to be able to measure
the difference between $p_1$ and $p_2$:

$$\Delta p \ll |p_2 - p_1| \tag{3}$$

But then the uncertainty relation implies that the position of $\mathcal{P}$ is only known
to within $\Delta x$, with:

$$\Delta x \gtrsim \frac{h}{|p_2 - p_1|} \tag{4}$$

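If we designate by $a$ the separation of the two slits and by $d$ the distance
between the plate $\mathcal{P}$ and the screen S, and if we assume that $\theta_1$ and $\theta_2$ are small
($d/a \gg 1$), we find (fig. 1):

$$\sin\theta_1 \simeq \theta_1 \simeq \frac{x - a/2}{d}, \qquad \sin\theta_2 \simeq \theta_2 \simeq \frac{x + a/2}{d} \tag{5}$$

($x$ denotes the position of the point of impact $M$ on S). Formulas (1) and (2)
then give:

$$|p_2 - p_1| \simeq \frac{h\nu}{c}\,|\theta_2 - \theta_1| \simeq \frac{h}{\lambda}\,\frac{a}{d} \tag{6}$$

where $\lambda = \dfrac{c}{\nu}$ is the wavelength of light. Substituting this value into formula (4),
we obtain:

$$\Delta x \gtrsim \frac{\lambda d}{a} \tag{7}$$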



But $\dfrac{\lambda d}{a}$ is precisely the fringe separation we expect to find on S. If the vertical
position of the slits $F_1$ and $F_2$ is defined only to within an uncertainty greater
than the fringe separation, it is impossible to observe the interference pattern.
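
A numerical illustration may help fix the orders of magnitude. The sketch below evaluates the momentum difference, the resulting position uncertainty of the plate, and the fringe separation for one hypothetical geometry; the wavelength, slit separation and screen distance are assumptions chosen for illustration only.

```python
H = 6.626e-34  # Planck's constant, joule second


def fringe_separation(wavelength, slit_separation, screen_distance):
    """Fringe separation lambda * d / a on the screen, in meters."""
    return wavelength * screen_distance / slit_separation


def momentum_difference(wavelength, slit_separation, screen_distance):
    """|p2 - p1| ~ (h / lambda) * (a / d) in the small-angle limit."""
    return H / wavelength * slit_separation / screen_distance


def plate_position_uncertainty(wavelength, slit_separation, screen_distance):
    """Smallest Delta x compatible with resolving p1 from p2: h / |p2 - p1|."""
    return H / momentum_difference(wavelength, slit_separation, screen_distance)


# Hypothetical values: 500 nm light, slits 0.1 mm apart, screen 1 m away.
wavelength, a, d = 500.0e-9, 1.0e-4, 1.0

fringes = fringe_separation(wavelength, a, d)
delta_x = plate_position_uncertainty(wavelength, a, d)
print(f"|p2 - p1|         = {momentum_difference(wavelength, a, d):.2e} joule sec/m")
print(f"fringe separation = {fringes * 1.0e3:.2f} mm")
print(f"plate uncertainty = {delta_x * 1.0e3:.2f} mm")
```

Whatever geometry is chosen, the position uncertainty required to identify the slit equals the fringe separation, which is why the pattern is washed out.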

The preceding discussion clearly shows that it is impossible to construct a
quantum theory which is valid for light and not for material systems without
running into serious contradictions. Thus, in the above example, if we could
treat the plate $\mathcal{P}$ as a classical material system, we could invalidate the
complementarity of the two aspects of light, and, consequently, the quantum theory of
radiation. Inversely, a quantum theory of matter alone would come up against
analogous difficulties. In order to obtain an overall coherence, we must apply
quantum ideas to all physical systems.


References and suggestions for further reading : 

Bohm (5.1), chaps. 5 and 6; Messiah (1.17), chap. IV § III; Schiff (1.18), § 4; 
Jammer (5.12), chaps. 4 and 5; also see reference (5.7). 




Complement E_I

A SIMPLE TREATMENT 

OF A TWO-DIMENSIONAL WAVE PACKET 


1. Introduction 

2. Angular dispersion and lateral dimensions 

3. Discussion. 


1. Introduction 

In § C-2 of chapter I, we studied the shape of one-dimensional wave 
packets, obtained by superposing plane waves which all propagate in the same 
direction [formula (C-7)]. If this direction is that of the Ox axis, the resulting 
function is independent of y and z. It has a finite extension along Ox, but is not 
limited in the perpendicular directions : its value is the same at all points of a 
plane parallel to yOz. 

We intend to examine here another simple type of wave packets: the 
plane waves which we are going to combine have coplanar wave vectors, 
which are (nearly) equal in magnitude but have slightly different directions. The 
goal is to show how the angular dispersion leads to a limitation of the wave 
packet in the directions perpendicular to the average wave vector. 

We saw in § C-2 of chapter I how, by studying the superposition of 
three specific waves of the one-dimensional packet, one can understand the most 
important aspects of the phenomena. In particular, one can find the fundamental 
relation (C-18) of this chapter. We are going to limit ourselves here to a 
simplified model of this type. The generalization of the results which we are 
going to find can be carried out in the same way as in chapter I (see also
complement F_I).


2. Angular dispersion and lateral dimensions 


Consider three plane waves, whose wave vectors $\mathbf{k}_1$, $\mathbf{k}_2$ and $\mathbf{k}_3$ are shown
in figure 1. All three are in the $xOy$ plane; $\mathbf{k}_1$ is directed along $Ox$; $\mathbf{k}_2$ and $\mathbf{k}_3$ are
symmetric with respect to $\mathbf{k}_1$, the angle between each of them and $\mathbf{k}_1$ being $\Delta\theta$,
which we assume to be small. Finally, the projections of $\mathbf{k}_1$, $\mathbf{k}_2$ and $\mathbf{k}_3$ on $Ox$ are
equal:

$$k_{1x} = k_{2x} = k_{3x} = |\mathbf{k}_1| = k \tag{1}$$

The magnitudes of these three vectors differ only by terms which are second
order in $\Delta\theta$, which we shall neglect. Their components along the $Oy$ axis are:

$$k_{1y} = 0, \qquad k_{2y} = -k_{3y} \simeq k\,\Delta\theta \tag{2}$$

We shall choose, as in § C-2 of chapter I, real amplitudes $g(\mathbf{k})$ which satisfy
the relations:

$$g(\mathbf{k}_2) = g(\mathbf{k}_3) = \frac{1}{2}\, g(\mathbf{k}_1) \tag{3}$$



FIGURE 1

The arrangement of the wave vectors $\mathbf{k}_1$, $\mathbf{k}_2$ and $\mathbf{k}_3$
associated with three plane waves which will be superposed
to construct a two-dimensional wave packet.


This model represents schematically a more complex situation, in which one
would have a real wave packet, as in equation (C-6) of chapter I, with the
following characteristics: all the wave vectors are perpendicular to $Oz$ and have
the same projection on $Ox$ (only the component along $Oy$ varies); the function
$|g(\mathbf{k})|$ has, with respect to this single variable $k_y$, the shape shown in figure 2;
its width $\Delta k_y$ is related very simply to the angular dispersion $2\Delta\theta$:

$$\Delta k_y = 2k\,\Delta\theta \tag{4}$$

The superposition of the three waves defined above gives:

$$\begin{aligned}
\psi(x, y) &= \sum_{i=1}^{3} g(\mathbf{k}_i)\, e^{i\mathbf{k}_i \cdot \mathbf{r}} \\
&= g(\mathbf{k}_1)\left[ e^{ikx} + \frac{1}{2}\, e^{i(kx + k\Delta\theta\, y)} + \frac{1}{2}\, e^{i(kx - k\Delta\theta\, y)} \right] \\
&= g(\mathbf{k}_1)\, e^{ikx}\left[ 1 + \cos(k\,\Delta\theta\, y) \right]
\end{aligned} \tag{5}$$

(there is no $z$-dependence, which is why this is called a two-dimensional wave
packet).



FIGURE 2

The three values chosen for $k_y$ represent very
schematically a peaked function $|g(\mathbf{k})|$ (dashed
line).
In order to understand what happens, we can use figure 3, where we represent,
for each of the three components, the successive wave fronts corresponding to phase
differences of $2\pi$. The function $|\psi(x, y)|$ has a maximum at $y = 0$: the three waves
interfere constructively on the $Ox$ axis. When we move away from this axis,
$|\psi(x, y)|$ decreases (the phase shift between the components increases) and goes
to zero at $y = \pm\dfrac{\Delta y}{2}$, where $\Delta y$ is given by:

$$\cos\left(k\,\Delta\theta\,\frac{\Delta y}{2}\right) = -1 \tag{6}$$

that is, for:

$$k\,\Delta\theta\,\Delta y = 2\pi \tag{7}$$

The phases of the $(\mathbf{k}_2)$ and $(\mathbf{k}_3)$ waves are then in opposition with that of the $(\mathbf{k}_1)$
wave (fig. 3). Using (4), we can rewrite (7) in a form which is analogous
to that of relation (C-11) of chapter I:

$$\Delta y \cdot \Delta k_y = 4\pi \tag{8}$$



FIGURE 3

Equal phase planes of the three
waves associated with the three $\mathbf{k}$
vectors of figure 1: these waves are
in phase at $y = 0$, but interfere
destructively at $y = \pm 2\pi/\Delta k_y$.


Thus an angular dispersion of the wave vectors limits the lateral dimensions 
of the wave packets. Quantitatively, this limitation has the form of an uncertainty 
relation [formulas (7) and (8)]. 
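
The three-wave model is simple enough to be evaluated directly. The following sketch builds the superposition with amplitudes in the ratio 1 : 1/2 : 1/2 and checks where its modulus vanishes; the wavelength and the angle are hypothetical values, and the function name is illustrative.

```python
import cmath
import math


def psi(x, y, k, dtheta, g1=1.0):
    """Superposition of three plane waves with amplitudes g1, g1/2, g1/2."""
    waves = [
        (g1, k, 0.0),                # wave vector k1, along Ox
        (g1 / 2.0, k, k * dtheta),   # wave vector k2
        (g1 / 2.0, k, -k * dtheta),  # wave vector k3
    ]
    return sum(g * cmath.exp(1j * (kx * x + ky * y)) for g, kx, ky in waves)


k = 2.0 * math.pi / 1.0e-10  # hypothetical wavelength of 1 angstrom
dtheta = 0.01                # hypothetical half angular dispersion, radians

delta_ky = 2.0 * k * dtheta             # width of the k_y distribution
delta_y = 2.0 * math.pi / (k * dtheta)  # lateral width of the packet

print(f"|psi| on the axis      : {abs(psi(0.3e-10, 0.0, k, dtheta)):.3f}")
print(f"|psi| at y = Delta y/2 : {abs(psi(0.3e-10, delta_y / 2.0, k, dtheta)):.1e}")
print(f"Delta y * Delta k_y    : {delta_y * delta_ky:.4f}")
```

The modulus is maximal on the $Ox$ axis and vanishes at $y = \pm\Delta y/2$, with the product $\Delta y \cdot \Delta k_y$ equal to $4\pi$ independently of the values chosen for $k$ and $\Delta\theta$.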


3. Discussion 

Consider a plane wave with wave vector k propagating along Ox. Any 
attempt to limit its extension perpendicular to Ox causes an angular dispersion 
to appear, that is, transforms it into a wave packet analogous to the ones we are 
studying here. 
Assume, for example, that we place in the path of the plane wave a screen
pierced by a slit of width $\Delta y$. This will give rise to a diffracted wave (cf. fig. 4).
We know that the angular width of the diffraction pattern is given by:

$$2\Delta\theta \simeq \frac{2\lambda}{\Delta y} \tag{9}$$

where $\lambda = \dfrac{2\pi}{k}$ is the incident wavelength. This is indeed the same situation
as above: formulas (7) and (9) are identical.


FIGURE 4

When the uncertainty $\Delta y$ is decreased, the
diffraction of the wave by the diaphragm
increases the uncertainty $\Delta k_y$.




Complement F_I

THE RELATIONSHIP BETWEEN 

ONE- AND THREE-DIMENSIONAL PROBLEMS 


1. Three-dimensional wave packet 

a. Simple case 

b. General case 

2. Justification of one-dimensional models 


The space in which a classical or quantum particle moves is, of course,
three-dimensional. This is why we wrote the Schrödinger equation (D-1) in
chapter I for a wave function $\psi(\mathbf{r})$ which depends on the three components $x$, $y$, $z$
of $\mathbf{r}$. Nevertheless, we have repeatedly used in this chapter a one-dimensional
model, in which only the $x$-variable is considered, without justifying this model
in a very precise way. Therefore, this complement has two purposes: first (§ 1),
to generalize to three dimensions the results given in § C of chapter I; then (§ 2),
to show how one can, in certain cases, rigorously justify the one-dimensional
model.


1. Three-dimensional wave packet 
a. SIMPLE CASE 

Let us begin by considering a very simple case, for which the following
two hypotheses are satisfied:

- the wave packet is free [$V(\mathbf{r}) = 0$] and can therefore be written as
in equation (C-6) of chapter I:

$$\psi(\mathbf{r}, t) = \frac{1}{(2\pi)^{3/2}} \int g(\mathbf{k})\, e^{i[\mathbf{k}\cdot\mathbf{r} - \omega(\mathbf{k})t]}\, d^3k \tag{1}$$

- moreover, the function $g(\mathbf{k})$ is of the form:

$$g(\mathbf{k}) = g_1(k_x) \times g_2(k_y) \times g_3(k_z) \tag{2}$$

Recall the expression for $\omega(\mathbf{k})$ in terms of $\mathbf{k}$:

$$\omega(\mathbf{k}) = \frac{\hbar \mathbf{k}^2}{2m} = \frac{\hbar}{2m}\left(k_x^2 + k_y^2 + k_z^2\right) \tag{3}$$

Substitute (2) and (3) into (1). It is possible to separate the three integrations
with respect to $k_x$, $k_y$ and $k_z$ to obtain:

$$\psi(\mathbf{r}, t) = \psi_1(x, t) \times \psi_2(y, t) \times \psi_3(z, t) \tag{4}$$

with:

$$\psi_1(x, t) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{+\infty} g_1(k_x)\, e^{i[k_x x - \omega(k_x)t]}\, dk_x, \qquad \omega(k_x) = \frac{\hbar k_x^2}{2m} \tag{5}$$

and analogous expressions for $\psi_2(y, t)$ and $\psi_3(z, t)$.

$\psi_1(x, t)$ indeed has the form of a one-dimensional wave packet. In this
particular case, $\psi(\mathbf{r}, t)$ is thus obtained simply by taking the product (4) of
three one-dimensional wave packets, each of which evolves in a totally independent
way.


b. GENERAL CASE 

In the general case, where the potential $V(\mathbf{r})$ is arbitrary, formula (1) is
not valid. It is then useful to introduce the three-dimensional Fourier transform
$g(\mathbf{k}, t)$ of the function $\psi(\mathbf{r}, t)$ by writing:

$$\psi(\mathbf{r}, t) = \frac{1}{(2\pi)^{3/2}} \int g(\mathbf{k}, t)\, e^{i\mathbf{k}\cdot\mathbf{r}}\, d^3k \tag{6}$$

A priori, the $t$-dependence of $g(\mathbf{k}, t)$, which brings in $V(\mathbf{r})$, is arbitrary. Moreover,
there is no reason in general why we should be able to express $g(\mathbf{k}, t)$ in the
form of a product, as in (2). In order to generalize the results of § C-2 of
chapter I, we make the following hypothesis about its $\mathbf{k}$-dependence: $|g(\mathbf{k}, t)|$ is
(at a given time $t$) a function which has a very pronounced peak for values of $\mathbf{k}$
close to $\mathbf{k}_0$ and takes on a negligible value when the tip of $\mathbf{k}$ leaves a domain $D_k$
centered at $\mathbf{k}_0$ and of dimensions $\Delta k_x$, $\Delta k_y$, $\Delta k_z$. As above, we set:

$$g(\mathbf{k}, t) = |g(\mathbf{k}, t)|\, e^{i\alpha(\mathbf{k}, t)} \tag{7}$$

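so that the phase of the wave defined by the vector $\mathbf{k}$ can be written:

$$\xi(\mathbf{k}, \mathbf{r}, t) = \alpha(\mathbf{k}, t) + k_x x + k_y y + k_z z \tag{8}$$

We can set forth an argument similar to that of § C-2 of chapter I. First of all,
the wave packet attains a maximum when all the waves, for which the tip
of $\mathbf{k}$ is in $D_k$, are practically in phase, that is, when $\xi$ varies very little within $D_k$.
In general, $\xi(\mathbf{k}, \mathbf{r}, t)$ can be expanded about $\mathbf{k}_0$. Its variation between $\mathbf{k}_0$ and $\mathbf{k}$ is,
to first order in $\delta\mathbf{k} = \mathbf{k} - \mathbf{k}_0$:

$$\delta\xi(\mathbf{k}, \mathbf{r}, t) \simeq \delta k_x \left[\frac{\partial}{\partial k_x}\,\xi(\mathbf{k}, \mathbf{r}, t)\right]_{\mathbf{k}=\mathbf{k}_0} + \delta k_y \left[\frac{\partial}{\partial k_y}\,\xi(\mathbf{k}, \mathbf{r}, t)\right]_{\mathbf{k}=\mathbf{k}_0} + \delta k_z \left[\frac{\partial}{\partial k_z}\,\xi(\mathbf{k}, \mathbf{r}, t)\right]_{\mathbf{k}=\mathbf{k}_0} \tag{9}$$

that is, more concisely*, using (8):

$$\begin{aligned}
\delta\xi(\mathbf{k}, \mathbf{r}, t) &\simeq \delta\mathbf{k} \cdot \left[\nabla_k\, \xi(\mathbf{k}, \mathbf{r}, t)\right]_{\mathbf{k}=\mathbf{k}_0} \\
&= \delta\mathbf{k} \cdot \left[\mathbf{r} + \left[\nabla_k\, \alpha(\mathbf{k}, t)\right]_{\mathbf{k}=\mathbf{k}_0}\right]
\end{aligned} \tag{10}$$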

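We see from (10) that the variation of $\xi(\mathbf{k}, \mathbf{r}, t)$ within the domain $D_k$ will be
minimal for:

$$\mathbf{r} = \mathbf{r}_M(t) = -\left[\nabla_k\, \alpha(\mathbf{k}, t)\right]_{\mathbf{k}=\mathbf{k}_0} \tag{11}$$

We have seen that, under these conditions, $|\psi(\mathbf{r}, t)|$ is maximum. Relation (11)
therefore defines the position $\mathbf{r}_M(t)$ of the center of the wave packet and constitutes
the generalization to three dimensions of equation (C-15) of chapter I.

In what domain $D_r$, centered at $\mathbf{r}_M$ and of dimensions $\Delta x$, $\Delta y$, $\Delta z$, does the
wave packet (6) take on non-negligible values? $|\psi(\mathbf{r}, t)|$ becomes much smaller
than $|\psi(\mathbf{r}_M, t)|$ when the various $\mathbf{k}$ waves destroy each other by interference,
that is, when the variation of $\xi(\mathbf{k}, \mathbf{r}, t)$ within the domain $D_k$ is of the order
of $2\pi$ (or roughly, of the order of 1 radian). Set $\delta\mathbf{r} = \mathbf{r} - \mathbf{r}_M$; if (11) is taken
into account, relation (10) can be written:

$$\delta\xi(\mathbf{k}, \mathbf{r}, t) \simeq \delta\mathbf{k} \cdot \delta\mathbf{r} \tag{12}$$

The condition $\delta\xi(\mathbf{k}, \mathbf{r}, t) \gtrsim 1$ immediately gives us the relations which exist
between the dimensions of $D_r$ and those of $D_k$:

$$\begin{cases}
\Delta x \cdot \Delta k_x \gtrsim 1 \\
\Delta y \cdot \Delta k_y \gtrsim 1 \\
\Delta z \cdot \Delta k_z \gtrsim 1
\end{cases} \tag{13}$$

The Heisenberg uncertainty relations then follow directly from the relation $\mathbf{p} = \hbar\mathbf{k}$:

$$\begin{cases}
\Delta x \cdot \Delta p_x \gtrsim \hbar \\
\Delta y \cdot \Delta p_y \gtrsim \hbar \\
\Delta z \cdot \Delta p_z \gtrsim \hbar
\end{cases} \tag{14}$$

These inequalities constitute the generalization to three dimensions of (C-23)
of chapter I.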

Finally, note that the group velocity $\mathbf{V}_G$ of the wave packet can be
obtained by differentiating (11) with respect to $t$:

$$\mathbf{V}_G = -\frac{d}{dt}\left[\nabla_k\, \alpha(\mathbf{k}, t)\right]_{\mathbf{k}=\mathbf{k}_0} \tag{15}$$

In the special case of a free wave packet which does not, however, necessarily
satisfy (2), we have:

$$\alpha(\mathbf{k}, t) = \alpha(\mathbf{k}, 0) - \omega(\mathbf{k})\,t \tag{16}$$

where $\omega(\mathbf{k})$ is given by (3). Formula (15) then gives:

$$\mathbf{V}_G = \left[\nabla_k\, \omega(\mathbf{k})\right]_{\mathbf{k}=\mathbf{k}_0} = \frac{\hbar\mathbf{k}_0}{m} \tag{17}$$

which is the generalization of equation (C-31) of chapter I.

* The symbol $\nabla$ designates a "gradient": by definition, $\nabla f(x, y, z)$ is the vector whose
coordinates are $\partial f/\partial x$, $\partial f/\partial y$, $\partial f/\partial z$. The index $k$ in $\nabla_k$ means that, as in (9), the
differentiations must be performed with respect to the variables $k_x$, $k_y$ and $k_z$.
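
The gradient with respect to $\mathbf{k}$ that gives the group velocity of a free wave packet can be checked by finite differences. The sketch below assumes the free-particle dispersion relation $\omega(\mathbf{k}) = \hbar\mathbf{k}^2/2m$ for an electron; the mean wave vector, the step size and the helper names are hypothetical.

```python
HBAR = 1.0546e-34  # reduced Planck constant, joule second
M_E = 9.109e-31    # electron mass, kg


def omega(k):
    """Free-particle dispersion relation omega(k) = hbar k**2 / 2m for a 3-vector k."""
    kx, ky, kz = k
    return HBAR * (kx * kx + ky * ky + kz * kz) / (2.0 * M_E)


def gradient(f, k, step):
    """Central finite-difference gradient of f with respect to (kx, ky, kz)."""
    grad = []
    for i in range(3):
        up, down = list(k), list(k)
        up[i] += step
        down[i] -= step
        grad.append((f(up) - f(down)) / (2.0 * step))
    return grad


k0 = (1.0e10, 2.0e9, 0.0)  # hypothetical mean wave vector, in 1/m
v_group = gradient(omega, k0, step=1.0e6)
v_expected = [HBAR * component / M_E for component in k0]

for axis, numeric, exact in zip("xyz", v_group, v_expected):
    print(f"V_G{axis}: numerical {numeric:.6e} m/s, hbar k0/m {exact:.6e} m/s")
```

Because $\omega$ is quadratic in $\mathbf{k}$, the central difference reproduces $\hbar\mathbf{k}_0/m$ up to rounding error.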
2. Justification of one-dimensional models

When the potential is time-independent, we saw in § D-1 of chapter I
that it is possible to separate the time and space variables in the Schrödinger
equation. This leads to the eigenvalue equation (D-8). We intend to show here
how it is possible, in certain cases, to extend this method further and to separate
as well the $x$, $y$, $z$ variables in (D-8).

Assume that the potential energy $V(\mathbf{r})$ can be written:

$$V(\mathbf{r}) = V(x, y, z) = V_1(x) + V_2(y) + V_3(z) \tag{18}$$

and let us see if there exist solutions of the eigenvalue equation of the form:

$$\varphi(x, y, z) = \varphi_1(x) \times \varphi_2(y) \times \varphi_3(z) \tag{19}$$

An argument analogous to the one set forth in chapter I (§ D-1-a) shows
that this is possible if:

$$\left[-\frac{\hbar^2}{2m}\frac{d^2}{dx^2} + V_1(x)\right]\varphi_1(x) = E_1\,\varphi_1(x) \tag{20}$$

and if we have two other similar equations where $x$ is replaced by $y$ (or $z$),
$V_1$ by $V_2$ (or $V_3$), and $E_1$ by $E_2$ (or $E_3$). In addition, it is also necessary that
the relation:

$$E = E_1 + E_2 + E_3 \tag{21}$$

be satisfied.

Equation (20) is of the same type as (D-8), but in one dimension.
The $x$, $y$ and $z$ variables are separated*.

What happens, for example, if the potential energy $V(\mathbf{r})$ of a particle
depends only on $x$? $V(\mathbf{r})$ can then be written in the form (18), where $V_1 = V$
and $V_2 = V_3 = 0$. Equations (20) in $y$ and $z$ correspond to the case already
studied, in § C-1 of chapter I, of the free particle in one dimension; their
solutions are plane waves $e^{ik_y y}$ and $e^{ik_z z}$. All that remains is to solve equation (20),
which amounts to considering a problem in only one dimension; nevertheless,
the total energy of the particle in three dimensions is now:

$$E = E_1 + \frac{\hbar^2}{2m}\left[k_y^2 + k_z^2\right] \tag{22}$$

The one-dimensional models studied in chapter I thus actually correspond to a
particle in three dimensions moving in a potential $V(\mathbf{r})$ which depends only on $x$.
The solutions $\varphi_2(y)$ and $\varphi_3(z)$ are then very simple and correspond to particles
which are "free along $Oy$" or along $Oz$. This is why we have concentrated
all our attention on the study of the $x$-equation.


* It can be shown (cf. chap. II, § F-4-a-β) that, when $V(\mathbf{r})$ has the form (18), all the solutions
of the eigenvalue equation (D-8) are linear combinations of those we find here.
Complement G_I

ONE-DIMENSIONAL GAUSSIAN WAVE PACKET:
SPREADING OF THE WAVE PACKET


1. Definition of a gaussian wave packet

2. Calculation of $\Delta x$ and $\Delta p$; uncertainty relation

3. Evolution of the wave packet

a. Calculation of $\psi(x, t)$

b. Velocity of the wave packet

c. Spreading of the wave packet


In this complement, we intend to study a particular (one-dimensional) free 
wave packet, for which the function g(k) is gaussian. The reason why this 
example is interesting lies in the fact that the calculations can be carried out 
exactly and to the very end. Thus, we can first verify, in this special case, 
the various properties of wave packets which we pointed out in § C of chapter I. 
We shall then use these properties to study the variation in time of the width 
of this wave packet and to reveal the phenomenon of spreading over time. 


1. Definition of a gaussian wave packet 


Consider, in a one-dimensional model, a free particle [$V(x) = 0$] whose
wave function at time $t = 0$ is:

$$\psi(x, 0) = \frac{\sqrt{a}}{(2\pi)^{3/4}} \int_{-\infty}^{+\infty} e^{-\frac{a^2}{4}(k - k_0)^2}\, e^{ikx}\, dk \tag{1}$$

This wave packet is obtained by superposing plane waves $e^{ikx}$ with the
coefficients:

$$\frac{1}{\sqrt{2\pi}}\, g(k, 0) = \frac{\sqrt{a}}{(2\pi)^{3/4}}\, e^{-\frac{a^2}{4}(k - k_0)^2} \tag{2}$$

which correspond to a gaussian function centered at $k = k_0$ (and multiplied
by a numerical coefficient which normalizes the wave function). This is why
the wave packet (1) is called gaussian.

In the calculations which follow, we shall repeatedly come upon integrals
of the type:

$$I(\alpha, \beta) = \int_{-\infty}^{+\infty} e^{-\alpha^2(\xi + \beta)^2}\, d\xi \tag{3}$$

where $\alpha$ and $\beta$ are complex numbers [for the integral (3) to converge, we
must have $\text{Re}\,\alpha^2 > 0$]. The method of residues enables us to show that this
integral does not depend on $\beta$:

$$I(\alpha, \beta) = I(\alpha, 0) \tag{4}$$

and that, when the condition $-\pi/4 < \text{Arg}\,\alpha < +\pi/4$ is fulfilled (which is always
possible if $\text{Re}\,\alpha^2 > 0$), $I(\alpha, 0)$ is given by:

$$I(\alpha, 0) = \frac{1}{\alpha}\, I(1, 0) \tag{5}$$

Now all that remains is to evaluate $I(1, 0)$, which can be done classically, through
a double integration in the $xOy$ plane and a change into polar coordinates:

$$I(1, 0) = \int_{-\infty}^{+\infty} e^{-\xi^2}\, d\xi = \sqrt{\pi} \tag{6}$$

Thus we have:

$$\int_{-\infty}^{+\infty} e^{-\alpha^2(\xi + \beta)^2}\, d\xi = \frac{\sqrt{\pi}}{\alpha} \tag{7}$$

with: $-\pi/4 < \text{Arg}\,\alpha < +\pi/4$.

Let us now calculate $\psi(x, 0)$. To do this, let us group, in the exponents
of (1), the $k$-dependent terms into a perfect square, by writing them in the
form:

$$-\frac{a^2}{4}(k - k_0)^2 + ikx = -\frac{a^2}{4}\left[k - k_0 - \frac{2ix}{a^2}\right]^2 + ik_0 x - \frac{x^2}{a^2} \tag{8}$$

We can then use (7), which yields:

$$\psi(x, 0) = \left(\frac{2}{\pi a^2}\right)^{1/4} e^{ik_0 x}\, e^{-x^2/a^2} \tag{9}$$

We find, as could be expected, that the Fourier transform of a gaussian
function is also gaussian (cf. appendix I).

At time $t = 0$, the probability density of the particle is therefore given by:

$$|\psi(x, 0)|^2 = \sqrt{\frac{2}{\pi a^2}}\; e^{-2x^2/a^2} \tag{10}$$

The curve which represents $|\psi(x, 0)|^2$ is the familiar bell-shaped curve.
The center of the wave packet [the maximum of $|\psi(x, 0)|^2$] is situated at the
point $x = 0$. This is indeed what we could have found if we had applied the
general formula (C-16) of chapter I since, in this particular case, the function
$g(k)$ is real.
2. Calculation of $\Delta x$ and $\Delta p$; uncertainty relation

It is convenient, when one is studying a gaussian function $f(x) = e^{-x^2/b^2}$, to
define its width $\Delta x$ precisely by:

$$\Delta x = \frac{b}{\sqrt{2}} \tag{11}$$

When $x$ varies from 0 to $\pm\Delta x$, $f(x)$ is reduced by a factor of $1/\sqrt{e}$. This
definition, which is, of course, arbitrary, has the advantage of coinciding with
that of the "root-mean-square deviation" of the $x$ variable (cf. chap. III, § C-5).

With this convention, we can calculate the width $\Delta x$ of the wave packet (10),
which is equal to:

$$\Delta x = \frac{a}{2} \tag{12}$$

We can proceed in the same way to calculate the width $\Delta k$, since $|g(k, 0)|^2$
is also a gaussian function. This gives:

$$\Delta k = \frac{1}{a} \tag{13-a}$$

or:

$$\Delta p = \frac{\hbar}{a} \tag{13-b}$$

Thus we obtain:

$$\Delta x \cdot \Delta p = \frac{\hbar}{2} \tag{14}$$

a result which is entirely compatible with Heisenberg's uncertainty relation.
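
These widths can be confirmed by computing root-mean-square deviations numerically. The sketch below works in units where $\hbar = 1$ and assumes the gaussian densities $e^{-2x^2/a^2}$ in position and $e^{-a^2(k-k_0)^2/2}$ in wave number; the values of $a$ and $k_0$, the integration grid and the function name are arbitrary illustrative choices.

```python
import math


def rms_width(density, lo, hi, n=20001):
    """Root-mean-square deviation of a (possibly unnormalized) density sampled on [lo, hi]."""
    step = (hi - lo) / (n - 1)
    points = [lo + i * step for i in range(n)]
    weights = [density(p) for p in points]
    norm = sum(weights)
    mean = sum(w * p for w, p in zip(weights, points)) / norm
    variance = sum(w * (p - mean) ** 2 for w, p in zip(weights, points)) / norm
    return math.sqrt(variance)


a = 2.0   # arbitrary length scale of the packet
k0 = 5.0  # arbitrary mean wave number

# Position density |psi(x, 0)|**2 and wave-number density |g(k, 0)|**2, up to constant factors.
delta_x = rms_width(lambda x: math.exp(-2.0 * x * x / a**2), -10.0 * a, 10.0 * a)
delta_k = rms_width(lambda k: math.exp(-(a**2 / 2.0) * (k - k0) ** 2), k0 - 10.0 / a, k0 + 10.0 / a)

print(f"Delta x = {delta_x:.6f}  (a / 2 = {a / 2.0})")
print(f"Delta k = {delta_k:.6f}  (1 / a = {1.0 / a})")
print(f"Delta x * Delta k = {delta_x * delta_k:.6f}")
```

With $\hbar = 1$ the product $\Delta x \cdot \Delta k$ is the same as $\Delta x \cdot \Delta p$, and it comes out equal to 1/2 whatever value of $a$ is chosen.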


3. Evolution of the wave packet 


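a. CALCULATION OF $\psi(x, t)$

In order to calculate the wave function $\psi(x, t)$ at time t, all we need to do
is use the general formula (C-6) of chapter I, which gives the wave function
of a free particle; we obtain:

$$\psi(x, t) = \frac{\sqrt{a}}{(2\pi)^{3/4}} \int_{-\infty}^{+\infty} e^{-\frac{a^2}{4}(k - k_0)^2}\, e^{i[kx - \omega(k)t]}\, dk \qquad (15)$$

with $\omega(k) = \dfrac{\hbar k^2}{2m}$ (dispersion relation for a free particle). We shall see that at
time t, the wave packet still remains gaussian. Expression (15) can be transformed
by grouping, as above, all the k-dependent terms in the exponents into a perfect
square. We can then use (7), and we find:

$$\psi(x, t) = \left(\frac{2a^2}{\pi}\right)^{1/4} \frac{e^{i\varphi}}{\left(a^4 + \dfrac{4\hbar^2 t^2}{m^2}\right)^{1/4}}\; e^{ik_0 x}\, \exp\left\{-\frac{\left[x - \dfrac{\hbar k_0}{m} t\right]^2}{a^2 + \dfrac{2i\hbar t}{m}}\right\} \qquad (16\text{-a})$$

where $\varphi$ is real and independent of x:

$$\varphi = -\theta - \frac{\hbar k_0^2}{2m}\, t \quad \text{with} \quad \tan 2\theta = \frac{2\hbar t}{m a^2} \qquad (16\text{-b})$$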


Let us calculate the probability density $|\psi(x, t)|^2$ of the particle at time t. We
obtain:

$$|\psi(x, t)|^2 = \sqrt{\frac{2}{\pi a^2}}\; \frac{1}{\sqrt{1 + \dfrac{4\hbar^2 t^2}{m^2 a^4}}}\; \exp\left\{-\frac{2a^2 \left(x - \dfrac{\hbar k_0}{m} t\right)^2}{a^4 + \dfrac{4\hbar^2 t^2}{m^2}}\right\} \qquad (17)$$

Let us show that the norm of the wave packet, $\int_{-\infty}^{+\infty} |\psi(x, t)|^2\, dx$, is not
time-dependent (we shall see in chapter III that this property results from the
fact that the Hamiltonian H of the particle is Hermitian). We could, to this end,
use (7) again in order to integrate expression (17) from $-\infty$ to $+\infty$. It is
quicker to observe from expression (15) that the Fourier transform of $\psi(x, t)$
is given by:

$$g(k, t) = e^{-i\omega(k)t}\, g(k, 0) \qquad (18)$$

g(k, t) therefore obviously has the same norm as g(k, 0). Now the Parseval-Plancherel
equation tells us that $\psi(x, t)$ and g(k, t) have the same norm, as do $\psi(x, 0)$ and g(k, 0).
From this we deduce that $\psi(x, t)$ has the same norm as $\psi(x, 0)$.
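
The conservation of the norm can also be checked numerically by integrating the density (17) at several times. The following is a minimal sketch in units where $\hbar = m = 1$; the parameter values and the helper names `density` and `norm` are assumptions made for the illustration.

```python
import math

def density(x, t, a=1.0, k0=2.0, hbar=1.0, m=1.0):
    """Probability density (17) of the free gaussian wave packet."""
    s = 1.0 + 4.0 * hbar**2 * t**2 / (m**2 * a**4)
    centre = hbar * k0 * t / m
    prefactor = math.sqrt(2.0 / (math.pi * a**2)) / math.sqrt(s)
    return prefactor * math.exp(-2.0 * (x - centre) ** 2 / (a**2 * s))

def norm(t, a=1.0, k0=2.0, hbar=1.0, m=1.0, n=4001):
    """Trapezoid-rule integral of (17) over 12 widths on each side of the centre."""
    half = 12.0 * 0.5 * a * math.sqrt(1.0 + 4.0 * hbar**2 * t**2 / (m**2 * a**4))
    centre = hbar * k0 * t / m
    h = 2.0 * half / (n - 1)
    total = 0.0
    for i in range(n):
        w = 0.5 if i in (0, n - 1) else 1.0
        total += w * density(centre - half + i * h, t, a, k0, hbar, m)
    return total * h

norms = [norm(t) for t in (-3.0, 0.0, 1.0, 5.0)]
print(norms)   # each entry should be very close to 1
```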

b. VELOCITY OF THE WAVE PACKET 

We see in (17) that the probability density $|\psi(x, t)|^2$ is a gaussian function,
centered at $x = V_0 t$, where the velocity $V_0$ is defined by:

$$V_0 = \frac{\hbar k_0}{m} \qquad (19)$$

We could have expected this result, in view of the general expression (C-32)
of chapter I, which gives the group velocity $V_G$.

c. SPREADING OF THE WAVE PACKET 

Let us take up formula (17) again. The width $\Delta x(t)$ of the wave packet at
time t, from definition (11), is equal to:

$$\Delta x(t) = \frac{a}{2} \sqrt{1 + \frac{4\hbar^2 t^2}{m^2 a^4}} \qquad (20)$$

We see (cf. fig. 1) that the evolution of the wave packet is not confined to
a simple displacement at a velocity $V_0$. The wave packet also undergoes a
deformation. When t increases from $-\infty$ to 0, the width of the wave packet
decreases, reaching a minimum at t = 0. Then, as t continues to increase,
$\Delta x(t)$ grows without bound (spreading of the wave packet).
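
To get a feeling for the time scale of the spreading, formula (20) can be evaluated directly. The sketch below is illustrative only: the function names, the choice a = 1 Å for an electron and the rounded SI constants are assumptions made here.

```python
import math

def width(t, a, hbar=1.0, m=1.0):
    """Width (20) of the free gaussian packet at time t."""
    return 0.5 * a * math.sqrt(1.0 + 4.0 * hbar**2 * t**2 / (m**2 * a**4))

def spreading_time(a, hbar=1.0, m=1.0):
    """Time at which the width has grown by a factor sqrt(2): 2 hbar t / (m a^2) = 1."""
    return m * a**2 / (2.0 * hbar)

# Electron with a = 1 angstrom, SI units (constants rounded; assumed values)
HBAR, M_E = 1.054571817e-34, 9.1093837e-31
t_e = spreading_time(1e-10, HBAR, M_E)
print(t_e)                                                  # seconds
print(width(t_e, 1e-10, HBAR, M_E) / width(0.0, 1e-10, HBAR, M_E))
```

The width is even in t, so the same function describes the contraction for t < 0 and the spreading for t > 0.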



FIGURE 1 

For negative t, the gaussian wave packet decreases in width as it propagates. At time t = 0, it is a
"minimum" wave packet: the product $\Delta x \cdot \Delta p$ is equal to $\hbar/2$. Then, for t > 0, the wave packet
spreads again as it propagates.


It can be seen in (17) that the height of the wave packet also varies,
but in opposition to the width, so the norm of $\psi(x, t)$ remains constant.

The properties of the function g(k, t) are completely different. In fact
[cf. formula (18)]:

$$|g(k, t)| = |g(k, 0)| \qquad (21)$$

Therefore, the average momentum of the wave packet ($\hbar k_0$) and its momentum
dispersion ($\hbar \Delta k$) do not vary in time. We shall see later (cf. chap. III) that this
arises from the fact that the momentum is a constant of the motion for a free particle.
Physically, it is clear that since the free particle encounters no obstacle, the
momentum distribution cannot change.

The existence of a momentum dispersion $\Delta p = \hbar \Delta k = \hbar/a$ means that the
velocity of the particle is only known to within $\Delta v = \dfrac{\Delta p}{m} = \dfrac{\hbar}{ma}$. Imagine a
group of classical particles starting at time t = 0 from the point x = 0, with
a velocity dispersion equal to $\Delta v$. At time t, the dispersion of their positions
will be $\delta x_{cl} = \Delta v\, |t| = \dfrac{\hbar |t|}{ma}$; this dispersion increases linearly with t, as shown in
figure 2. Let us draw on the same graph the curve which gives the evolution in
time of $\Delta x(t)$: when t becomes infinite, $\Delta x(t)$ practically coincides with $\delta x_{cl}$ [the
branch of the hyperbola which represents $\Delta x(t)$ has for its asymptotes the straight
lines which correspond to $\delta x_{cl}$]. Thus, we can say that, when t is very large,
there exists a quasi-classical interpretation of the width $\Delta x$. On the other hand,
when t approaches 0, $\Delta x(t)$ takes on values which differ more and more
from $\delta x_{cl}$. The quantum particle must indeed constantly satisfy Heisenberg’s
uncertainty relation $\Delta x \cdot \Delta p \geq \hbar/2$ which, since $\Delta p$ is fixed, imposes a lower limit
on $\Delta x$. This corresponds to what can be seen in figure 2.
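
The relation between the two curves of figure 2 can be made explicit: squaring (20) gives $\Delta x(t)^2 = (a/2)^2 + \delta x_{cl}^2$, which is the equation of a hyperbola with asymptotes $\pm\delta x_{cl}$. The short sketch below tabulates both quantities in units where $\hbar = m = 1$; the function names and sample times are assumptions made for the illustration.

```python
import math

def quantum_width(t, a, hbar=1.0, m=1.0):
    """Width (20) of the gaussian wave packet."""
    return 0.5 * a * math.sqrt(1.0 + 4.0 * hbar**2 * t**2 / (m**2 * a**4))

def classical_spread(t, a, hbar=1.0, m=1.0):
    """delta x_cl = Delta v |t| with Delta v = hbar / (m a)."""
    return hbar * abs(t) / (m * a)

a = 1.0
for t in (0.0, 0.5, 5.0, 50.0):
    q, c = quantum_width(t, a), classical_spread(t, a)
    # q**2 - c**2 stays equal to (a/2)**2 at all times
    print(t, q, c, q**2 - c**2)
```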



FIGURE 2

Variation in time of the width $\Delta x$ of the wave packet of figure 1. For large t, $\Delta x$ approaches the
dispersion $\delta x_{cl}$ of the positions of a group of classical particles which left x = 0 at time t = 0 with a
velocity dispersion $\Delta p/m$.


COMMENTS : 

(i) The spreading of the packet of free waves is a general phenomenon
which is not limited to the special case studied here. It can be shown
that, for an arbitrary free wave packet, the variation in time of its
width has the shape shown in figure 2 (cf. exercise 4 of complement $L_{III}$).

(ii) In chapter I, a simple argument led us in (C-17) to $\Delta x \cdot \Delta k \simeq 1$, without making any
particular hypothesis about g(k), except for saying that g(k) has a peak of width $\Delta k$
whose shape is that of figure 3 of chapter I (which is indeed the case in this complement).
Then how did we obtain $\Delta x \cdot \Delta k \gg 1$ (for example, for a gaussian wave packet when t
is large)?

Of course, this is only an apparent contradiction. In chapter I, in order to
find $\Delta x \cdot \Delta k \simeq 1$, we assumed in (C-13) that the argument $\alpha(k)$ of g(k) could be
approximated by a linear function in the domain $\Delta k$. Thus we implicitly assumed a supplementary
hypothesis: that the nonlinear terms make a negligible contribution to the phase of g(k)
in the domain $\Delta k$. For example, for the terms which are of second order in $(k - k_0)$,
it is necessary that:

$$\Delta k^2 \left| \frac{d^2\alpha}{dk^2} \right|_{k = k_0} \ll 2\pi \qquad (22)$$

If, on the contrary, the phase $\alpha(k)$ cannot be approximated in the domain $\Delta k$ by a linear
function with an error much smaller than $2\pi$, we find when we return to the argument
of chapter I that the wave packet is larger than was predicted by (C-17).

In the case of the gaussian wave packet studied in this complement, we have $\Delta k \simeq \dfrac{1}{a}$
and $\alpha(k) = -\dfrac{\hbar k^2}{2m}\, t$. Consequently, condition (22) can be written
$\left(\dfrac{1}{a}\right)^2 \dfrac{\hbar |t|}{m} \ll 2\pi$. Indeed,
we can verify from (20) that, as long as this condition is fulfilled, the product $\Delta x \cdot \Delta k$ is
approximately equal to 1.




Complement $H_I$

STATIONARY STATES OF A PARTICLE
IN ONE-DIMENSIONAL SQUARE POTENTIALS


1. Behavior of a stationary wave function $\varphi(x)$
    a. Regions of constant potential energy
    b. Behavior of $\varphi(x)$ at a potential energy discontinuity
    c. Outline of the calculation
2. Study of certain simple cases
    a. Potential steps
    b. Potential barriers
    c. Bound states; square well potentials


We saw in chapter I (cf. § D-2) the interest in studying the motion of a particle 
in a “square potential” whose rapid spatial variations for certain values of x 
introduce purely quantum effects. The shape of the wave functions associated with 
the stationary states of the particle was predicted by considering an optical 
analogy which enabled us to understand very simply how these new physical 
effects appear. 

In this complement, we outline the quantitative calculation of the stationary 
states of the particle. We shall give the results of this calculation for a certain 
number of simple cases, and discuss their physical implications. We limit ourselves 
to one-dimensional models (cf. complement $F_I$).


1. Behavior of a stationary wave function $\varphi(x)$


a. REGIONS OF CONSTANT POTENTIAL ENERGY

In the case of a square potential, V(x) is a constant function V(x) = V
in certain regions of space. In such a region, equation (D-8) of chapter I
can be written:

$$\frac{d^2}{dx^2}\varphi(x) + \frac{2m}{\hbar^2}(E - V)\,\varphi(x) = 0 \qquad (1)$$

We shall distinguish between several cases:

(i) E > V

Let us introduce the positive constant k, defined by:

$$E - V = \frac{\hbar^2 k^2}{2m} \qquad (2)$$

The solution of equation (1) can then be written:

$$\varphi(x) = A\, e^{ikx} + A'\, e^{-ikx} \qquad (3)$$

where A and A' are complex constants.

(ii) E < V

This condition corresponds to regions of space which would be forbidden
to the particle by the laws of classical mechanics. In this case, we introduce
the positive constant $\rho$ defined by:

$$V - E = \frac{\hbar^2 \rho^2}{2m} \qquad (4)$$

and the solution of (1) can be written:

$$\varphi(x) = B\, e^{\rho x} + B'\, e^{-\rho x} \qquad (5)$$

where B and B' are complex constants.

(iii) E = V. In this special case, $\varphi(x)$ is a linear function of x.


b. BEHAVIOR OF $\varphi(x)$ AT A POTENTIAL ENERGY DISCONTINUITY

How does the wave function behave at a point $x = x_1$ where the potential
V(x) is discontinuous? It might be thought that at this point the wave function $\varphi(x)$
would behave strangely, becoming itself discontinuous, for example. The aim
of this section is to show that this is not the case: $\varphi(x)$ and $d\varphi/dx$ are continuous,
and it is only the second derivative $d^2\varphi/dx^2$ that is discontinuous at $x = x_1$.

Without giving a rigorous proof, let us try to understand this property. To do this,
recall that a square potential must be considered (cf. chap. I, § D-2-a) as the limit, when $\varepsilon \to 0$,
of a potential $V_\varepsilon(x)$ equal to V(x) outside the interval $[x_1 - \varepsilon, x_1 + \varepsilon]$, and varying
continuously within this interval. Then consider the equation:

$$\frac{d^2}{dx^2}\varphi_\varepsilon(x) + \frac{2m}{\hbar^2}\left[E - V_\varepsilon(x)\right]\varphi_\varepsilon(x) = 0 \qquad (6)$$

where $V_\varepsilon(x)$ is assumed to be bounded, independently of $\varepsilon$, within the interval $[x_1 - \varepsilon, x_1 + \varepsilon]$.
Choose a solution $\varphi_\varepsilon(x)$ which, for $x < x_1 - \varepsilon$, coincides with a given solution of (1). The
problem is to show that, when $\varepsilon \to 0$, $\varphi_\varepsilon(x)$ tends towards a function $\varphi(x)$ which is continuous
and differentiable at $x = x_1$. Let us grant that $\varphi_\varepsilon(x)$ remains bounded, whatever the value of $\varepsilon$,
in the neighborhood of $x = x_1$*. Physically, this means that the probability density remains
finite. Then, integrating (6) between $x_1 - \eta$ and $x_1 + \eta$, we obtain:

$$\frac{d\varphi_\varepsilon}{dx}(x_1 + \eta) - \frac{d\varphi_\varepsilon}{dx}(x_1 - \eta) = \frac{2m}{\hbar^2} \int_{x_1 - \eta}^{x_1 + \eta} \left[V_\varepsilon(x) - E\right]\varphi_\varepsilon(x)\, dx \qquad (7)$$

At the limit where $\varepsilon \to 0$, the function to be integrated on the right-hand side of this
expression remains bounded, owing to our previous assumption. Consequently, if $\eta$ tends
towards zero, the integral also tends towards zero, and:

$$\frac{d\varphi}{dx}(x_1 + \eta) - \frac{d\varphi}{dx}(x_1 - \eta) \xrightarrow[\eta \to 0]{} 0 \qquad (8)$$

Thus, at this limit, $d\varphi/dx$ is continuous at $x = x_1$, and so is $\varphi(x)$ (since it is the integral of a
continuous function). On the other hand, $d^2\varphi/dx^2$ is discontinuous, and, as can be seen directly
from (1), makes a jump at $x = x_1$ which is equal to $\dfrac{2m}{\hbar^2}\,\varphi(x_1)\,\sigma_V$ [where $\sigma_V$ represents the change
in V(x) at $x = x_1$].

* This point could be proved mathematically from the properties of the differential equation (1).

COMMENT:

It is essential, in the preceding argument, that $V_\varepsilon(x)$ remain bounded.
In certain exercises of complement $K_I$, for example, the case is considered
for which $V(x) = a\,\delta(x)$, an unbounded function whose integral remains
finite. In this case, $\varphi(x)$ remains continuous, but $d\varphi/dx$ does not.

c. OUTLINE OF THE CALCULATION 

The procedure for determining the stationary states in a “square potential”
is therefore the following: in all regions where V(x) is constant, write $\varphi(x)$ in
whichever of the two forms (3) or (5) is applicable; then “match” these functions
by requiring the continuity of $\varphi(x)$ and of $d\varphi/dx$ at the points where V(x) is
discontinuous.


2. Examination of certain simple cases 

Let us now carry out the quantitative calculation of the stationary states, 
performed according to the method described above, for all the forms of V(x) 
considered in § D-2-c of chapter I. Thus we shall verify that the form of the 
solutions is indeed the one predicted by the optical analogy. 

a. POTENTIAL STEPS 



FIGURE 1

Potential step.


α. Case where $E > V_0$; partial reflection

Set:

$$\sqrt{\frac{2mE}{\hbar^2}} = k_1 \qquad (9)$$

$$\sqrt{\frac{2m(E - V_0)}{\hbar^2}} = k_2 \qquad (10)$$

The solution of (1) has the form (3) in the two regions I (x < 0) and II (x > 0):

$$\varphi_I(x) = A_1\, e^{ik_1 x} + A_1'\, e^{-ik_1 x} \qquad (11)$$

$$\varphi_{II}(x) = A_2\, e^{ik_2 x} + A_2'\, e^{-ik_2 x} \qquad (12)$$

Since equation (1) is homogeneous, the calculation method of § 1-c can only
enable us to determine the ratios $A_1'/A_1$, $A_2/A_1$ and $A_2'/A_1$. In fact, the two
matching conditions at x = 0 do not suffice for the determination of these three
ratios. This is why we shall choose $A_2' = 0$, which amounts to limiting ourselves
to the case of an incident particle coming from $x = -\infty$. The matching
conditions then give:

$$\frac{A_1'}{A_1} = \frac{k_1 - k_2}{k_1 + k_2} \qquad (13)$$

$$\frac{A_2}{A_1} = \frac{2k_1}{k_1 + k_2} \qquad (14)$$

$\varphi_I(x)$ is the superposition of two waves. The first one (the term in $A_1$)
corresponds to an incident particle, with momentum $p = \hbar k_1$, propagating from
left to right. The second one (the term in $A_1'$) corresponds to a reflected particle,
with momentum $-\hbar k_1$, propagating in the opposite direction. Since we have
chosen $A_2' = 0$, $\varphi_{II}(x)$ consists of only one wave, which is associated with
a transmitted particle. We shall see in chapter III (cf. § D-1-c-β) how it is
possible, using the concept of a probability current, to define the transmission
coefficient T and the reflection coefficient R of the potential step (see also
§ 2 of complement $B_{III}$). These coefficients give the probability for the particle,
arriving from $x = -\infty$, to pass the potential step at x = 0 or to turn back.
Thus we find:

$$R = \left|\frac{A_1'}{A_1}\right|^2 \qquad (15)$$

and, for T*:

$$T = \frac{k_2}{k_1} \left|\frac{A_2}{A_1}\right|^2 \qquad (16)$$

Taking (13) and (14) into account, we then have:

$$R = 1 - \frac{4k_1 k_2}{(k_1 + k_2)^2} \qquad (17)$$

$$T = \frac{4k_1 k_2}{(k_1 + k_2)^2} \qquad (18)$$

It is easy to verify that R + T = 1: it is certain that the particle will be either
transmitted or reflected. Contrary to the predictions of classical mechanics,
the incident particle has a non-zero probability of turning back. This point
was explained in chapter I, using the optical analogy and considering the reflection
of a light wave from a plane interface (with $n_1 > n_2$). Furthermore, we know that
in optics, no phase delay is created by such a reflection; equations (13) and (14)
do indeed show that the ratios $A_1'/A_1$ and $A_2/A_1$ are real. Therefore, the quantum
particle is not slowed down by its reflection or transmission (cf. complement $J_I$,
§ 2). Finally, it is easy to verify, using (9), (10) and (18), that, if $E \gg V_0$, $T \simeq 1$:
when the energy of the particle is sufficiently large compared to the height
of the potential step, the particle clears this step as if it did not exist.

* The physical origin of the factor $k_2/k_1$ which appears in T is discussed in § 2 of complement $J_I$.
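
Formulas (17) and (18) are easy to evaluate numerically. The sketch below is an illustration under stated assumptions: the function name `step_R_T` is introduced here, and the common factor $2m/\hbar^2$ is dropped because only the ratio $k_2/k_1$ enters R and T.

```python
import math

def step_R_T(E, V0):
    """Reflection and transmission coefficients (17)-(18) for E > V0.

    E and V0 must be given in the same energy units.
    """
    if E <= V0 or E <= 0:
        raise ValueError("this sketch covers only E > V0 with E > 0")
    k1 = math.sqrt(E)        # proportional to k1 of (9)
    k2 = math.sqrt(E - V0)   # proportional to k2 of (10)
    R = ((k1 - k2) / (k1 + k2)) ** 2
    T = 4.0 * k1 * k2 / (k1 + k2) ** 2
    return R, T

for E in (1.01, 2.0, 10.0, 1000.0):
    R, T = step_R_T(E, 1.0)
    print(E, R, T, R + T)   # R + T should equal 1; T grows towards 1 with E
```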


β. Case where $E < V_0$; total reflection

We then replace (10) and (12) by:

$$\sqrt{\frac{2m(V_0 - E)}{\hbar^2}} = \rho_2 \qquad (19)$$

$$\varphi_{II}(x) = B_2\, e^{\rho_2 x} + B_2'\, e^{-\rho_2 x} \qquad (20)$$

For the solution to remain bounded when $x \to +\infty$, it is necessary that:

$$B_2 = 0 \qquad (21)$$

The matching conditions at x = 0 give in this case:

$$\frac{A_1'}{A_1} = \frac{k_1 - i\rho_2}{k_1 + i\rho_2} \qquad (22)$$

$$\frac{B_2'}{A_1} = \frac{2k_1}{k_1 + i\rho_2} \qquad (23)$$

The reflection coefficient R is then equal to:

$$R = \left|\frac{A_1'}{A_1}\right|^2 = \left|\frac{k_1 - i\rho_2}{k_1 + i\rho_2}\right|^2 = 1 \qquad (24)$$

As in classical mechanics, the particle is always reflected (total reflection).
Nevertheless, there is an important difference, which has already been pointed
out in chapter I: because of the existence of the evanescent wave $e^{-\rho_2 x}$, the
particle has a non-zero probability of presence in the region of space which,
classically, would be forbidden to it. This probability decreases exponentially
with x and becomes negligible when x is greater than the “range” $1/\rho_2$ of
the evanescent wave. Note also that the coefficient $A_1'/A_1$ is complex. A certain
phase shift appears upon reflection, which, physically, is due to the fact that the
particle is delayed when it penetrates the x > 0 region (cf. complement $J_I$, § 1
and also $B_{III}$, § 3). This phase shift is analogous to the one which appears
when light is reflected from a metallic type of substance; however, there is no
analogue in classical mechanics.
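
The modulus and phase of the ratio (22) can be checked with complex arithmetic. The following sketch assumes units in which $2m/\hbar^2 = 1$, so that $k_1 = \sqrt{E}$ and $\rho_2 = \sqrt{V_0 - E}$; the function name and the sample energies are choices made for the illustration.

```python
import cmath
import math

def total_reflection(E, V0):
    """Return (R, phase of A1'/A1, range 1/rho2) for 0 < E < V0.

    Units are chosen so that 2m/hbar^2 = 1 (an assumption of this sketch).
    """
    k1 = math.sqrt(E)
    rho2 = math.sqrt(V0 - E)
    ratio = (k1 - 1j * rho2) / (k1 + 1j * rho2)   # equation (22)
    return abs(ratio) ** 2, cmath.phase(ratio), 1.0 / rho2

R, phase, rng = total_reflection(1.0, 2.0)
print(R)       # modulus squared of the ratio, equation (24)
print(phase)   # phase shift on reflection, equal to -2*atan(rho2/k1)
print(rng)     # range of the evanescent wave
```

The reflected wave has the same modulus as the incident one for any E below $V_0$; only its phase depends on the energy.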


COMMENT:

When $V_0 \to +\infty$, $\rho_2 \to +\infty$, so that (22) and (23) yield:

$$A_1' \to -A_1, \qquad B_2' \to 0 \qquad (25)$$

In the x > 0 region, the wave, whose range decreases without bound, tends
towards zero. Since $(A_1 + A_1') \to 0$, the wave function $\varphi(x)$ goes to zero
at x = 0, so that it remains continuous at this point. On the other hand, its
derivative, which changes abruptly from the value $2ikA_1$ to zero, is no longer
continuous. This is due to the fact that since the potential jump is infinite
at x = 0, the integral of (7) no longer tends towards zero when $\eta$ tends
towards 0.

b. POTENTIAL BARRIERS


FIGURE 2

Square potential barrier.


α. Case where $E > V_0$*; resonances

Using notations (9) and (10), we find, in the three regions I (x < 0),
II (0 < x < l) and III (x > l):

$$\varphi_I(x) = A_1\, e^{ik_1 x} + A_1'\, e^{-ik_1 x} \qquad (26\text{-a})$$

$$\varphi_{II}(x) = A_2\, e^{ik_2 x} + A_2'\, e^{-ik_2 x} \qquad (26\text{-b})$$

$$\varphi_{III}(x) = A_3\, e^{ik_1 x} + A_3'\, e^{-ik_1 x} \qquad (26\text{-c})$$

Let us choose, as above, $A_3' = 0$ (incident particle coming from $x = -\infty$).
The matching conditions at x = l then give $A_2$ and $A_2'$ in terms of $A_3$, and those
at x = 0 give $A_1$ and $A_1'$ in terms of $A_2$ and $A_2'$ (and, consequently, in terms of $A_3$).
Thus we find:


$$A_1 = \left[\cos k_2 l - i\,\frac{k_1^2 + k_2^2}{2k_1 k_2}\, \sin k_2 l\right] e^{ik_1 l}\, A_3$$

$$A_1' = i\,\frac{k_2^2 - k_1^2}{2k_1 k_2}\, \sin k_2 l\;\, e^{ik_1 l}\, A_3 \qquad (27)$$

$A_1'/A_1$ and $A_3/A_1$ enable us to calculate the reflection coefficient R and the
transmission coefficient T of the barrier, which here are equal to:

$$R = \left|\frac{A_1'}{A_1}\right|^2 = \frac{(k_1^2 - k_2^2)^2 \sin^2 k_2 l}{4k_1^2 k_2^2 + (k_1^2 - k_2^2)^2 \sin^2 k_2 l} \qquad (28\text{-a})$$

$$T = \left|\frac{A_3}{A_1}\right|^2 = \frac{4k_1^2 k_2^2}{4k_1^2 k_2^2 + (k_1^2 - k_2^2)^2 \sin^2 k_2 l} \qquad (28\text{-b})$$

* $V_0$ can either be positive (the case of a potential barrier like the one shown in figure 2) or
negative (a potential well).

It is then easy to verify that R + T = 1. Taking (9) and (10) into account, we have:

$$T = \frac{4E(E - V_0)}{4E(E - V_0) + V_0^2 \sin^2\left[\sqrt{2m(E - V_0)}\; l/\hbar\right]} \qquad (29)$$

The variations with respect to l of the transmission coefficient T are shown
in figure 3 (with E and $V_0$ fixed): T oscillates periodically between its minimum
value, $\left[1 + \dfrac{V_0^2}{4E(E - V_0)}\right]^{-1}$, and its maximum value, which is 1. This function
is the analogue of the one which describes the transmission of a Fabry-Perot
interferometer. As in optics, the resonances (obtained when T = 1, that is,
when $k_2 l = n\pi$) correspond to the values of l which are integral multiples of the
half-wavelength of the particle in region II. When $E > V_0$, the reflection of the
particle at each of the potential discontinuities occurs without a phase shift of
the wave function (cf. § 2-a-α). This is why the resonance condition $k_2 l = n\pi$
corresponds to the values of l for which a system of standing waves can exist
in region II. On the other hand, far from the resonances, the various waves which
are reflected at x = 0 and x = l destroy each other by interference, so that the
values of the wave function are small. A study of the propagation of a wave packet
(analogous to the one in complement $J_I$) would show that, if the resonance
condition is satisfied, the wave packet spends a relatively long time in region II. In quantum
mechanics this phenomenon is called resonance scattering.


FIGURE 3

Variations of the transmission coefficient T of the barrier as a function of its width l
(the height $V_0$ of the barrier and the energy E of the particle are fixed). Resonances
appear each time that l is an integral multiple of the half-wavelength $\pi/k_2$ in region II.
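
The oscillation of T with the width l can be reproduced from (29). The sketch below assumes units where $2m/\hbar^2 = 1$ (so that $k_2 = \sqrt{E - V_0}$); the function name and the values of E and $V_0$ are illustrative assumptions.

```python
import math

def barrier_T(E, V0, l):
    """Transmission coefficient (29) for E > V0, in units where 2m/hbar^2 = 1."""
    k2 = math.sqrt(E - V0)
    x = 4.0 * E * (E - V0)
    return x / (x + V0**2 * math.sin(k2 * l) ** 2)

E, V0 = 1.5, 1.0
k2 = math.sqrt(E - V0)

# Resonances: l equal to an integral multiple of the half-wavelength pi/k2
T_res = [barrier_T(E, V0, n * math.pi / k2) for n in (1, 2, 3)]
# Minimum: l equal to an odd multiple of the quarter-wavelength
T_min = barrier_T(E, V0, 0.5 * math.pi / k2)

print(T_res)
print(T_min, 1.0 / (1.0 + V0**2 / (4.0 * E * (E - V0))))   # should agree
```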


β. Case where $E < V_0$; tunnel effect


We must now replace (26-b) by (20), $\rho_2$ still being given by (19). The matching
conditions at x = 0 and x = l enable us to calculate the transmission coefficient
of the barrier. In fact, it is unnecessary to perform the calculations again: all
we must do is replace, in the equations obtained in § α, $k_2$ by $-i\rho_2$. We then have:

$$T = \left|\frac{A_3}{A_1}\right|^2 = \frac{4E(V_0 - E)}{4E(V_0 - E) + V_0^2 \sinh^2\left[\sqrt{2m(V_0 - E)}\; l/\hbar\right]} \qquad (30)$$

with, of course, R = 1 − T. When $\rho_2 l \gg 1$, we have:

$$T \simeq \frac{16E(V_0 - E)}{V_0^2}\; e^{-2\rho_2 l} \qquad (31)$$

We have already seen, in chapter I, why, contrary to the classical predictions, the
particle has a non-zero probability of crossing the potential barrier. The wave
function in region II is not zero, but has the behavior of an “evanescent wave” of
range $1/\rho_2$. When $l \lesssim 1/\rho_2$, the particle has a considerable probability of crossing
the barrier by the “tunnel effect”. This effect has numerous physical applications:
the inversion of the ammonia molecule (cf. complement $G_{IV}$), the tunnel diode, the
Josephson effect, the α-decay of certain nuclei, etc.


For an electron, the range of the evanescent wave is:

$$\frac{1}{\rho_2} \simeq \frac{1.96}{\sqrt{V_0 - E}}\ \text{Å} \qquad (32)$$

where E and $V_0$ are expressed in electron-volts (this formula can be obtained immediately by
replacing, in formula (8) of complement $A_I$, $\lambda = 2\pi/k$ by $2\pi/\rho_2$). Now consider an electron
of energy 1 eV which encounters a barrier for which $V_0$ = 2 eV and l = 1 Å. The range of the
evanescent wave is then 1.96 Å, that is, of the order of l: the electron must then have a
considerable probability of crossing the barrier. Indeed, formula (30) gives in this case:

$$T \simeq 0.78 \qquad (33)$$

The quantum result is radically different from the classical result: the electron has approximately
8 chances out of 10 of crossing the barrier.

Let us now assume that the incident particle is a proton (whose mass is about 1 840 times
that of the electron). The range $1/\rho_2$ then becomes:

$$\left(\frac{1}{\rho_2}\right)_{pr} \simeq \frac{1.96}{\sqrt{1\,840\,(V_0 - E)}}\ \text{Å} \simeq \frac{4.6 \times 10^{-2}}{\sqrt{V_0 - E}}\ \text{Å} \qquad (34)$$

If we retain the same values: E = 1 eV, $V_0$ = 2 eV, l = 1 Å, we find a range $1/\rho_2$ much smaller
than l. Formula (31) then gives:

$$T \simeq 4 \times 10^{-19} \qquad (35)$$

Under these conditions, the probability of the proton’s crossing the potential barrier is negligible.
This is all the more true if we apply (31) to macroscopic objects, for which we find such small
probabilities that they cannot possibly play any role in physical phenomena.
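
The orders of magnitude (32) to (35) can be reproduced from (19) and (30). The sketch below uses rounded SI values of the constants, which are assumptions of this illustration and slightly more recent than the values used in the text, so the last digits may differ; the function names are also introduced here.

```python
import math

HBAR = 1.054571817e-34      # J s
EV = 1.602176634e-19        # J
M_ELECTRON = 9.1093837e-31  # kg
M_PROTON = 1.67262192e-27   # kg
ANGSTROM = 1e-10            # m

def tunnel_T(E_eV, V0_eV, l_angstrom, mass):
    """Transmission coefficient (30) for E < V0."""
    rho2 = math.sqrt(2.0 * mass * (V0_eV - E_eV) * EV) / HBAR   # equation (19)
    x = 4.0 * E_eV * (V0_eV - E_eV)
    return x / (x + V0_eV**2 * math.sinh(rho2 * l_angstrom * ANGSTROM) ** 2)

def evanescent_range(E_eV, V0_eV, mass):
    """Range 1/rho2 of the evanescent wave, in angstroms."""
    return HBAR / math.sqrt(2.0 * mass * (V0_eV - E_eV) * EV) / ANGSTROM

T_electron = tunnel_T(1.0, 2.0, 1.0, M_ELECTRON)
T_proton = tunnel_T(1.0, 2.0, 1.0, M_PROTON)
print(evanescent_range(1.0, 2.0, M_ELECTRON), T_electron)
print(evanescent_range(1.0, 2.0, M_PROTON), T_proton)
```

Because T depends exponentially on $\rho_2 l$, the proton result is sensitive to the exact mass ratio used; only its order of magnitude is significant.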


c. BOUND STATES: SQUARE WELL POTENTIAL 

α. Well of finite depth

We shall limit ourselves here to studying the case $-V_0 < E < 0$ (the
case E > 0 was included in the calculations of the preceding section, § b-α).


FIGURE 4

Square well potential.


In regions I $\left(x < -\frac{a}{2}\right)$, II $\left(-\frac{a}{2} \leq x \leq \frac{a}{2}\right)$ and III $\left(x > \frac{a}{2}\right)$, we have,
respectively:

$$\varphi_I(x) = B_1\, e^{\rho x} + B_1'\, e^{-\rho x} \qquad (36\text{-a})$$

$$\varphi_{II}(x) = A_2\, e^{ikx} + A_2'\, e^{-ikx} \qquad (36\text{-b})$$

$$\varphi_{III}(x) = B_3\, e^{\rho x} + B_3'\, e^{-\rho x} \qquad (36\text{-c})$$

with:

$$\rho = \sqrt{-\frac{2mE}{\hbar^2}} \qquad (37)$$

$$k = \sqrt{\frac{2m(E + V_0)}{\hbar^2}} \qquad (38)$$

Since $\varphi(x)$ must be bounded in region I, we must have:

$$B_1' = 0 \qquad (39)$$

The matching conditions at $x = -\frac{a}{2}$ then give:

$$A_2 = e^{(-\rho + ik)a/2}\; \frac{\rho + ik}{2ik}\; B_1$$

$$A_2' = -\,e^{-(\rho + ik)a/2}\; \frac{\rho - ik}{2ik}\; B_1 \qquad (40)$$

and those at $x = a/2$:

$$\frac{B_3}{B_1} = \frac{e^{-\rho a}}{4ik\rho} \left[(\rho + ik)^2\, e^{ika} - (\rho - ik)^2\, e^{-ika}\right]$$

$$\frac{B_3'}{B_1} = \frac{\rho^2 + k^2}{2k\rho}\, \sin ka \qquad (41)$$

But $\varphi(x)$ must also be bounded in region III. Therefore, it is necessary
that $B_3 = 0$, that is:

$$\left(\frac{\rho - ik}{\rho + ik}\right)^2 = e^{2ika} \qquad (42)$$


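Since $\rho$ and k depend on E, equation (42) can only be satisfied for certain
values of E. Requiring $\varphi(x)$ to be bounded in all regions of space thus entails the
quantization of energy. More precisely, two cases are possible:

(i) if:

$$\frac{\rho - ik}{\rho + ik} = -\,e^{ika} \qquad (43)$$

we have:

$$\frac{\rho}{k} = \tan\frac{ka}{2} \qquad (44)$$

Set:

$$k_0 = \sqrt{\frac{2mV_0}{\hbar^2}} = \sqrt{k^2 + \rho^2} \qquad (45)$$

We then obtain:

$$\frac{1}{\cos^2\dfrac{ka}{2}} = 1 + \tan^2\frac{ka}{2} = \frac{k^2 + \rho^2}{k^2} = \left(\frac{k_0}{k}\right)^2 \qquad (46)$$

Equation (43) is thus equivalent to the system of equations:

$$\left|\cos\frac{ka}{2}\right| = \frac{k}{k_0} \qquad (47\text{-a})$$

$$\tan\frac{ka}{2} > 0 \qquad (47\text{-b})$$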


FIGURE 5

Graphic solution of equation (42), giving the energies of the bound states of a particle in a square
well potential. In the case shown in the figure, there exist five bound states, three even (associated
with the points P of the figure), and two odd (points I).

The energy levels are determined by the intersection of a straight line, having
a slope $1/k_0$, with sinusoidal arcs (long dashed lines in figure 5). Thus we obtain
a certain number of energy levels, whose wave functions are even. This becomes clear
if we substitute (43) into (40) and (41); it is easy to verify that $B_3' = B_1$ and that
$A_2 = A_2'$, so that $\varphi(-x) = \varphi(x)$.

(ii) if:

$$\frac{\rho - ik}{\rho + ik} = e^{ika} \qquad (48)$$

a calculation of the same type leads to:

$$\left|\sin\frac{ka}{2}\right| = \frac{k}{k_0} \qquad (49\text{-a})$$

$$\tan\frac{ka}{2} < 0 \qquad (49\text{-b})$$

The energy levels are then determined by the intersection of the same straight line
as before with other sinusoidal arcs (cf. short dashed lines in figure 5). The levels
thus obtained fall between those found in (i). It can easily be shown that the
corresponding wave functions are odd.


COMMENT:

If $k_0 < \dfrac{\pi}{a}$, that is, if:

$$V_0 < V_1 = \frac{\pi^2 \hbar^2}{2ma^2} \qquad (50)$$

figure 5 shows that there exists only one bound state of the particle, and this state has
an even wave function. Then, if $V_1 \leq V_0 < 4V_1$, a first odd level appears, and so on:
when $V_0$ increases, there appear alternately even and odd levels. If $V_0 \gg V_1$, the
slope $1/k_0$ of the straight line of figure 5 is very small: for the lowest energy levels,
we practically have:

$$k = \frac{n\pi}{a} \qquad (51)$$

where n is an integer, and consequently:

$$E = \frac{n^2 \pi^2 \hbar^2}{2ma^2} - V_0 \qquad (52)$$
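
The graphic construction of figure 5 can be replaced by a numerical root search. The sketch below is one possible approach, not the method of the text: it rewrites (44) and its odd counterpart $\rho/k = -\cot(ka/2)$ in a form free of singularities and solves them by bisection, using the dimensionless variables $u = ka/2$ and $u_0 = k_0 a/2$ (names introduced here). The energy of each level then follows from $E = V_0\,[(u/u_0)^2 - 1]$.

```python
import math

def bound_states(u0, tol=1e-12):
    """Solve (47) and (49) for a square well.

    u0 = k0 * a / 2 is the only parameter; each solution is returned as
    (u, parity) with u = k * a / 2, sorted by increasing energy.
    """
    def even(u):   # zero when rho/k = tan(ka/2), equation (44)
        return u * math.sin(u) - math.sqrt(u0**2 - u**2) * math.cos(u)

    def odd(u):    # zero when rho/k = -cot(ka/2)
        return u * math.cos(u) + math.sqrt(u0**2 - u**2) * math.sin(u)

    states = []
    n = 0
    while n * math.pi / 2 < u0:
        f, parity = (even, "even") if n % 2 == 0 else (odd, "odd")
        lo = n * math.pi / 2
        hi = min((n + 1) * math.pi / 2, u0)
        f_lo = f(lo)
        while hi - lo > tol:            # plain bisection on a sign change
            mid = 0.5 * (lo + hi)
            if (f(mid) > 0) == (f_lo > 0):
                lo = mid
            else:
                hi = mid
        states.append((0.5 * (lo + hi), parity))
        n += 1
    return states

for u0 in (1.0, 2.0, 7.0):
    levels = bound_states(u0)
    print(u0, [(round((u / u0) ** 2 - 1.0, 4), p) for u, p in levels])  # E / V0
```

Each interval of length $\pi/2$ in u below $u_0$ contains exactly one solution, with alternating parity, which is the counting rule stated in the comment above.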


β. Infinitely deep well

Assume V(x) to be zero for 0 < x < a and infinite everywhere else. Set:

$$k = \sqrt{\frac{2mE}{\hbar^2}} \qquad (53)$$

According to the comment made at the end of § 2-a-β of this complement, $\varphi(x)$ must
be zero outside the interval [0, a], and continuous at x = 0, as well as at x = a.
Now for $0 \leq x \leq a$:

$$\varphi(x) = A\, e^{ikx} + A'\, e^{-ikx} \qquad (54)$$

Since $\varphi(0) = 0$, it can be deduced that A' = −A, which leads to:

$$\varphi(x) = 2iA \sin kx \qquad (55)$$

Moreover, $\varphi(a) = 0$, so that:

$$k = \frac{n\pi}{a} \qquad (56)$$

where n is an arbitrary positive integer. If we normalize function (55), taking (56)
into account, we then obtain the stationary wave functions:

$$\varphi_n(x) = \sqrt{\frac{2}{a}}\, \sin\left(\frac{n\pi x}{a}\right) \qquad (57)$$

with energies:

$$E_n = \frac{n^2 \pi^2 \hbar^2}{2ma^2} \qquad (58)$$

The quantization of the energy levels is thus, in this case, particularly simple.

COMMENTS:

(i) Relation (56) simply expresses the fact that the stationary states are determined
by the condition that the width a of the well must contain an integral number
of half-wavelengths, $\pi/k$. This is not the case when the well has a finite
depth (cf. § α); the difference between the two cases arises from the phase
shift of the wave function which occurs upon reflection from a potential
step (cf. § 2-a-β).

(ii) It can easily be verified from (51) and (52) that, if the depth $V_0$ of a finite well
approaches infinity, we find the energy levels of an infinite well.
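
As an illustration of (57) and (58), the sketch below checks numerically that the stationary wave functions are orthonormal and that the energies scale as $n^2$. Units with $\hbar = m = 1$, the well width and the function names are assumptions made here.

```python
import math

def phi(n, x, a):
    """Normalized stationary wave function (57) inside the well 0 <= x <= a."""
    return math.sqrt(2.0 / a) * math.sin(n * math.pi * x / a)

def energy(n, a, hbar=1.0, m=1.0):
    """Energy level (58)."""
    return n**2 * math.pi**2 * hbar**2 / (2.0 * m * a**2)

def overlap(n1, n2, a, steps=2000):
    """Trapezoid-rule integral of phi_n1 * phi_n2 over [0, a]."""
    h = a / steps
    total = sum(phi(n1, i * h, a) * phi(n2, i * h, a) for i in range(1, steps))
    return total * h   # the end points are omitted since phi vanishes there

a = 2.0
print(overlap(1, 1, a), overlap(2, 2, a))   # normalization
print(overlap(1, 2, a), overlap(1, 3, a))   # orthogonality
print([energy(n, a) / energy(1, a) for n in (1, 2, 3)])
```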


References and suggestions for further reading: 

Eisberg and Resnick (1.3), chap. 6; Ayant and Belorizky (1.10), chap. 4; Messiah (1.17),
chap. III; Merzbacher (1.16), chap. 6; Valentin (16.1), annex V.


Complement $J_I$

BEHAVIOR OF A WAVE PACKET AT A POTENTIAL STEP


1. Total reflection: $E < V_0$
2. Partial reflection: $E > V_0$

In complement $H_I$, we determined the stationary states of a particle in various
“square” potentials. For certain cases (a step potential, for example), the stationary
states obtained consist of unbounded plane waves (incident, reflected and
transmitted). Of course, since they cannot be normalized, such wave functions cannot
really represent a physical state of the particle. However, they can be linearly
superposed to form normalizable wave packets. Moreover, since such a wave packet
is expanded directly in terms of stationary wave functions, its time evolution is
very simple to determine. All we need to do is multiply each of the coefficients of the
expansion by an imaginary exponential with a well-defined frequency $E/\hbar$
(chap. I, § D-1-b).

We intend, in this complement, to construct such wave packets and study
their time evolution for the case where the potential presents a “step” of
height $V_0$, as in figure 1 of complement $H_I$. In this way, we shall be able to
describe precisely the quantum behavior of the particle when it arrives at the
potential step by determining the motion and the deformation of its associated
wave packet. This will also enable us to confirm various results obtained in $H_I$
through the study of the stationary states alone (reflection and transmission
coefficients, slowing down upon reflection, etc.).

We shall set:

$$\sqrt{\frac{2mE}{\hbar^2}} = k, \qquad \sqrt{\frac{2mV_0}{\hbar^2}} = K_0 \qquad (1)$$

and, as in complement $H_I$, we shall distinguish between two cases, corresponding to k
smaller or greater than $K_0$.


1. Total reflection: $E < V_0$

In this case, the stationary wave functions are given by formulas (11) and (20)
of complement $H_I$ ($k_1$ will be simply called k here), the coefficients $A_1$, $A_1'$, $B_2$ and $B_2'$
of these formulas being related by equations (21), (22) and (23) of $H_I$.

We are going to construct a wave packet from these stationary wave functions
by linearly superposing them. We shall choose only values of k less than $K_0$,
so as to have the waves forming the packet undergo total reflection. In order to do
this, we shall choose a function g(k) (which characterizes the wave packet) which
is zero for $k > K_0$. We are going to concentrate our attention on the negative
region of the x-axis, to the left of the potential barrier. In complement $H_I$,
relation (22) shows that the coefficients $A_1$ and $A_1'$ of expression (11) for a stationary
wave in this region have the same modulus. Therefore, we can set:

$$\frac{A_1'(k)}{A_1(k)} = e^{-2i\theta(k)} \qquad (2)$$

with [cf. formula (19) of $H_I$]:

$$\tan\theta(k) = \frac{\sqrt{K_0^2 - k^2}}{k} \qquad (3)$$

Finally, the wave packet which we are going to consider can be written, at
time t = 0, for negative x:

$$\psi(x, 0) = \frac{1}{\sqrt{2\pi}} \int_0^{K_0} dk\; g(k) \left[e^{ikx} + e^{-2i\theta(k)}\, e^{-ikx}\right] \qquad (4)$$

As in § C of chapter I, we assume that |g(k)| has a pronounced peak of width $\Delta k$
about the value $k = k_0 < K_0$.

In order to obtain the expression for the wave function $\psi(x, t)$ at any
time t, we simply use the general relation (D-14) of chapter I:

$$\psi(x, t) = \frac{1}{\sqrt{2\pi}} \int_0^{K_0} dk\; g(k)\, e^{i[kx - \omega(k)t]} + \frac{1}{\sqrt{2\pi}} \int_0^{K_0} dk\; g(k)\, e^{-i[kx + \omega(k)t + 2\theta(k)]} \qquad (5)$$

where $\omega(k) = \hbar k^2/2m$. By construction, this expression is valid only for negative x.
Its first term represents the incident wave packet; its second term, the reflected
packet. For simplicity, we shall assume g(k) to be real. The stationary phase
condition (cf. chap. I, § C-2) then enables us to calculate the position $x_i$ of the
center of the incident wave packet: if, at $k = k_0$, we set the derivative with respect
to k of the argument of the first exponential equal to zero, we obtain:

$$x_i = t \left[\frac{d\omega}{dk}\right]_{k = k_0} = \frac{\hbar k_0}{m}\, t \qquad (6)$$

In the same way, the position $x_r$ of the center of the reflected packet is obtained by
differentiating the argument of the second exponential. Differentiating equation (3),
we find:

$$\left[1 + \tan^2\theta\right] d\theta = -\frac{dk}{k^2}\sqrt{K_0^2 - k^2} - \frac{dk}{\sqrt{K_0^2 - k^2}} \qquad (7)$$

that is:

$$\frac{K_0^2}{k^2}\, d\theta = -\frac{K_0^2}{k^2}\; \frac{dk}{\sqrt{K_0^2 - k^2}} \qquad (8)$$

Thus we have:

$$x_r = -\left[t\,\frac{d\omega}{dk} + 2\,\frac{d\theta}{dk}\right]_{k = k_0} = -\frac{\hbar k_0}{m}\, t + \frac{2}{\sqrt{K_0^2 - k_0^2}} \qquad (9)$$

Formulas (6) and (9) enable us to describe more precisely the motion of the
particle, localized in a region of small width $\Delta x$ centered at $x_i$ or $x_r$.

First of all, let us consider what happens for negative /. The center x t of the 
incident wave packet propagates from left to right with a constant velocity hkjm. 
On the other hand, we see from formula (9) that x r is positive, that is, situated 
outside the region x < 0 where expression (5) for the wave function is valid. This 
means that, for all negative values of x , the various waves of the second term of (5) 
interfere destructively: for negative t, there is no reflected wave packet , but only 
an incident wave packet like those we studied in § C of chapter 1. 

The center of the incident wave packet arrives at the barrier at time / = 0. 
During a certain interval of time around t = 0, the wave packet is localized in the 
region x a: 0 where the barrier is, and its form is relatively complicated. But, 
when t is sufficiently large, we see from (6) and (9) that it is the incident wave 
packet which has disappeared, and we are left with only the reflected wave packet. 
It is now x, which is positive, while x r has become negative : the waves of 
the incident packet interfere destructively for all negative values of x, while those 
of the reflected packet interfere constructively for x = x r < 0. The reflected wave 
packet propagates towards the left at a speed of — hkjm, opposite to that of the 
incident packet, whose mirror image it is; its form is unchanged*. Moreover, for¬ 
mula (9) shows that the reflection has introduced a delay x, given by : 



( 10 ) 


* We assume $\Delta k$ to be small enough for the spreading of the wave packet to be negligible during
the time interval considered.

Contrary to what is predicted by classical mechanics, the particle is not instantaneously
reflected. Note that the delay $\tau$ is related to the phase shift $2\theta(k)$ between
the incident wave and the reflected wave for a given value of $k$. Nevertheless, it
should be observed that the delay of the wave packet is not simply proportional
to $\theta(k_0)$, as would be the case for an unbounded plane wave, but to the derivative
$d\theta/dk$ evaluated at $k = k_0$. Physically, this delay is due to the fact that, for $t$
close to zero, the probability of presence of the particle in the region $x > 0$,
which is classically forbidden, is not zero [evanescent wave, see comment (i) below].
It can be said, metaphorically, that the particle spends a time of the order of $\tau$
in this region before retracing its steps. Formula (10) shows that the closer the
average energy $\hbar^2 k_0^2/2m$ of the wave packet is to the height $V_0$ of the barrier, the
longer the delay $\tau$.
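As a quick numerical check of formula (10), the following sketch compares the closed form of the delay with the one obtained by differentiating $\theta(k)$ numerically. It assumes that equation (3) reads $\tan\theta = \sqrt{K_0^2 - k^2}/k$; the units $\hbar = m = 1$, the sample values of $k_0$ and the function names are illustrative choices, not part of the text.

```python
import math

def theta(k, K0):
    """Phase theta(k), assuming tan(theta) = sqrt(K0^2 - k^2) / k for 0 < k < K0."""
    return math.atan(math.sqrt(K0**2 - k**2) / k)

def delay_analytic(k0, K0, hbar=1.0, m=1.0):
    """Closed form of (10): tau = 2m / (hbar k0 sqrt(K0^2 - k0^2))."""
    return 2.0 * m / (hbar * k0 * math.sqrt(K0**2 - k0**2))

def delay_numeric(k0, K0, hbar=1.0, m=1.0, dk=1e-6):
    """tau = -2 [dtheta/dk] / (hbar k0 / m), with a central finite difference."""
    dtheta_dk = (theta(k0 + dk, K0) - theta(k0 - dk, K0)) / (2.0 * dk)
    return -2.0 * dtheta_dk / (hbar * k0 / m)

K0 = 1.0
# Sample wave numbers approaching K0, i.e. average energies approaching V0.
taus = [delay_analytic(k0, K0) for k0 in (0.8, 0.9, 0.99)]
```

For these sample values the delay grows as $k_0$ approaches $K_0$, in agreement with the remark above.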


comments : 


(i)

Here we have stressed the study of the wave packet for $x < 0$, but it is also possible to
study what happens for $x > 0$. In this region, the wave packet can be written:

$$\psi(x, t) = \frac{1}{\sqrt{2\pi}} \int_0^{K_0} dk\, g(k)\, B'_2(k)\, e^{-\rho(k)x}\, e^{-i\omega(k)t} \qquad (11)$$

where:

$$\rho(k) = \sqrt{K_0^2 - k^2} \qquad (12)$$

$B'_2(k)$ is given by equation (23) of complement $H_I$, when we replace $A_1$ by 1, $k_1$ by $k$
and $\rho_2$ by $\rho$. An argument analogous to the one in § C-2 of chapter I then shows that the
modulus $|\psi(x, t)|$ of expression (11) is maximum when the phase of the function to be
integrated over $k$ is stationary. Now, according to expressions (22) and (23) of $H_I$, the
argument of $B'_2$ is half that of $A'_1$, which, according to (2), is equal to $-2\theta(k)$.
Consequently, if we expand $\omega(k)$ and $\theta(k)$ in the neighborhood of $k = k_0$, we obtain, for the
phase of the function to be integrated over $k$ in (11):

$$\left\{ -\left[\frac{d\theta}{dk}\right]_{k=k_0} - \left[\frac{d\omega}{dk}\right]_{k=k_0} t \right\}(k - k_0) = -\frac{\hbar k_0}{m}\,(k - k_0)\left[t - \frac{\tau}{2}\right] \qquad (13)$$


[we have used (10) and the fact that $g(k)$ is assumed real]. From this we can deduce
that $|\psi(x, t)|$ is maximum in the $x > 0$ region for $t = \tau/2$. The time at which the wave
packet turns back is therefore $\tau/2$, which gives us the same delay $\tau$ upon reflection that
we obtained above. We also see from expression (13) that, as soon as $|t - \tau/2|$ exceeds
the time $\Delta t$ defined by:

$$\frac{\hbar k_0}{m}\, \Delta k\, \Delta t \simeq 1 \qquad (14)$$

where $\Delta k$ is the width of $g(k)$, the waves go out of phase and expression (11) for $|\psi(x, t)|$
becomes negligible. Thus, the wave packet as a whole remains in the $x > 0$ region
during an interval of time $\Delta t$ of the order of:

$$\Delta t \simeq \frac{1/\Delta k}{\hbar k_0/m} \qquad (15)$$

which corresponds approximately to the time it takes, in the $x < 0$ region, to travel
a distance comparable to its width $1/\Delta k$.

Since $\Delta k$ is assumed to be much smaller than $k_0$ and $K_0$, the comparison of (10) and (15)
shows that:

$$\Delta t \gg \tau \qquad (16)$$


* Note that the phase (13) does not depend on $x$, contrary to what we found in chapter I for a
free wave packet. It follows that, in the $x > 0$ region, $|\psi(x, t)|$ does not have a pronounced peak which
moves with respect to time.

The delay upon reflection thus involves, for the reflected wave packet, a displacement
which is much smaller than its width.
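To get a feeling for inequality (16), one can simply evaluate (10) and (15) for sample values. The sketch below does this in units where $\hbar = m = 1$; the numbers chosen for $k_0$, $K_0$ and $\Delta k$ are purely illustrative assumptions.

```python
import math

def delay_tau(k0, K0, hbar=1.0, m=1.0):
    """Delay upon reflection, formula (10)."""
    return 2.0 * m / (hbar * k0 * math.sqrt(K0**2 - k0**2))

def time_in_barrier(k0, delta_k, hbar=1.0, m=1.0):
    """Order of magnitude (15): time needed to travel a distance 1/delta_k at velocity hbar k0 / m."""
    return (1.0 / delta_k) / (hbar * k0 / m)

# Illustrative values with delta_k much smaller than k0 and K0.
k0, K0, delta_k = 0.8, 1.0, 0.01

tau = delay_tau(k0, K0)
delta_t = time_in_barrier(k0, delta_k)
ratio = delta_t / tau            # equals sqrt(K0^2 - k0^2) / (2 delta_k)
displacement = k0 * tau          # distance travelled during tau (hbar = m = 1)
width = 1.0 / delta_k            # width of the wave packet
```

With these values the ratio $\Delta t/\tau$ is 30, and the displacement associated with the delay is a few percent of the width of the packet.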


2. Partial reflection: $E > V_0$


This time, we shall consider a function $g(k)$ of width $\Delta k$, centered at a
value $k = k_0 > K_0$, which is zero for $k < K_0$. The wave packet is formed in this
case by superposing, with coefficients $g(k)$, the stationary wave functions whose
expressions are given by formulas (11) and (12) of complement $H_I$. We shall
choose $A'_2 = 0$ so as to have the particle being considered arrive at the barrier
from the negative region of the $Ox$ axis, and we shall take $A_1 = 1$. The coefficients
$A'_1(k)$ and $A_2(k)$ are obtained from formulas (13) and (14) of complement $H_I$
(in which $A_1$ is replaced by 1, $k_1$ by $k$, and $k_2$ by $\sqrt{k^2 - K_0^2}$). In order to describe
the wave packet by a single expression, valid for all values of $x$, we can use the
Heaviside “step function” $\theta(x)$ defined by:

$$\theta(x) = 0 \ \text{ if } x < 0, \qquad \theta(x) = 1 \ \text{ if } x > 0 \qquad (17)$$


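The wave packet under consideration can then be written:

$$\begin{aligned}
\psi(x, t) ={}& \theta(-x)\, \frac{1}{\sqrt{2\pi}} \int_{K_0}^{+\infty} dk\, g(k)\, e^{i[kx - \omega(k)t]} \\
&+ \theta(-x)\, \frac{1}{\sqrt{2\pi}} \int_{K_0}^{+\infty} dk\, g(k)\, A'_1(k)\, e^{-i[kx + \omega(k)t]} \\
&+ \theta(x)\, \frac{1}{\sqrt{2\pi}} \int_{K_0}^{+\infty} dk\, g(k)\, A_2(k)\, e^{i[\sqrt{k^2 - K_0^2}\, x - \omega(k)t]}
\end{aligned} \qquad (18)$$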

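This time, we find three wave packets: incident, reflected and transmitted.
As in § 1 above, the stationary phase condition gives the position of their respective
centers $x_i$, $x_r$ and $x_t$. Since $A'_1(k)$ and $A_2(k)$ are real, we find:

$$x_i = \frac{\hbar k_0}{m}\, t \qquad \text{(19-a)}$$

$$x_r = -\frac{\hbar k_0}{m}\, t \qquad \text{(19-b)}$$

$$x_t = \frac{\hbar \sqrt{k_0^2 - K_0^2}}{m}\, t \qquad \text{(19-c)}$$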


A discussion analogous to that of (6) and (9) leads to the following conclusions:
for negative $t$, only the incident wave packet exists; for sufficiently large positive $t$,
only the reflected and transmitted wave packets exist (fig. 1). Note that there is no
delay, either upon reflection or upon transmission [this is due to the fact that the
coefficients $A'_1(k)$ and $A_2(k)$ are real].

The incident and reflected wave packets propagate with velocities of $\hbar k_0/m$
and $-\hbar k_0/m$ respectively. Let us assume $\Delta k$ to be sufficiently small that, within
the interval $\left[k_0 - \frac{\Delta k}{2},\ k_0 + \frac{\Delta k}{2}\right]$, we can neglect the variation of $A'_1(k)$ compared to
that of $g(k)$. We can then, in the second term of (18), replace $A'_1(k)$ by $A'_1(k_0)$ and
take it outside the integral. It is then easy to see that the reflected wave packet
has the same form as the incident wave packet, its mirror image. Its amplitude is
smaller, however, since, according to formula (13) of complement $H_I$, $A'_1(k_0)$
is less than 1. The reflection coefficient $R$ is, by definition, the ratio between
the probabilities of finding the particle in the reflected wave packet and in the
incident packet. Therefore, we have $R = |A'_1(k_0)|^2$, which indeed corresponds to
equation (15) of complement $H_I$ [recall that we have chosen $A_1(k_0) = 1$].

FIGURE 1

Behavior of a wave packet at a potential step, in the case $E > V_0$. The potential is shown in figure a.
In figure b, the wave packet is moving towards the step. Figure c shows the wave packet during the
transitory period in which it splits in two. Interference between the incident and reflected waves is
responsible for the oscillations of the wave packet in the $x < 0$ region. After a certain time (fig. d),
we find two wave packets. The first one (the reflected wave packet) is returning towards the left; its
amplitude is smaller than that of the incident wave packet, and its width is the same. The second
one (the transmitted wave packet) propagates towards the right; its amplitude is slightly greater
than that of the incident wave packet, but it is narrower.

The situation is different for the transmitted wave packet. We can still use the
fact that $\Delta k$ is very small in order to simplify its expression: we replace $A_2(k)$
by $A_2(k_0)$, and $\sqrt{k^2 - K_0^2}$ by the approximation:

$$\sqrt{k^2 - K_0^2} \simeq \sqrt{k_0^2 - K_0^2} + (k - k_0)\left[\frac{d\sqrt{k^2 - K_0^2}}{dk}\right]_{k=k_0} = q_0 + (k - k_0)\frac{k_0}{q_0} \qquad (20)$$

with:

$$q_0 = \sqrt{k_0^2 - K_0^2} \qquad (21)$$


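The transmitted wave packet can then be written:

$$\psi_t(x, t) \simeq A_2(k_0)\, e^{i(q_0 - k_0^2/q_0)x}\, \frac{1}{\sqrt{2\pi}} \int_{K_0}^{+\infty} dk\, g(k)\, e^{i[k(k_0/q_0)x - \omega(k)t]} \qquad (22)$$

Let us compare this expression to the one for the incident wave packet:

$$\psi_i(x, t) = \frac{1}{\sqrt{2\pi}} \int_{K_0}^{+\infty} dk\, g(k)\, e^{i[kx - \omega(k)t]} \qquad (23)$$

We see that:

$$|\psi_t(x, t)| \simeq A_2(k_0) \left|\psi_i\!\left(\frac{k_0}{q_0}\, x,\ t\right)\right| \qquad (24)$$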


The transmitted wave packet thus has a slightly greater amplitude than that of the
incident packet: according to formula (14) of complement $H_I$, $A_2(k_0)$ is greater
than 1. However, its width is smaller, since, if $|\psi_i(x, t)|$ has a width $\Delta x$, formula (24)
shows that the width of $|\psi_t(x, t)|$ is:

$$(\Delta x)_t = \frac{q_0}{k_0}\, \Delta x \qquad (25)$$


The transmission coefficient (the ratio between the probabilities of finding the
particle in the transmitted packet and in the incident packet) is seen to be the product
of two factors:

$$T = \frac{q_0}{k_0}\, |A_2(k_0)|^2 \qquad (26)$$

This indeed corresponds to formula (16) of complement $H_I$, since $A_1(k_0) = 1$.
Finally, note that, taking into account the contraction of the transmitted wave
packet along the $Ox$ axis, we can find its velocity:

$$v_t = \frac{\hbar k_0}{m}\, \frac{q_0}{k_0} = \frac{\hbar q_0}{m} \qquad (27)$$
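The relations (25), (26) and (27) can be illustrated numerically. The sketch below assumes that formulas (13) and (14) of complement $H_I$ are the usual step-potential amplitudes $A'_1 = (k_0 - q_0)/(k_0 + q_0)$ and $A_2 = 2k_0/(k_0 + q_0)$ (with $A_1 = 1$); this assumption, the units $\hbar = m = 1$ and the function name are not taken from the text.

```python
import math

def step_coefficients(k0, K0):
    """Amplitudes and coefficients for E > V0, i.e. k0 > K0 (assumed standard step formulas)."""
    if k0 <= K0:
        raise ValueError("partial reflection requires k0 > K0")
    q0 = math.sqrt(k0**2 - K0**2)          # formula (21)
    a1_prime = (k0 - q0) / (k0 + q0)       # reflected amplitude A'_1(k0), assumed form
    a2 = 2.0 * k0 / (k0 + q0)              # transmitted amplitude A_2(k0), assumed form
    R = a1_prime**2                        # reflection coefficient
    T = (q0 / k0) * a2**2                  # transmission coefficient, formula (26)
    return q0, a1_prime, a2, R, T

k0, K0 = 2.0, 1.0
q0, a1_prime, a2, R, T = step_coefficients(k0, K0)

width_ratio = q0 / k0        # (Delta x)_t / Delta x, formula (25)
v_incident = k0              # hbar k0 / m with hbar = m = 1
v_transmitted = q0           # hbar q0 / m, formula (27)
```

In this example the transmitted amplitude exceeds 1 while the packet is narrower and slower, and the two coefficients add up to 1, as conservation of probability requires.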


References and suggestions for further reading: 

Schiff (1.18), chap. 5, figs 16, 17, 18, 19; Eisberg and Resnick (1.3), § 6-3, fig. 6-8; also 
see reference (1.32). 
COMPLEMENT $K_I$


EXERCISES


1. A beam of neutrons of constant velocity, mass $M_n$ ($M_n \simeq 1.67 \times 10^{-27}$ kg)
and energy $E$, is incident on a linear chain of atomic nuclei, arranged in a regular
fashion as shown in the figure (these nuclei could be, for example, those of a long
linear molecule). We call $l$ the distance between two consecutive nuclei, and $d$,
their size ($d \ll l$). A neutron detector $D$ is placed far away, in a direction which
makes an angle of $\theta$ with the direction of the incident neutrons.



a) Describe qualitatively the phenomena observed at $D$ when the energy $E$
of the incident neutrons is varied.

b) The counting rate, as a function of $E$, presents a resonance about $E = E_1$.
Knowing that there are no other resonances for $E < E_1$, show that one can determine
$l$. Calculate $l$ for $\theta = 30°$ and $E_1 = 1.3 \times 10^{-20}$ joule.

c) At about what value of $E$ must we begin to take the finite size of the nuclei
into account?


2. Bound state of a particle in a "delta function potential" 


Consider a particle whose Hamiltonian $H$ [operator defined by formula (D-10)
of chapter I] is:

$$H = -\frac{\hbar^2}{2m}\frac{d^2}{dx^2} - \alpha\, \delta(x)$$

where $\alpha$ is a positive constant whose dimensions are to be found.


a) Integrate the eigenvalue equation of $H$ between $-\varepsilon$ and $+\varepsilon$. Letting $\varepsilon$
approach 0, show that the derivative of the eigenfunction $\varphi(x)$ presents a discontinuity
at $x = 0$ and determine it in terms of $\alpha$, $m$ and $\varphi(0)$.


b) Assume that the energy $E$ of the particle is negative (bound state). $\varphi(x)$ can
then be written:

$$x < 0 \qquad \varphi(x) = A_1\, e^{\rho x} + A'_1\, e^{-\rho x}$$

$$x > 0 \qquad \varphi(x) = A_2\, e^{\rho x} + A'_2\, e^{-\rho x}$$

Express the constant $\rho$ in terms of $E$ and $m$. Using the results of the preceding
question, calculate the matrix $M$ defined by:

$$\begin{pmatrix} A_2 \\ A'_2 \end{pmatrix} = M \begin{pmatrix} A_1 \\ A'_1 \end{pmatrix}$$

Then, using the condition that $\varphi(x)$ must be square-integrable, find the possible
values of the energy. Calculate the corresponding normalized wave functions.

c) Trace these wave functions graphically. Give an order of magnitude for
their width $\Delta x$.

d) What is the probability $d\mathcal{P}(p)$ that a measurement of the momentum
of the particle in one of the normalized stationary states calculated above will
give a result included between $p$ and $p + dp$? For what value of $p$ is this probability
maximum? In what domain, of dimension $\Delta p$, does it take on non-negligible
values? Give an order of magnitude for the product $\Delta x \cdot \Delta p$.


3. Transmission of a “delta function” potential barrier

Consider a particle placed in the same potential as in the preceding exercise.
The particle is now propagating from left to right along the $Ox$ axis, with a positive
energy $E$.

a) Show that a stationary state of the particle can be written:

$$\text{if } x < 0 \qquad \varphi(x) = e^{ikx} + A\, e^{-ikx}$$

$$\text{if } x > 0 \qquad \varphi(x) = B\, e^{ikx}$$

where $k$, $A$ and $B$ are constants which are to be calculated in terms of the
energy $E$, of $m$ and of $\alpha$ (watch out for the discontinuity in $d\varphi/dx$ at $x = 0$).

b) Set $-E_L = -m\alpha^2/2\hbar^2$ (bound state energy of the particle). Calculate,
in terms of the dimensionless parameter $E/E_L$, the reflection coefficient $R$ and the
transmission coefficient $T$ of the barrier. Study their variations with respect to $E$;
what happens when $E \to \infty$? How can this be interpreted? Show that, if the
expression of $T$ is extended for negative values of $E$, it diverges when $E \to -E_L$,
and discuss this result.


4. Return to exercise 2, using, this time, the Fourier transform.


a) Write the eigenvalue equation of $H$ and the Fourier transform of this
equation. Deduce directly from this the expression for $\bar{\varphi}(p)$, the Fourier transform
of $\varphi(x)$, in terms of $p$, $E$, $\alpha$ and $\varphi(0)$. Then show that only one value of $E$, a negative
one, is possible. Only the bound state of the particle, and not the ones in which it
propagates, is found by this method; why? Then calculate $\varphi(x)$ and show that
one can find in this way all the results of exercise 2.

b) The average kinetic energy of the particle can be written (cf. chap. III):

$$E_k = \frac{1}{2m} \int_{-\infty}^{+\infty} p^2\, |\bar{\varphi}(p)|^2\, dp$$

Show that, when $\bar{\varphi}(p)$ is a “sufficiently smooth” function, we also have:

$$E_k = -\frac{\hbar^2}{2m} \int_{-\infty}^{+\infty} \varphi^*(x)\, \frac{d^2\varphi}{dx^2}\, dx$$

These formulas enable us to obtain, in two different ways, the energy $E_k$ for a
particle in the bound state calculated in a). What result is obtained? Note that, in this
case, $\varphi(x)$ is not “regular” at $x = 0$, where its derivative is discontinuous. It is
then necessary to differentiate $\varphi(x)$ in the sense of distributions, which introduces
a contribution of the point $x = 0$ to the average value we are looking for. Interpret
this contribution physically: consider a square well, centered at $x = 0$, whose
width $a$ approaches 0 and whose depth $V_0$ approaches infinity (so that $aV_0 = \alpha$),
and study the behavior of the wave function in this well.


5. Well consisting of two delta functions


Consider a particle of mass $m$ whose potential energy is

$$V(x) = -\alpha\, \delta(x) - \alpha\, \delta(x - l) \qquad \alpha > 0$$

where $l$ is a constant length.

a) Calculate the bound states of the particle, setting $E = -\frac{\hbar^2\rho^2}{2m}$. Show that
the possible energies are given by the relation

$$e^{-\rho l} = \pm\left(1 - \frac{2\rho}{\mu}\right)$$

where $\mu$ is defined by $\mu = \frac{2m\alpha}{\hbar^2}$. Give a graphic solution of this equation.

(i) Ground state. Show that this state is even (invariant with respect to
reflection about the point $x = l/2$), and that its energy $E_S$ is less than the energy $-E_L$
introduced in problem 3. Interpret this result physically. Represent graphically the
corresponding wave function.

(ii) Excited state. Show that, when $l$ is greater than a value which you are to
specify, there exists an odd excited state, of energy $E_A$ greater than $-E_L$. Find the
corresponding wave function.

(iii) Explain how the preceding calculations enable us to construct a model
which represents an ionized diatomic molecule ($H_2^+$, for example) whose nuclei are
separated by a distance $l$. How do the energies of the two levels vary with respect
to $l$? What happens at the limit where $l \to 0$ and at the limit where $l \to \infty$?
If the repulsion of the two nuclei is taken into account, what is the total energy of the
system? Show that the curve which gives the variation with respect to $l$ of the
energies thus obtained enables us to predict in certain cases the existence of bound
states of $H_2^+$, and to determine the value of $l$ at equilibrium. In this way we obtain
a very elementary model of the chemical bond.

b) Calculate the reflection and transmission coefficients of the system of
two delta function barriers. Study their variations with respect to $l$. Do the
resonances thus obtained occur when $l$ is an integral multiple of the de Broglie
wavelength of the particle? Why?
6. Consider a square well potential of width $a$ and depth $V_0$ (in this exercise,
we shall use systematically the notation of § 2-c-α of complement $H_I$). We intend
to study the properties of the bound state of a particle in this well when its width $a$
approaches zero.


a) Show that there indeed exists only one bound state and calculate its
energy $E$ (we find $E \simeq -\frac{mV_0^2a^2}{2\hbar^2}$, that is, an energy which varies with the square
of the area $aV_0$ of the well).


b) Show that $\rho \to 0$ and that $A'_2 = A_2 \simeq B_1/2$. Deduce from this that,
in the bound state, the probability of finding the particle outside the well
approaches 1.

c) How can the preceding considerations be applied to a particle placed, as
in exercise 2, in the potential $V(x) = -\alpha\, \delta(x)$?


7. Consider a particle placed in the potential

$$V(x) = 0 \quad \text{if } x \geq a$$

$$V(x) = -V_0 \quad \text{if } 0 < x < a,$$

with $V(x)$ infinite for negative $x$. Let $\varphi(x)$ be a wave function associated with
a stationary state of the particle. Show that $\varphi(x)$ can be extended to give an odd
wave function which corresponds to a stationary state for a square well of width $2a$
and depth $V_0$ (cf. complement $H_I$, § 2-c-α). Discuss, with respect to $a$ and $V_0$, the
number of bound states of the particle. Is there always at least one such state, as for
the symmetric square well?


8. Consider, in a two-dimensional problem, the oblique reflection of a particle
from a potential step defined by:

$$V(x, y) = 0 \quad \text{if } x < 0$$

$$V(x, y) = V_0 \quad \text{if } x > 0$$

Study the motion of the center of the wave packet. In the case of total reflection,
interpret physically the differences between the trajectory of this center and the
classical trajectory (lateral shift upon reflection). Show that, when $V_0 \to +\infty$, the
quantum trajectory becomes asymptotic to the classical trajectory.
OUTLINE OF CHAPTER II


A. ONE-PARTICLE WAVE FUNCTION SPACE

1. Structure of the wave function space $\mathcal{F}$
   a. $\mathcal{F}$ as a vector space
   b. The scalar product
   c. Linear operators

2. Discrete orthonormal bases in $\mathcal{F}$: $\{u_i(\mathbf{r})\}$
   a. Definition
   b. Components of a wave function in the $\{u_i(\mathbf{r})\}$ basis
   c. Expression for the scalar product in terms of the components
   d. Closure relation

3. Introduction of “bases” not belonging to $\mathcal{F}$
   a. Plane waves
   b. “Delta functions”
   c. Generalization: continuous “orthonormal” bases


B. STATE SPACE. DIRAC NOTATION

2. “Ket” vectors and “bra” vectors
   a. Elements of $\mathcal{E}$: kets
   b. Elements of the dual space $\mathcal{E}^*$ of $\mathcal{E}$: bras
   c. Correspondence between kets and bras

3. Linear operators
   b. Examples of linear operators: projectors

4. Hermitian conjugation
   a. Action of a linear operator on a bra
   b. The adjoint operator $A^\dagger$ of a linear operator $A$
   c. Correspondence between an operator and its adjoint
   d. Hermitian conjugation in Dirac notation
   e. Hermitian operators


C. REPRESENTATIONS IN THE STATE SPACE

   a. Definition of a representation
   b. Aim of § C

2. Relations characteristic of an orthonormal basis
   b. Closure relation

3. Representation of kets and bras
   a. Representation of kets

4. Representation of operators
   a. Representation of $A$ by a “square” matrix
   d. Matrix representation of the adjoint $A^\dagger$ of $A$

5. Change of representations
   a. Statement of the problem
   b. Transformation of the components of a ket
   c. Transformation of the components of a bra
   d. Transformation of the matrix elements of an operator


D. EIGENVALUE EQUATIONS. OBSERVABLES.

1. Eigenvalues and eigenvectors of an operator
   a. Definitions
   b. Finding the eigenvalues and eigenvectors of an operator

2. Observables
   a. Properties of the eigenvalues and eigenvectors of a Hermitian operator
   b. Definition of an observable
   c. Example: the projector $P_\psi$

   a. Important theorems
   b. Complete sets of commuting observables (C.S.C.O.)


E. TWO IMPORTANT EXAMPLES OF REPRESENTATIONS AND OBSERVABLES


F. TENSOR PRODUCT OF STATE SPACES

1. Introduction

2. Definition and properties of the tensor product
   a. Tensor product space $\mathcal{E}$
   b. Tensor product of operators
   c. Notation

3. Eigenvalue equations in the product space
   a. Eigenvalues and eigenvectors of extended operators

4. Applications
   a. One- and three-dimensional particle states
   b. States of a two-particle system
CHAPTER II THE MATHEMATICAL TOOLS OF QUANTUM MECHANICS


This chapter is intended to be a general survey of the basic mathematical tools 
used in quantum mechanics. We shall give a simple condensed presentation aimed 
at facilitating the study of subsequent chapters for readers unfamiliar with these 
tools. We make no attempt to be mathematically complete and rigorous. We feel it 
preferable to limit ourselves to a practical point of view, uniting in a single chapter 
the various concepts which are useful in quantum mechanics. In particular, we wish 
to stress the convenience of the Dirac notation for carrying out the various 
calculations which we shall have to perform. 

In this spirit, we shall try to simplify the discussion as much as possible. 
Neither the general definitions nor the rigorous proofs which would be required 
by a mathematician are to be found here. For example, we shall sometimes speak 
of infinite-dimensional spaces and reason as if they had a finite number of dimensions. 
Moreover, many terms (square-integrable functions, basis, etc...) will be employed 
with a meaning which, although commonly used in physics, is not exactly the one 
used in pure mathematics. 

We begin in § A by studying the wave functions introduced in chapter I.
We show that these wave functions belong to an abstract vector space, which we
call the “wave function space $\mathcal{F}$”. This study will be carried out in detail as it
introduces some basic concepts of the mathematical formalism of quantum
mechanics: scalar products, linear operators, basis, etc. Starting in § B, we shall
develop a more general formalism, characterizing the state of a system by a
“state vector” belonging to a vector space: the “state space $\mathcal{E}$”. Dirac notation,
which greatly simplifies calculations in this formalism, is introduced. § C is intended
to study the idea of a representation. The reading of § D is particularly recommended
to the reader who is unfamiliar with the diagonalization of an operator:
this operation will be constantly useful to us in what follows. In § E, we treat two
important examples of representations. In particular, we show how the wave functions
studied in § A are the “components” of state vectors in a particular representation.
Finally, we introduce in § F the concept of a tensor product. This
concept will be illustrated more concretely by a simple example in complement $D_{IV}$.


A. ONE-PARTICLE WAVE FUNCTION SPACE 

The probabilistic interpretation of the wave function $\psi(\mathbf{r}, t)$ of a particle was
given in the preceding chapter: $|\psi(\mathbf{r}, t)|^2\, d^3r$ represents the probability of finding,
at time $t$, the particle in a volume $d^3r = dx\, dy\, dz$ about the point $\mathbf{r}$. The total
probability of finding the particle somewhere in space is equal to 1, so we must have:

$$\int d^3r\, |\psi(\mathbf{r}, t)|^2 = 1 \qquad \text{(A-1)}$$

where the integration extends over all space.

Thus, we are led to studying the set of square-integrable functions. These are
functions for which the integral (A-1) converges. This set is called $L^2$ by mathematicians
and it has the structure of a Hilbert space.

From a physical point of view, it is clear that the set $L^2$ is too wide in scope:
given the meaning attributed to $|\psi(\mathbf{r}, t)|^2$, the wave functions which are actually
used possess certain properties of regularity. We can only retain the functions
$\psi(\mathbf{r}, t)$ which are everywhere defined, continuous, and infinitely differentiable
(for example, to state that a function is really discontinuous at a given point in
space has no physical meaning, since no experiment enables us to have access to
real phenomena on a very small scale, say of $10^{-30}$ m). It is also possible to confine
ourselves to wave functions which have a bounded domain (which makes it certain
that the particle can be found within a finite region of space, for example inside the
laboratory). We shall not try to give a precise, general list of these supplementary
conditions: we shall call $\mathcal{F}$ the set of wave functions composed of sufficiently regular
functions of $L^2$ ($\mathcal{F}$ is a subspace of $L^2$).


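1. Structure of the wave function space $\mathcal{F}$

a. $\mathcal{F}$ AS A VECTOR SPACE

It can easily be shown that $\mathcal{F}$ satisfies all the criteria of a vector space. As an
example, we demonstrate that if $\psi_1(\mathbf{r})$ and $\psi_2(\mathbf{r}) \in \mathcal{F}$, then*:

$$\psi(\mathbf{r}) = \lambda_1 \psi_1(\mathbf{r}) + \lambda_2 \psi_2(\mathbf{r}) \in \mathcal{F} \qquad \text{(A-2)}$$

where $\lambda_1$ and $\lambda_2$ are two arbitrary complex numbers.

In order to show that $\psi(\mathbf{r})$ is square-integrable, expand $|\psi(\mathbf{r})|^2$:

$$|\psi(\mathbf{r})|^2 = |\lambda_1|^2 |\psi_1(\mathbf{r})|^2 + |\lambda_2|^2 |\psi_2(\mathbf{r})|^2 + \lambda_1^* \lambda_2\, \psi_1^*(\mathbf{r})\, \psi_2(\mathbf{r}) + \lambda_1 \lambda_2^*\, \psi_1(\mathbf{r})\, \psi_2^*(\mathbf{r}) \qquad \text{(A-3)}$$

The last two terms of (A-3) have the same modulus, which has as an upper limit:

$$|\lambda_1|\, |\lambda_2| \left[ |\psi_1(\mathbf{r})|^2 + |\psi_2(\mathbf{r})|^2 \right]$$

$|\psi(\mathbf{r})|^2$ is therefore smaller than a function whose integral converges, since $\psi_1$ and $\psi_2$ are
square-integrable.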

b. THE SCALAR PRODUCT 


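α. Definition

With each pair of elements of $\mathcal{F}$, $\varphi(\mathbf{r})$ and $\psi(\mathbf{r})$, taken in this order, we
associate a complex number, denoted by $(\varphi, \psi)$, which, by definition, is equal to:

$$(\varphi, \psi) = \int d^3r\, \varphi^*(\mathbf{r})\, \psi(\mathbf{r}) \qquad \text{(A-4)}$$

$(\varphi, \psi)$ is the scalar product of $\psi(\mathbf{r})$ by $\varphi(\mathbf{r})$ [this integral always converges if $\varphi$ and $\psi$
belong to $\mathcal{F}$].

β. Properties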

They follow from definition (A-4) : 


* The symbol $\in$ signifies: “belongs to”.
$$(\varphi, \psi) = (\psi, \varphi)^* \qquad \text{(A-5)}$$

$$(\varphi, \lambda_1 \psi_1 + \lambda_2 \psi_2) = \lambda_1 (\varphi, \psi_1) + \lambda_2 (\varphi, \psi_2) \qquad \text{(A-6)}$$

$$(\lambda_1 \varphi_1 + \lambda_2 \varphi_2, \psi) = \lambda_1^* (\varphi_1, \psi) + \lambda_2^* (\varphi_2, \psi) \qquad \text{(A-7)}$$

The scalar product is linear with respect to the second function of the pair, antilinear
with respect to the first one. If $(\varphi, \psi) = 0$, $\varphi(\mathbf{r})$ and $\psi(\mathbf{r})$ are said to be
orthogonal.

$$(\psi, \psi) = \int d^3r\, |\psi(\mathbf{r})|^2 \qquad \text{(A-8)}$$

is a real, positive number, which is zero if and only if $\psi(\mathbf{r}) = 0$.

$\sqrt{(\psi, \psi)}$ is called the norm of $\psi(\mathbf{r})$ [it can easily be verified that this number
has all the properties of a norm]. The scalar product chosen above thus permits
the definition of a norm in $\mathcal{F}$.

Let us mention, finally, (cf. complement $A_{II}$) the Schwarz inequality:

$$|(\psi_1, \psi_2)| \leq \sqrt{(\psi_1, \psi_1)}\, \sqrt{(\psi_2, \psi_2)} \qquad \text{(A-9)}$$

This becomes an equality if and only if the two functions $\psi_1$ and $\psi_2$ are proportional.
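Properties (A-5), (A-7) and (A-9) can be checked numerically on simple examples. The sketch below is restricted to one dimension and replaces the integral (A-4) by a trapezoid sum on a finite grid; the two test functions, the grid and the function names are illustrative assumptions, not taken from the text.

```python
import cmath
import math

N_POINTS = 4001
X_MIN, X_MAX = -10.0, 10.0
DX = (X_MAX - X_MIN) / (N_POINTS - 1)
GRID = [X_MIN + i * DX for i in range(N_POINTS)]

def scalar_product(phi, psi):
    """One-dimensional analogue of (A-4): integral of conj(phi) * psi (trapezoid rule)."""
    values = [complex(phi(x)).conjugate() * psi(x) for x in GRID]
    return DX * (sum(values) - 0.5 * (values[0] + values[-1]))

def psi1(x):
    return math.exp(-x * x / 2.0)                          # real Gaussian

def psi2(x):
    return cmath.exp(1j * x) * math.exp(-(x - 1.0) ** 2)   # shifted Gaussian with a phase

s12 = scalar_product(psi1, psi2)
s21 = scalar_product(psi2, psi1)
norm1 = math.sqrt(scalar_product(psi1, psi1).real)
norm2 = math.sqrt(scalar_product(psi2, psi2).real)

lam = 2.0 - 3.0j
antilinear_lhs = scalar_product(lambda x: lam * psi1(x), psi2)   # (lam psi1, psi2)
```

Here `s12` and `s21` are complex conjugates of each other, `antilinear_lhs` equals `lam.conjugate() * s12`, and `abs(s12)` does not exceed `norm1 * norm2`.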


c. LINEAR OPERATORS 

α. Definition

A linear operator $A$ is, by definition, a mathematical entity which associates
with every function $\psi(\mathbf{r}) \in \mathcal{F}$ another function $\psi'(\mathbf{r})$, the correspondence being
linear:

$$\psi'(\mathbf{r}) = A\, \psi(\mathbf{r}) \qquad \text{(A-10-a)}$$

$$A[\lambda_1 \psi_1(\mathbf{r}) + \lambda_2 \psi_2(\mathbf{r})] = \lambda_1 A\, \psi_1(\mathbf{r}) + \lambda_2 A\, \psi_2(\mathbf{r}) \qquad \text{(A-10-b)}$$

Let us cite some simple examples of linear operators: the parity operator $\Pi$,
whose definition is:

$$\Pi\, \psi(x, y, z) = \psi(-x, -y, -z) \qquad \text{(A-11)}$$

the operator which performs a multiplication by $x$, which we shall call $X$, and which
is defined by:

$$X\, \psi(x, y, z) = x\, \psi(x, y, z) \qquad \text{(A-12)}$$

finally, the operator which differentiates with respect to $x$, which we shall call $D_x$,
and whose definition is:

$$D_x\, \psi(x, y, z) = \frac{\partial \psi(x, y, z)}{\partial x} \qquad \text{(A-13)}$$

[the two operators $X$ and $D_x$, acting on a function $\psi(\mathbf{r}) \in \mathcal{F}$, can transform it into
a function which is no longer necessarily square-integrable].
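β. Product of operators

Let $A$ and $B$ be two linear operators. Their product $AB$ is defined by:

$$(AB)\, \psi(\mathbf{r}) = A[B\, \psi(\mathbf{r})] \qquad \text{(A-14)}$$

$B$ is first allowed to act on $\psi(\mathbf{r})$, which gives $\varphi(\mathbf{r}) = B\, \psi(\mathbf{r})$, then $A$ operates on the
new function $\varphi(\mathbf{r})$.

In general, $AB \neq BA$. We call the commutator of $A$ and $B$ the operator
written $[A, B]$ and defined by

$$[A, B] = AB - BA \qquad \text{(A-15)}$$

We shall calculate, as an example, the commutator $[X, D_x]$. In order to do this, we shall
take an arbitrary function $\psi(\mathbf{r})$:

$$[X, D_x]\, \psi(\mathbf{r}) = \left( x \frac{\partial}{\partial x} - \frac{\partial}{\partial x}\, x \right) \psi(\mathbf{r}) = x \frac{\partial}{\partial x} \psi(\mathbf{r}) - \psi(\mathbf{r}) - x \frac{\partial}{\partial x} \psi(\mathbf{r}) = -\psi(\mathbf{r}) \qquad \text{(A-16)}$$

Since this is true for all $\psi(\mathbf{r})$, it can be deduced that:

$$[X, D_x] = -1 \qquad \text{(A-17)}$$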

2. Discrete orthonormal bases in $\mathcal{F}$: $\{u_i(\mathbf{r})\}$

a. DEFINITION

Consider a countable set of functions of $\mathcal{F}$, labeled by a discrete index $i$
($i = 1, 2, ..., n, ...$):

$$u_1(\mathbf{r}) \in \mathcal{F},\ u_2(\mathbf{r}) \in \mathcal{F},\ \ldots,\ u_i(\mathbf{r}) \in \mathcal{F},\ \ldots$$

- The set $\{u_i(\mathbf{r})\}$ is orthonormal if:

$$(u_i, u_j) = \int d^3r\, u_i^*(\mathbf{r})\, u_j(\mathbf{r}) = \delta_{ij} \qquad \text{(A-18)}$$

where $\delta_{ij}$, the Kronecker delta function, is equal to 1 for $i = j$ and to 0 for $i \neq j$.

- It constitutes a basis* if every function $\psi(\mathbf{r}) \in \mathcal{F}$ can be expanded in one
and only one way in terms of the $u_i(\mathbf{r})$:

$$\psi(\mathbf{r}) = \sum_i c_i\, u_i(\mathbf{r}) \qquad \text{(A-19)}$$

* When the set $\{u_i(\mathbf{r})\}$ constitutes a basis, it is sometimes said to be a complete set of functions.
It must be noted that the word complete is used with a meaning different from the one it usually has in
mathematics.
b. COMPONENTS OF A WAVE FUNCTION IN THE $\{u_i(\mathbf{r})\}$ BASIS

Multiply the two sides of (A-19) by $u_j^*(\mathbf{r})$ and integrate over all space.
From (A-6) and (A-18)*:

$$(u_j, \psi) = \left( u_j, \sum_i c_i\, u_i \right) = \sum_i c_i\, (u_j, u_i) = \sum_i c_i\, \delta_{ij} = c_j \qquad \text{(A-20)}$$

that is:

$$c_i = (u_i, \psi) = \int d^3r\, u_i^*(\mathbf{r})\, \psi(\mathbf{r}) \qquad \text{(A-21)}$$

The component $c_i$ of $\psi(\mathbf{r})$ on $u_i(\mathbf{r})$ is therefore equal to the scalar product
of $\psi(\mathbf{r})$ by $u_i(\mathbf{r})$. Once the $\{u_i(\mathbf{r})\}$ basis has been chosen, it is equivalent to
specify $\psi(\mathbf{r})$ or the set of its components $c_i$ with respect to the basis functions. The
set of numbers $c_i$ is said to represent $\psi(\mathbf{r})$ in the $\{u_i(\mathbf{r})\}$ basis.

comments:

(i) Note the analogy with an orthonormal basis $\{\mathbf{e}_1, \mathbf{e}_2, \mathbf{e}_3\}$ of the ordinary three-dimensional
space, $R^3$. The fact that $\mathbf{e}_1$, $\mathbf{e}_2$ and $\mathbf{e}_3$ are orthogonal and unitary can indeed be
expressed by:

$$\mathbf{e}_i \cdot \mathbf{e}_j = \delta_{ij} \qquad (i, j = 1, 2, 3) \qquad \text{(A-22)}$$

Any vector $\mathbf{V}$ of $R^3$ can be expanded in this basis:

$$\mathbf{V} = \sum_{i=1}^{3} v_i\, \mathbf{e}_i \qquad \text{(A-23)}$$

with

$$v_i = \mathbf{e}_i \cdot \mathbf{V} \qquad \text{(A-24)}$$

Formulas (A-18), (A-19) and (A-21) thus generalize, as it were, the well-known formulas
(A-22), (A-23) and (A-24). However, it must be noted that the $v_i$ are real numbers, while
the $c_i$ are complex numbers.

(ii) The same function $\psi(\mathbf{r})$ obviously has different components in two different bases. We
shall study the problem of a change in basis later.

(iii) We can also, in the $\{u_i(\mathbf{r})\}$ basis, represent a linear operator $A$ by a set of numbers
which can be arranged in the form of a matrix. We shall take up this question again
in § C, after we have introduced Dirac notation.

c. EXPRESSION FOR THE SCALAR PRODUCT IN TERMS OF THE COMPONENTS 

Let $\varphi(\mathbf{r})$ and $\psi(\mathbf{r})$ be two wave functions which can be expanded as follows:

$$\varphi(\mathbf{r}) = \sum_i b_i\, u_i(\mathbf{r})$$

$$\psi(\mathbf{r}) = \sum_j c_j\, u_j(\mathbf{r}) \qquad \text{(A-25)}$$

* To be completely rigorous, one should make certain that one can interchange $\sum_i$ and $\int d^3r$.
We shall systematically ignore this kind of problem.
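Their scalar product can be calculated by using (A-6), (A-7) and (A-18):

$$(\varphi, \psi) = \left( \sum_i b_i\, u_i,\ \sum_j c_j\, u_j \right) = \sum_{i,j} b_i^*\, c_j\, (u_i, u_j) = \sum_{i,j} b_i^*\, c_j\, \delta_{ij}$$

that is:

$$(\varphi, \psi) = \sum_i b_i^*\, c_i \qquad \text{(A-26)}$$

In particular:

$$(\psi, \psi) = \sum_i |c_i|^2 \qquad \text{(A-27)}$$

The scalar product of two wave functions (or the square of the norm of a wave
function) can thus be very simply expressed in terms of the components of these
functions in the $\{u_i(\mathbf{r})\}$ basis.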

comment: 

Let $\mathbf{V}$ and $\mathbf{W}$ be two vectors of $R^3$, with components $v_i$ and $w_i$. The analytic
expression of their scalar product is well-known:

$$\mathbf{V} \cdot \mathbf{W} = \sum_{i=1}^{3} v_i \, w_i \tag{A-28}$$

Formula (A-26) can therefore be considered to be a generalization of (A-28). 
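
Formulas (A-26) and (A-27) can be verified on a small example. The sketch below assumes the same illustrative sine basis $u_n(x) = \sqrt{2}\sin(n\pi x)$ on $[0, 1]$ and two arbitrary sets of complex components `b` and `c`; these values and the helper names are hypothetical. It compares the integral definition of the scalar product with the sum over components.

```python
import math

def u(n, x):
    # Assumed orthonormal basis on [0, 1].
    return math.sqrt(2.0) * math.sin(n * math.pi * x)

def integrate(f, steps=2000):
    """Midpoint-rule approximation of the integral of f over [0, 1]."""
    h = 1.0 / steps
    return h * sum(f((k + 0.5) * h) for k in range(steps))

b = [1.0 + 2.0j, 0.5j, -1.0]     # components of phi (arbitrary choice)
c = [0.3, 1.0 - 1.0j, 2.0j]      # components of psi (arbitrary choice)

def expand(coeffs, x):
    # Value at x of sum_n coeffs[n] u_{n+1}(x).
    return sum(a * u(n + 1, x) for n, a in enumerate(coeffs))

# Left-hand side: scalar product computed as an integral of phi* psi.
lhs = integrate(lambda x: expand(b, x).conjugate() * expand(c, x))
# Right-hand side of (A-26): sum of b_i* c_i.
rhs = sum(bi.conjugate() * ci for bi, ci in zip(b, c))

# Square of the norm, computed both ways as in (A-27).
norm_sq_integral = integrate(lambda x: abs(expand(c, x)) ** 2)
norm_sq_components = sum(abs(ci) ** 2 for ci in c)
```

The complex conjugate falls on the components of the first function only, which is the feature that distinguishes (A-26) from the real formula (A-28).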


d. CLOSURE RELATION 


Relation (A-18), called the orthonormalization relation, expresses the fact
that the functions of the set $\{u_i(\mathbf{r})\}$ are normalized to 1 and orthogonal with
respect to each other. We are now going to establish another relation, called the
closure relation, which expresses the fact that this set constitutes a basis.

If $\{u_i(\mathbf{r})\}$ is a basis of $\mathcal{F}$, there exists an expansion such as (A-19) for every
function $\psi(\mathbf{r}) \in \mathcal{F}$. Substitute into (A-19) expression (A-21) for the various
components $c_i$ [the name of the integration variable must be changed, since $\mathbf{r}$
already appears in (A-19)]:

$$\psi(\mathbf{r}) = \sum_i c_i \, u_i(\mathbf{r}) = \sum_i (u_i, \psi) \, u_i(\mathbf{r}) = \sum_i \Big[ \int d^3r' \, u_i^*(\mathbf{r}') \, \psi(\mathbf{r}') \Big] u_i(\mathbf{r}) \tag{A-29}$$

Interchanging $\sum_i$ and $\int d^3r'$, we obtain:

$$\psi(\mathbf{r}) = \int d^3r' \, \psi(\mathbf{r}') \Big[ \sum_i u_i(\mathbf{r}) \, u_i^*(\mathbf{r}') \Big] \tag{A-30}$$

$\sum_i u_i(\mathbf{r}) \, u_i^*(\mathbf{r}')$ is therefore a function $F(\mathbf{r}, \mathbf{r}')$ of $\mathbf{r}$ and of $\mathbf{r}'$ such that, for every
function $\psi(\mathbf{r})$, we have:

$$\psi(\mathbf{r}) = \int d^3r' \, \psi(\mathbf{r}') \, F(\mathbf{r}, \mathbf{r}') \tag{A-31}$$

Equation (A-31) is characteristic of the function $\delta(\mathbf{r} - \mathbf{r}')$ (cf. appendix II). From
this it can be deduced that:

$$\sum_i u_i(\mathbf{r}) \, u_i^*(\mathbf{r}') = \delta(\mathbf{r} - \mathbf{r}') \tag{A-32}$$

Reciprocally, if an orthonormal set $\{u_i(\mathbf{r})\}$ satisfies the closure relation (A-32),
it constitutes a basis. Any function $\psi(\mathbf{r})$ can indeed be written in the form:

$$\psi(\mathbf{r}) = \int d^3r' \, \psi(\mathbf{r}') \, \delta(\mathbf{r} - \mathbf{r}') \tag{A-33}$$

Substituting expression (A-32) for $\delta(\mathbf{r} - \mathbf{r}')$ into this expression, we obtain
formula (A-30). To return to (A-29), all we must do is again interchange summation
and integration. This equation then expresses the fact that $\psi(\mathbf{r})$ can always be
expanded in terms of the $u_i(\mathbf{r})$ and gives the coefficients of this expansion.


comment : 

We shall re-examine the closure relation using the Dirac notation in § C, and we 
shall see that it can be given a simple geometric interpretation. 
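
The way a closure sum acts like a delta function can be seen on a truncated sum. The sketch below again assumes the illustrative sine basis $u_n(x) = \sqrt{2}\sin(n\pi x)$ on $[0, 1]$ and a hypothetical test function; `kernel` is the partial sum $F_N(x, x') = \sum_{n \le N} u_n(x) u_n(x')$. It checks that integrating a function against $F_N$ returns approximately its value at $x$, as in (A-31), and that the peak of $F_N$ at $x = x'$ grows with $N$.

```python
import math

def u(n, x):
    # Assumed orthonormal basis on [0, 1].
    return math.sqrt(2.0) * math.sin(n * math.pi * x)

def kernel(x, xp, N):
    """Truncated closure sum F_N(x, x') = sum_{n=1}^{N} u_n(x) u_n(x')."""
    return sum(u(n, x) * u(n, xp) for n in range(1, N + 1))

def psi(x):
    # Example function vanishing at both ends of the interval.
    return x * (1.0 - x)

def apply_kernel(x, N, steps=1000):
    """Midpoint-rule value of the integral over x' of psi(x') F_N(x, x')."""
    h = 1.0 / steps
    return h * sum(psi((k + 0.5) * h) * kernel(x, (k + 0.5) * h, N)
                   for k in range(steps))

x0 = 0.3
approx_10 = apply_kernel(x0, 10)   # close to psi(0.3) = 0.21
approx_40 = apply_kernel(x0, 40)   # close to psi(0.3) = 0.21
peak_10 = kernel(0.5, 0.5, 10)     # height of F_N at x = x' = 0.5
peak_40 = kernel(0.5, 0.5, 40)     # grows linearly with N
```

For this particular basis the height of the central peak at $x = 0.5$ equals $N$ when $N$ is even, so the partial sums become ever narrower and higher while their action on a smooth function stays finite.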


3. Introduction of "bases" not belonging to $\mathcal{F}$

The $\{u_i(\mathbf{r})\}$ bases studied above are composed of square-integrable functions.
It can also be convenient to introduce “bases” of functions not belonging to
either $\mathcal{F}$ or $L^2$, but in terms of which any wave function $\psi(\mathbf{r})$ can nevertheless be
expanded. We are going to give examples of such bases and we shall show how it is
possible to extend to them the important formulas which were established in the
preceding section.


a. PLANE WAVES 

For simplicity, we treat the one-dimensional case. We shall therefore study
square-integrable functions $\psi(x)$ which depend only on the $x$ variable. In chapter I
we saw the advantage of using the Fourier transform $\bar{\psi}(p)$ of $\psi(x)$:

$$\psi(x) = \frac{1}{\sqrt{2\pi\hbar}} \int_{-\infty}^{+\infty} dp \, \bar{\psi}(p) \, e^{ipx/\hbar} \tag{A-34-a}$$

$$\bar{\psi}(p) = \frac{1}{\sqrt{2\pi\hbar}} \int_{-\infty}^{+\infty} dx \, \psi(x) \, e^{-ipx/\hbar} \tag{A-34-b}$$

Consider the function $v_p(x)$ defined by:

$$v_p(x) = \frac{1}{\sqrt{2\pi\hbar}} \, e^{ipx/\hbar} \tag{A-35}$$

$v_p(x)$ is a plane wave, with the wave vector $p/\hbar$. The integral over the whole $x$ axis
of $|v_p(x)|^2 = \frac{1}{2\pi\hbar}$ diverges. Therefore $v_p(x) \notin \mathcal{F}_x$. We shall designate by $\{v_p(x)\}$
the set of all plane waves, that is, of all functions $v_p(x)$ corresponding to the various
values of $p$. The number $p$, which varies continuously between $-\infty$ and $+\infty$,
will be considered as a continuous index which permits us to label the various
functions of the set $\{v_p(x)\}$ [recall that the index $i$ used for the set $\{u_i(\mathbf{r})\}$
considered above was discrete].

Formulas (A-34) can be rewritten using (A-35):

$$\psi(x) = \int_{-\infty}^{+\infty} dp \, \bar{\psi}(p) \, v_p(x) \tag{A-36}$$

$$\bar{\psi}(p) = (v_p, \psi) = \int_{-\infty}^{+\infty} dx \, v_p^*(x) \, \psi(x) \tag{A-37}$$

These two formulas can be compared to (A-19) and (A-21). Relation (A-36) expresses
the idea that every function $\psi(x) \in \mathcal{F}_x$ can be expanded in one and only one way
in terms of the $v_p(x)$, that is, the plane waves. Since the index $p$ varies continuously
and not discretely, the summation $\sum_i$ appearing in (A-19) must be replaced by
an integration over $p$. Relation (A-37), like (A-21), gives the component $\bar{\psi}(p)$
of $\psi(x)$ on $v_p(x)$ in the form of a scalar product* $(v_p, \psi)$. The set of these components,
which correspond to the various possible values of $p$, constitutes a function of $p$, $\bar{\psi}(p)$,
the Fourier transform of $\psi(x)$.

Thus, $\bar{\psi}(p)$ is the analogue of $c_i$. These two complex numbers, which depend
either on $p$ or on $i$, represent the components of the same function $\psi(x)$ in two
different bases: $\{v_p(x)\}$ and $\{u_i(x)\}$.

This point also appears clearly if we calculate the square of the norm of $\psi(x)$.
According to Parseval’s relation [app. I, formula (45)], we have:

$$(\psi, \psi) = \int_{-\infty}^{+\infty} dp \, |\bar{\psi}(p)|^2 \tag{A-38}$$

a formula which resembles (A-27), if we replace $c_i$ by $\bar{\psi}(p)$ and $\sum_i$ by $\int dp$.
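
Relations (A-37) and (A-38) can be illustrated numerically. The sketch below makes several assumptions that are not in the text: units with $\hbar = 1$, a Gaussian wave packet of mean momentum `P0`, and simple Riemann sums on finite grids. It computes the components $\bar{\psi}(p) = (v_p, \psi)$ and compares the square of the norm evaluated in the two representations.

```python
import cmath
import math

HBAR = 1.0   # assumption: units in which hbar = 1
P0 = 1.5     # assumed mean momentum of the example wave packet

def psi(x):
    # Normalized Gaussian wave packet (example choice).
    return math.pi ** -0.25 * cmath.exp(-0.5 * x * x + 1j * P0 * x / HBAR)

def v(p, x):
    # Plane wave v_p(x) of (A-35).
    return cmath.exp(1j * p * x / HBAR) / math.sqrt(2.0 * math.pi * HBAR)

DX = 0.05
XS = [-10.0 + DX * k for k in range(401)]       # x grid on [-10, 10]

def psi_bar(p):
    # Component of psi on v_p, equation (A-37), as a Riemann sum over x.
    return DX * sum(v(p, x).conjugate() * psi(x) for x in XS)

DP = 0.1
PS = [P0 - 8.0 + DP * k for k in range(161)]    # p grid centered on P0

norm_x = DX * sum(abs(psi(x)) ** 2 for x in XS)       # integral of |psi(x)|^2
norm_p = DP * sum(abs(psi_bar(p)) ** 2 for p in PS)   # integral of |psi_bar(p)|^2
exact_at_p0 = math.pi ** -0.25   # analytic value of psi_bar(P0) when hbar = 1
```

Both sums come out equal to 1 to high accuracy, even though the functions $v_p(x)$ used to compute the components are not themselves square-integrable.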

Let us show that the $v_p(x)$ satisfy a closure relation. Using the
formula [cf. appendix II, equation (34)]:

$$\frac{1}{2\pi} \int_{-\infty}^{+\infty} dk \, e^{iku} = \delta(u) \tag{A-39}$$

we find:

$$\int_{-\infty}^{+\infty} dp \, v_p(x) \, v_p^*(x') = \frac{1}{2\pi} \int_{-\infty}^{+\infty} \frac{dp}{\hbar} \, e^{i\frac{p}{\hbar}(x - x')} = \delta(x - x') \tag{A-40}$$

* We have only defined the scalar product for two square-integrable functions, but this definition
can easily be extended to cases like this one, provided that the corresponding integral converges.

This formula is the analogue of (A-32) with, again, the substitution of $\int dp$ for $\sum_i$.

Finally, let us calculate the scalar product $(v_p, v_{p'})$ in order to see if there
exists an equivalent of the orthonormalization relation. Again using (A-39), we
obtain:

$$(v_p, v_{p'}) = \int_{-\infty}^{+\infty} dx \, v_p^*(x) \, v_{p'}(x) = \frac{1}{2\pi} \int_{-\infty}^{+\infty} \frac{dx}{\hbar} \, e^{i\frac{x}{\hbar}(p' - p)}$$

that is:

$$(v_p, v_{p'}) = \delta(p - p') \tag{A-41}$$

Compare (A-41) and (A-18). Instead of having two discrete indices $i$ and $j$ and
a Kronecker delta $\delta_{ij}$, we now have two continuous indices $p$ and $p'$ and a delta
function of the difference between the indices, $\delta(p - p')$. Note that if we set $p = p'$,
the scalar product $(v_p, v_p)$ diverges; again we see that $v_p(x) \notin \mathcal{F}_x$. Although this
constitutes a misuse of the term, we shall call (A-41) an “orthonormalization”
relation. It is also sometimes said that the $v_p(x)$ are “orthonormalized in the
Dirac sense”.

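The generalization to three dimensions presents no difficulties. We consider
the plane waves:

$$v_{\mathbf{p}}(\mathbf{r}) = \Big( \frac{1}{2\pi\hbar} \Big)^{3/2} e^{i\mathbf{p} \cdot \mathbf{r}/\hbar} \tag{A-42}$$

The functions of the $\{v_{\mathbf{p}}(\mathbf{r})\}$ basis now depend on the three continuous indices $p_x$,
$p_y$, $p_z$, condensed into the notation $\mathbf{p}$. It is then easy to show that the following
formulas are valid:

$$\psi(\mathbf{r}) = \int d^3p \, \bar{\psi}(\mathbf{p}) \, v_{\mathbf{p}}(\mathbf{r}) \tag{A-43}$$

$$\bar{\psi}(\mathbf{p}) = (v_{\mathbf{p}}, \psi) = \int d^3r \, v_{\mathbf{p}}^*(\mathbf{r}) \, \psi(\mathbf{r}) \tag{A-44}$$

$$(\varphi, \psi) = \int d^3p \, \bar{\varphi}^*(\mathbf{p}) \, \bar{\psi}(\mathbf{p}) \tag{A-45}$$

$$\int d^3p \, v_{\mathbf{p}}(\mathbf{r}) \, v_{\mathbf{p}}^*(\mathbf{r}') = \delta(\mathbf{r} - \mathbf{r}') \tag{A-46}$$

$$(v_{\mathbf{p}}, v_{\mathbf{p}'}) = \delta(\mathbf{p} - \mathbf{p}') \tag{A-47}$$

They represent the generalizations of (A-36), (A-37), (A-38), (A-40) and (A-41).

Thus the $v_{\mathbf{p}}(\mathbf{r})$ can be considered to constitute a “continuous basis”. All the
formulas established above for the discrete basis $\{u_i(\mathbf{r})\}$ can be extended to this
continuous basis, using the correspondence rules summarized in table (II-1).

| Discrete basis | Continuous basis |
|---|---|
| $i$ | $\mathbf{p}$ |
| $\sum_i$ | $\int d^3p$ |
| $\delta_{ij}$ | $\delta(\mathbf{p} - \mathbf{p}')$ |

Table (II-1)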


b. "DELTA FUNCTIONS" 

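In the same way, let us introduce a set of functions of $\mathbf{r}$, $\{\xi_{\mathbf{r}_0}(\mathbf{r})\}$, labeled
by the continuous index $\mathbf{r}_0$ (condensed notation for $x_0$, $y_0$, $z_0$) and defined by:

$$\xi_{\mathbf{r}_0}(\mathbf{r}) = \delta(\mathbf{r} - \mathbf{r}_0) \tag{A-48}$$

$\{\xi_{\mathbf{r}_0}(\mathbf{r})\}$ represents the set of delta functions centered at the various points $\mathbf{r}_0$ of
space; $\xi_{\mathbf{r}_0}(\mathbf{r})$ is obviously not square-integrable: $\xi_{\mathbf{r}_0}(\mathbf{r}) \notin \mathcal{F}$.

Then consider the following relations, which are valid for every function $\psi(\mathbf{r})$
belonging to $\mathcal{F}$:

$$\psi(\mathbf{r}) = \int d^3r_0 \, \psi(\mathbf{r}_0) \, \delta(\mathbf{r} - \mathbf{r}_0) \tag{A-49}$$

$$\psi(\mathbf{r}_0) = \int d^3r \, \delta(\mathbf{r}_0 - \mathbf{r}) \, \psi(\mathbf{r}) \tag{A-50}$$

They can be rewritten, using (A-48), in the form:

$$\psi(\mathbf{r}) = \int d^3r_0 \, \psi(\mathbf{r}_0) \, \xi_{\mathbf{r}_0}(\mathbf{r}) \tag{A-51}$$

$$\psi(\mathbf{r}_0) = \int d^3r \, \xi_{\mathbf{r}_0}^*(\mathbf{r}) \, \psi(\mathbf{r}) = (\xi_{\mathbf{r}_0}, \psi) \tag{A-52}$$

(A-51) expresses the fact that every function $\psi(\mathbf{r}) \in \mathcal{F}$ can be expanded in one
and only one way in terms of the $\xi_{\mathbf{r}_0}(\mathbf{r})$. (A-52) shows that the component of $\psi(\mathbf{r})$
on the function $\xi_{\mathbf{r}_0}(\mathbf{r})$ (we are dealing here with real basis functions) is precisely the
value $\psi(\mathbf{r}_0)$ of $\psi(\mathbf{r})$ at the point $\mathbf{r}_0$. (A-51) and (A-52) are analogous to (A-19)
and (A-21): we simply replace the discrete index $i$ by the continuous index $\mathbf{r}_0$,
and $\sum_i$ by $\int d^3r_0$.

$\psi(\mathbf{r}_0)$ is therefore the equivalent of $c_i$: these two complex numbers, which
depend either on $\mathbf{r}_0$ or on $i$, represent the components of the same function $\psi(\mathbf{r})$ in
two different bases: $\{\xi_{\mathbf{r}_0}(\mathbf{r})\}$ and $\{u_i(\mathbf{r})\}$.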



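Formula (A-26) becomes here:

$$(\varphi, \psi) = \int d^3r_0 \, \varphi^*(\mathbf{r}_0) \, \psi(\mathbf{r}_0) \tag{A-53}$$

We see that the application of (A-26) to the case of the continuous basis $\{\xi_{\mathbf{r}_0}(\mathbf{r})\}$
results in the definition (A-4) of the scalar product.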

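Finally, note that the $\xi_{\mathbf{r}_0}(\mathbf{r})$ satisfy “orthonormalization” and closure
relations of the same type as those for the $v_{\mathbf{p}}(\mathbf{r})$. Thus we have [formula (28)
of appendix II]:

$$\int d^3r_0 \, \xi_{\mathbf{r}_0}(\mathbf{r}) \, \xi_{\mathbf{r}_0}^*(\mathbf{r}') = \int d^3r_0 \, \delta(\mathbf{r} - \mathbf{r}_0) \, \delta(\mathbf{r}' - \mathbf{r}_0) = \delta(\mathbf{r} - \mathbf{r}') \tag{A-54}$$

and:

$$(\xi_{\mathbf{r}_0}, \xi_{\mathbf{r}_0'}) = \int d^3r \, \delta(\mathbf{r} - \mathbf{r}_0) \, \delta(\mathbf{r} - \mathbf{r}_0') = \delta(\mathbf{r}_0 - \mathbf{r}_0') \tag{A-55}$$

All the formulas established for the discrete basis $\{u_i(\mathbf{r})\}$ can thus be
generalized for the continuous basis $\{\xi_{\mathbf{r}_0}(\mathbf{r})\}$, using the correspondence rules
summarized in table (II-2).

| Discrete basis | Continuous basis |
|---|---|
| $i$ | $\mathbf{r}_0$ |
| $\sum_i$ | $\int d^3r_0$ |
| $\delta_{ij}$ | $\delta(\mathbf{r}_0 - \mathbf{r}_0')$ |

Table (II-2)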


IMPORTANT COMMENT: 

The usefulness of the continuous bases which we have just introduced
is revealed more clearly in what follows. However, we must not lose sight of
the following point: a physical state must always correspond to a square-integrable
wave function. In no case can $v_{\mathbf{p}}(\mathbf{r})$ or $\xi_{\mathbf{r}_0}(\mathbf{r})$ represent the state of a particle.
These functions are nothing more than intermediaries, very useful in calculations
involving operations on the wave functions $\psi(\mathbf{r})$ which are used to describe a physical
state.

An analogous situation is encountered in classical optics, where the plane
monochromatic wave is a mathematically very useful, but physically unrealizable,
idealization. Even the most selective filters always permit the passage of a
frequency band $\Delta\nu$, which may be very small but is never exactly zero.

The same holds true for the functions $\xi_{\mathbf{r}_0}(\mathbf{r})$. We can imagine a square-integrable
wave function, localized about $\mathbf{r}_0$, for example:

$$\xi^{(\varepsilon)}_{\mathbf{r}_0}(\mathbf{r}) = \delta^{(\varepsilon)}(\mathbf{r} - \mathbf{r}_0) = \delta^{(\varepsilon)}(x - x_0) \, \delta^{(\varepsilon)}(y - y_0) \, \delta^{(\varepsilon)}(z - z_0)$$

where the $\delta^{(\varepsilon)}$ are functions which have a peak of width $\varepsilon$ and amplitude $1/\varepsilon$, centered
at $x_0$, $y_0$ or $z_0$, such that $\int_{-\infty}^{+\infty} \delta^{(\varepsilon)}(x - x_0) \, dx = 1$ (see § 1-b of appendix II
for examples of such functions). When $\varepsilon \to 0$, $\xi^{(\varepsilon)}_{\mathbf{r}_0}(\mathbf{r}) \to \xi_{\mathbf{r}_0}(\mathbf{r})$, which is
no longer square-integrable. But, in fact, it is impossible to have a physical
state which corresponds to this limit: as localized as the physical state of a
particle may be, $\varepsilon$ is never exactly zero.

c. GENERALIZATION : CONTINUOUS “ORTHONORMAL” BASES 

α. Definition

Generalizing the results obtained in the two preceding paragraphs, we
shall call a continuous “orthonormal” basis a set of functions of $\mathbf{r}$, $\{w_\alpha(\mathbf{r})\}$, labeled by
a continuous index $\alpha$, which satisfy the two following relations, called
orthonormalization and closure relations:

$$(w_\alpha, w_{\alpha'}) = \int d^3r \, w_\alpha^*(\mathbf{r}) \, w_{\alpha'}(\mathbf{r}) = \delta(\alpha - \alpha') \tag{A-56}$$

$$\int d\alpha \, w_\alpha(\mathbf{r}) \, w_\alpha^*(\mathbf{r}') = \delta(\mathbf{r} - \mathbf{r}') \tag{A-57}$$

comments:

(i) If $\alpha = \alpha'$, $(w_\alpha, w_\alpha)$ diverges. Therefore, $w_\alpha(\mathbf{r}) \notin \mathcal{F}$.

(ii) $\alpha$ can represent several indices, as is the case for $\mathbf{r}_0$ and $\mathbf{p}$ in the above examples.

(iii) It is possible to imagine a basis which includes both functions $u_i(\mathbf{r})$, labeled by a discrete
index, and functions $w_\alpha(\mathbf{r})$, labeled by a continuous index. In this case, the set of $u_i(\mathbf{r})$
does not form a basis; the set of $w_\alpha(\mathbf{r})$ must be added to it.

Let us cite an example of this situation. Consider the case of the square well
studied in § D-2-c of chapter I (see also complement $H_I$). As we shall see later, the set of
stationary states of a particle in a time-independent potential constitutes a basis. For
$E < 0$, we have discrete energy levels, to which correspond square-integrable wave
functions labeled by a discrete index. But these are not the only possible stationary states.
Equation (D-17) of chapter I is also satisfied, for all $E > 0$, by solutions which are bounded
but which extend over all space and are thus not square-integrable.

In the case of a “mixed” (discrete and continuous) basis, $\{u_i(\mathbf{r}), w_\alpha(\mathbf{r})\}$, the
orthonormalization relations are:

$$\begin{aligned} (u_i, u_j) &= \delta_{ij} \\ (w_\alpha, w_{\alpha'}) &= \delta(\alpha - \alpha') \\ (u_i, w_\alpha) &= 0 \end{aligned} \tag{A-58}$$

And the closure relation becomes:

$$\sum_i u_i(\mathbf{r}) \, u_i^*(\mathbf{r}') + \int d\alpha \, w_\alpha(\mathbf{r}) \, w_\alpha^*(\mathbf{r}') = \delta(\mathbf{r} - \mathbf{r}') \tag{A-59}$$

β. Components of a wave function $\psi(\mathbf{r})$

We can always write:

$$\psi(\mathbf{r}) = \int d^3r' \, \psi(\mathbf{r}') \, \delta(\mathbf{r} - \mathbf{r}') \tag{A-60}$$

Using the expression for $\delta(\mathbf{r} - \mathbf{r}')$ given by (A-57), and assuming that we can
reverse the order of $\int d^3r'$ and $\int d\alpha$, we obtain:

$$\psi(\mathbf{r}) = \int d\alpha \Big[ \int d^3r' \, w_\alpha^*(\mathbf{r}') \, \psi(\mathbf{r}') \Big] w_\alpha(\mathbf{r})$$

that is:

$$\psi(\mathbf{r}) = \int d\alpha \, c(\alpha) \, w_\alpha(\mathbf{r}) \tag{A-61}$$

with

$$c(\alpha) = (w_\alpha, \psi) = \int d^3r' \, w_\alpha^*(\mathbf{r}') \, \psi(\mathbf{r}') \tag{A-62}$$

(A-61) expresses the fact that every wave function $\psi(\mathbf{r})$ has a unique expansion
in terms of the $w_\alpha(\mathbf{r})$. The component $c(\alpha)$ of $\psi(\mathbf{r})$ on $w_\alpha(\mathbf{r})$ is equal, according
to (A-62), to the scalar product $(w_\alpha, \psi)$.

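γ. Expression for the scalar product and the norm in terms of the components

Let $\varphi(\mathbf{r})$ and $\psi(\mathbf{r})$ be two square-integrable functions whose components
in terms of the $w_\alpha(\mathbf{r})$ are known:

$$\varphi(\mathbf{r}) = \int d\alpha \, b(\alpha) \, w_\alpha(\mathbf{r}) \tag{A-63}$$

$$\psi(\mathbf{r}) = \int d\alpha' \, c(\alpha') \, w_{\alpha'}(\mathbf{r}) \tag{A-64}$$

Calculate their scalar product:

$$(\varphi, \psi) = \int d^3r \, \varphi^*(\mathbf{r}) \, \psi(\mathbf{r}) = \int d\alpha \int d\alpha' \, b^*(\alpha) \, c(\alpha') \int d^3r \, w_\alpha^*(\mathbf{r}) \, w_{\alpha'}(\mathbf{r}) \tag{A-65}$$

The last integral is given by (A-56):

$$(\varphi, \psi) = \int d\alpha \int d\alpha' \, b^*(\alpha) \, c(\alpha') \, \delta(\alpha - \alpha')$$

that is:

$$(\varphi, \psi) = \int d\alpha \, b^*(\alpha) \, c(\alpha) \tag{A-66}$$

In particular:

$$(\psi, \psi) = \int d\alpha \, |c(\alpha)|^2 \tag{A-67}$$

All the formulas of § A-2 can thus be generalized, using the correspondence
rules of table (II-3).

| Discrete basis | Continuous basis |
|---|---|
| $i$ | $\alpha$ |
| $\sum_i$ | $\int d\alpha$ |
| $\delta_{ij}$ | $\delta(\alpha - \alpha')$ |

Table (II-3)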


The most important formulas established in this section are assembled in 
table (II-4). In fact, it is not necessary to remember them in this form: we shall 
see that the introduction of Dirac notation enables us to rederive them very simply. 


Table (II-4) 



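| | Discrete basis $\{u_i(\mathbf{r})\}$ | Continuous basis $\{w_\alpha(\mathbf{r})\}$ |
|---|---|---|
| Orthonormalization relation | $(u_i, u_j) = \delta_{ij}$ | $(w_\alpha, w_{\alpha'}) = \delta(\alpha - \alpha')$ |
| Closure relation | $\sum_i u_i(\mathbf{r}) \, u_i^*(\mathbf{r}') = \delta(\mathbf{r} - \mathbf{r}')$ | $\int d\alpha \, w_\alpha(\mathbf{r}) \, w_\alpha^*(\mathbf{r}') = \delta(\mathbf{r} - \mathbf{r}')$ |
| Expansion of a wave function $\psi(\mathbf{r})$ | $\psi(\mathbf{r}) = \sum_i c_i \, u_i(\mathbf{r})$ | $\psi(\mathbf{r}) = \int d\alpha \, c(\alpha) \, w_\alpha(\mathbf{r})$ |
| Expression for the components of $\psi(\mathbf{r})$ | $c_i = (u_i, \psi) = \int d^3r \, u_i^*(\mathbf{r}) \, \psi(\mathbf{r})$ | $c(\alpha) = (w_\alpha, \psi) = \int d^3r \, w_\alpha^*(\mathbf{r}) \, \psi(\mathbf{r})$ |
| Scalar product | $(\varphi, \psi) = \sum_i b_i^* \, c_i$ | $(\varphi, \psi) = \int d\alpha \, b^*(\alpha) \, c(\alpha)$ |
| Square of the norm | $(\psi, \psi) = \sum_i \lvert c_i \rvert^2$ | $(\psi, \psi) = \int d\alpha \, \lvert c(\alpha) \rvert^2$ |


B. STATE SPACE. DIRAC NOTATION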

1. Introduction 

In chapter I, we stated the following postulate: the quantum state of a
particle is defined, at a given instant, by a wave function $\psi(\mathbf{r})$. The probabilistic
interpretation of this wave function requires that it be square-integrable. This
requirement led us to study the $\mathcal{F}$-space (§ A). We then found, in particular,
that the same function $\psi(\mathbf{r})$ can be represented by several distinct sets of
components, each one corresponding to the choice of a basis [table (II-5)].
This result can be interpreted in the following manner: $\{c_i\}$, or $\bar{\psi}(\mathbf{p})$, or $c(\alpha)$,
characterizes the state of a particle just as well as the wave function $\psi(\mathbf{r})$ [if
the basis being used has been specified previously]. Furthermore, $\psi(\mathbf{r})$ itself
appears, in table (II-5), on the same footing as $\{c_i\}$, $\bar{\psi}(\mathbf{p})$ and $c(\alpha)$: the value $\psi(\mathbf{r}_0)$
which the wave function takes on at a point $\mathbf{r}_0$ of space can be considered as its
component with respect to a specific function $\xi_{\mathbf{r}_0}(\mathbf{r})$ of a particular basis
(the $\delta$ function basis).

| Basis | Components of $\psi(\mathbf{r})$ |
|---|---|
| $u_i(\mathbf{r})$ | $c_i$, $i = 1, 2, ..., n, ...$ |
| $v_{\mathbf{p}}(\mathbf{r})$ | $\bar{\psi}(\mathbf{p})$ |
| $\xi_{\mathbf{r}_0}(\mathbf{r})$ | $\psi(\mathbf{r}_0)$ |
| $w_\alpha(\mathbf{r})$ | $c(\alpha)$ |

Table (II-5)

We thus find ourselves in a situation which is analogous to the one
encountered in ordinary space, $R^3$: the position of a point in space can be
described by a set of three numbers, which are its coordinates with respect to a
system of axes defined in advance. If one changes axes, another set of coordinates
corresponds to the same point. But the geometrical vector concept and vector
calculation enable us to avoid referring to a system of axes; this considerably
simplifies both formulas and reasoning.

We are going to use a similar approach here: each quantum state of a
particle will be characterized by a state vector, belonging to an abstract space,
$\mathcal{E}_r$, called the state space of a particle. The fact that the space $\mathcal{F}$ is a subspace
of $L^2$ means that $\mathcal{E}_r$ is a subspace of a Hilbert space. We are going to define
the notation and the rules of vector calculation in $\mathcal{E}_r$.

Actually, the introduction of state vectors and the state space does more than
merely simplify the formalism. It also permits a generalization of the
formalism. Indeed, there exist physical systems whose quantum description cannot
be given by a wave function: we shall see in chapters IV and IX that this
is the case when the spin degrees of freedom are taken into account, even for
a single particle. Consequently, the first postulate that we shall set forth in
chapter III will be the following: the quantum state of any physical system is
characterized by a state vector, belonging to a space $\mathcal{E}$ which is the state space of the
system.

Therefore, in the rest of this chapter, we are going to develop a vector
calculus in $\mathcal{E}$. The concepts which we are going to introduce and the results
which we shall obtain are valid for whatever physical system we might consider.
Nevertheless, to illustrate these concepts and results, we shall apply them to the
simple case of a (spinless) particle, since this is the case we have previously
considered.

We shall begin, in this paragraph, by defining the Dirac notation, which 
will prove to be very useful in the formal manipulations which we shall have 
to perform. 


2. "Ket" vectors and "bra" vectors 

a. ELEMENTS OF $\mathcal{E}$: KETS

α. Notation

Any element, or vector, of $\mathcal{E}$-space is called a ket vector, or, more
simply, a ket. It is represented by the symbol $| \ \rangle$, inside which is placed a
distinctive sign which enables us to distinguish the corresponding ket from all
others, for example: $| \psi \rangle$.

In particular, since the concept of a wave function is now familiar to us,
we shall define the space $\mathcal{E}_r$ of the states of a particle by associating with
every square-integrable function $\psi(\mathbf{r})$ a ket vector $| \psi \rangle$ of $\mathcal{E}_r$:

$$\psi(\mathbf{r}) \in \mathcal{F} \iff | \psi \rangle \in \mathcal{E}_r \tag{B-1}$$

Afterwards, we shall transpose the different operations which we introduced
for $\mathcal{F}$ into $\mathcal{E}_r$. Although $\mathcal{F}$ and $\mathcal{E}_r$ are isomorphic, we shall carefully distinguish
between them in order to avoid confusion and to reserve the possibilities of
generalization mentioned above in § B-1. We stress the fact that an $\mathbf{r}$-dependence
no longer appears in $| \psi \rangle$; only the letter $\psi$ appears to remind us with which
function it is associated. $\psi(\mathbf{r})$ will be interpreted (§ E) as the set of the components
of the ket $| \psi \rangle$ in a particular basis, $\mathbf{r}$ playing the role of an index [cf. § A-3-b and
table (II-5)]. Consequently, the procedure which we are adopting here consists
in initially characterizing a vector by its components in a privileged coordinate
system, which will later be treated on the same footing as all other coordinate
systems.

We shall designate by $\mathcal{E}_x$ the state space of a (spinless) particle in only one
dimension, that is, the abstract space constructed as in (B-1), but using wave
functions which depend only on the $x$ variable.

β. Scalar product

With each pair of kets $| \varphi \rangle$ and $| \psi \rangle$, taken in this order, we associate a
complex number, which is their scalar product, $(| \varphi \rangle, | \psi \rangle)$, and which satisfies
the various properties described by equations (A-5), (A-6) and (A-7). We shall
later rewrite these formulas in Dirac notation after we have introduced the
concept of a “bra”.

In $\mathcal{E}_r$, the scalar product of two kets will coincide with the scalar product
defined above for the associated wave functions.

b. ELEMENTS OF THE DUAL SPACE $\mathcal{E}^*$ OF $\mathcal{E}$: BRAS

α. Definition of the dual space $\mathcal{E}^*$

Recall, first of all, the definition of a linear functional defined on the
kets $| \psi \rangle$ of $\mathcal{E}$. A linear functional $\chi$ is a linear operation which associates a
complex number with every ket $| \psi \rangle$:

$$| \psi \rangle \in \mathcal{E} \ \longrightarrow \ \text{number } \chi(| \psi \rangle)$$

$$\chi(\lambda_1 | \psi_1 \rangle + \lambda_2 | \psi_2 \rangle) = \lambda_1 \, \chi(| \psi_1 \rangle) + \lambda_2 \, \chi(| \psi_2 \rangle) \tag{B-2}$$

Linear functional and linear operator must not be confused. In both cases, one is
dealing with linear operations, but the former associates each ket with a complex number, while
the latter associates another ket.

It can be shown that the set of linear functionals defined on the kets
$| \psi \rangle \in \mathcal{E}$ constitutes a vector space, which is called the dual space of $\mathcal{E}$ and
which will be symbolized by $\mathcal{E}^*$.

β. Bra notation for the vectors of $\mathcal{E}^*$

Any element, or vector, of the space $\mathcal{E}^*$ is called a bra vector, or, more
simply, a bra. It is symbolized by $\langle \ |$. For example, the bra $\langle \chi |$ designates
the linear functional $\chi$ and we shall henceforth use the notation $\langle \chi | \psi \rangle$ to
denote the number obtained by causing the linear functional $\langle \chi | \in \mathcal{E}^*$ to act
on the ket $| \psi \rangle \in \mathcal{E}$:

$$\chi(| \psi \rangle) = \langle \chi | \psi \rangle \tag{B-3}$$

The origin of this terminology is the word “bracket”, used to denote the symbol $\langle \ | \ \rangle$.
Hence the name “bra” for the left-hand side, and the name “ket” for the right-hand side
of this symbol.

c. CORRESPONDENCE BETWEEN KETS AND BRAS 

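α. To every ket corresponds a bra

The existence of a scalar product in $\mathcal{E}$ will now enable us to show that
we can associate, with every ket $| \varphi \rangle \in \mathcal{E}$, an element of $\mathcal{E}^*$, that is, a bra,
which will be denoted by $\langle \varphi |$.

The ket $| \varphi \rangle$ does indeed enable us to define a linear functional:
the one which associates (in a linear way), with each ket $| \psi \rangle \in \mathcal{E}$, a complex
number which is equal to the scalar product $(| \varphi \rangle, | \psi \rangle)$ of $| \psi \rangle$ by $| \varphi \rangle$.
Let $\langle \varphi |$ be this linear functional; it is thus defined by the relation:

$$\langle \varphi | \psi \rangle = (| \varphi \rangle, | \psi \rangle) \tag{B-4}$$

β. This correspondence is antilinear

In the space $\mathcal{E}$, the scalar product is antilinear with respect to the first
vector. In the notation of (B-4), this is expressed by:

$$\begin{aligned} (\lambda_1 | \varphi_1 \rangle + \lambda_2 | \varphi_2 \rangle, | \psi \rangle) &= \lambda_1^* \, (| \varphi_1 \rangle, | \psi \rangle) + \lambda_2^* \, (| \varphi_2 \rangle, | \psi \rangle) \\ &= \lambda_1^* \langle \varphi_1 | \psi \rangle + \lambda_2^* \langle \varphi_2 | \psi \rangle \\ &= (\lambda_1^* \langle \varphi_1 | + \lambda_2^* \langle \varphi_2 |) \, | \psi \rangle \end{aligned} \tag{B-5}$$

It appears from (B-5) that the bra associated with the ket $\lambda_1 | \varphi_1 \rangle + \lambda_2 | \varphi_2 \rangle$
is the bra $\lambda_1^* \langle \varphi_1 | + \lambda_2^* \langle \varphi_2 |$:

$$\lambda_1 | \varphi_1 \rangle + \lambda_2 | \varphi_2 \rangle \ \Longrightarrow \ \lambda_1^* \langle \varphi_1 | + \lambda_2^* \langle \varphi_2 | \tag{B-6}$$

The ket $\Longrightarrow$ bra correspondence is therefore antilinear.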


comment: 

If $\lambda$ is a complex number, and $| \psi \rangle$ a ket, $\lambda | \psi \rangle$ is a ket ($\mathcal{E}$ is a vector
space). We are sometimes led to write it as $| \lambda\psi \rangle$:

$$| \lambda\psi \rangle = \lambda \, | \psi \rangle \tag{B-7}$$

One must then be careful to remember that $\langle \lambda\psi |$ represents the bra
associated with the ket $| \lambda\psi \rangle$. Since the correspondence between a bra
and a ket is antilinear, we have:

$$\langle \lambda\psi | = \lambda^* \langle \psi | \tag{B-8}$$
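
As a quick check of (B-7) and (B-8), one can take the particular value $\lambda = i$ (a choice made here only for illustration) and verify that multiplying a ket by this phase factor leaves the square of its norm unchanged:

```latex
\langle i\psi | = (i)^* \, \langle \psi | = -i \, \langle \psi |
\qquad \Longrightarrow \qquad
\langle i\psi | i\psi \rangle = (-i)(i) \, \langle \psi | \psi \rangle = \langle \psi | \psi \rangle
```

Forgetting the complex conjugate in (B-8) would instead give $(i)(i) \langle \psi | \psi \rangle = - \langle \psi | \psi \rangle$, a negative squared norm, which shows why the antilinearity cannot be dropped.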

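γ. Dirac notation for the scalar product

We now have at our disposal two distinct notations for designating the
scalar product of $| \psi \rangle$ by $| \varphi \rangle$: $(| \varphi \rangle, | \psi \rangle)$ or $\langle \varphi | \psi \rangle$, $\langle \varphi |$ being the
bra associated with the ket $| \varphi \rangle$. Henceforth we shall use only the (Dirac)
notation: $\langle \varphi | \psi \rangle$. Table (II-6) summarizes, in Dirac notation, the properties
of the scalar product, already given in § A-1-b.

$$\langle \varphi | \psi \rangle = \langle \psi | \varphi \rangle^* \tag{B-9}$$

$$\langle \varphi | \lambda_1 \psi_1 + \lambda_2 \psi_2 \rangle = \lambda_1 \langle \varphi | \psi_1 \rangle + \lambda_2 \langle \varphi | \psi_2 \rangle \tag{B-10}$$

$$\langle \lambda_1 \varphi_1 + \lambda_2 \varphi_2 | \psi \rangle = \lambda_1^* \langle \varphi_1 | \psi \rangle + \lambda_2^* \langle \varphi_2 | \psi \rangle \tag{B-11}$$

$$\langle \psi | \psi \rangle \ \text{real, positive; zero if and only if } | \psi \rangle = 0 \tag{B-12}$$

Table (II-6)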
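
The properties (B-9) to (B-12) can be checked on a finite-dimensional model. The sketch below is an illustration under an assumption not made in the text: kets are represented by lists of three complex components in some orthonormal basis, so that the scalar product is a finite sum. The names `bracket` and `combine` and all numerical values are hypothetical.

```python
def bracket(phi, psi):
    """Scalar product <phi|psi> of two kets given by their components."""
    return sum(a.conjugate() * b for a, b in zip(phi, psi))

def combine(l1, k1, l2, k2):
    """Components of the ket l1|k1> + l2|k2>."""
    return [l1 * a + l2 * b for a, b in zip(k1, k2)]

phi = [1 + 2j, 0.5j, -1 + 0j]
psi1 = [0.3 + 0j, 1 - 1j, 2j]
psi2 = [2 + 1j, -1j, 0.5 + 0.5j]
l1, l2 = 2 - 1j, 0.5 + 3j

mix = combine(l1, psi1, l2, psi2)   # ket l1|psi1> + l2|psi2>

# Each quantity below should vanish if the corresponding property holds.
check_b9 = bracket(phi, psi1) - bracket(psi1, phi).conjugate()
check_b10 = bracket(phi, mix) - (l1 * bracket(phi, psi1) + l2 * bracket(phi, psi2))
check_b11 = bracket(mix, phi) - (l1.conjugate() * bracket(psi1, phi)
                                 + l2.conjugate() * bracket(psi2, phi))
norm_sq = bracket(phi, phi)         # real and positive, as in (B-12)
```

In `check_b11` the combined ket sits on the bra side of the bracket, which is why the coefficients appear complex-conjugated, exactly as in (B-11).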


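δ. Is there a ket to correspond to every bra?

Although to every ket there corresponds a bra, we shall see in two examples chosen in $\mathcal{E}_x$
that it is possible to find bras which have no corresponding kets. We shall later show why this
difficulty does not hinder us in quantum mechanics.

(i) Counter-examples chosen in $\mathcal{E}_x$

For simplicity, we shall reason in one dimension.

Let $\xi^{(\varepsilon)}_{x_0}(x)$ be a sufficiently regular real function, such that $\int_{-\infty}^{+\infty} dx \, \xi^{(\varepsilon)}_{x_0}(x) = 1$, and having
the form of a peak of width $\varepsilon$ and amplitude $1/\varepsilon$, centered at $x = x_0$ [see fig. 1; $\xi^{(\varepsilon)}_{x_0}(x)$ is, for
example, one of the functions considered in § 1-b of appendix II]. If $\varepsilon \neq 0$, $\xi^{(\varepsilon)}_{x_0}(x) \in \mathcal{F}_x$ (the
square of its norm is of the order of $1/\varepsilon$). Denote by $| \xi^{(\varepsilon)}_{x_0} \rangle$ the corresponding ket:

$$\xi^{(\varepsilon)}_{x_0}(x) \in \mathcal{F}_x \iff | \xi^{(\varepsilon)}_{x_0} \rangle \in \mathcal{E}_x \tag{B-13}$$

FIGURE 1

$\xi^{(\varepsilon)}_{x_0}(x)$ is a function which has a peak at $x = x_0$
(of width $\varepsilon$ and amplitude $1/\varepsilon$), whose integral
between $-\infty$ and $+\infty$ is equal to 1.

If $\varepsilon \neq 0$, $| \xi^{(\varepsilon)}_{x_0} \rangle \in \mathcal{E}_x$. Let $\langle \xi^{(\varepsilon)}_{x_0} |$ be the bra associated with this ket; for every $| \psi \rangle \in \mathcal{E}_x$,
we have:

$$\langle \xi^{(\varepsilon)}_{x_0} | \psi \rangle = (\xi^{(\varepsilon)}_{x_0}, \psi) = \int_{-\infty}^{+\infty} dx \, \xi^{(\varepsilon)}_{x_0}(x) \, \psi(x) \tag{B-14}$$

Now let $\varepsilon$ approach zero. On the one hand:

$$\lim_{\varepsilon \to 0} \xi^{(\varepsilon)}_{x_0}(x) = \xi_{x_0}(x) \notin \mathcal{F}_x \tag{B-15}$$

[the square of the norm of $\xi^{(\varepsilon)}_{x_0}(x)$, which is of the order of $1/\varepsilon$, diverges when $\varepsilon \to 0$];
therefore:

$$\lim_{\varepsilon \to 0} | \xi^{(\varepsilon)}_{x_0} \rangle \notin \mathcal{E}_x \tag{B-16}$$

On the other hand, when $\varepsilon \to 0$, integral (B-14) approaches a perfectly well-defined limit,
$\psi(x_0)$ [since, for sufficiently small $\varepsilon$, $\psi(x)$ can be replaced in (B-14) by $\psi(x_0)$ and removed from
the integral]. Consequently, $\langle \xi^{(\varepsilon)}_{x_0} |$ approaches a bra which we shall denote by $\langle \xi_{x_0} |$:
$\langle \xi_{x_0} |$ is the linear functional which associates, with every ket $| \psi \rangle$ of $\mathcal{E}_x$, the value $\psi(x_0)$ taken
on by the associated wave function at the point $x_0$:

$$\lim_{\varepsilon \to 0} \langle \xi^{(\varepsilon)}_{x_0} | = \langle \xi_{x_0} | \in \mathcal{E}_x^*$$

$$\text{if } | \psi \rangle \in \mathcal{E}_x, \quad \langle \xi_{x_0} | \psi \rangle = \psi(x_0) \tag{B-17}$$

Thus we see that the bra $\langle \xi_{x_0} |$ exists, but no ket corresponds to it.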
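
The two competing behaviors in this counter-example, a diverging norm and a converging bracket, can be followed numerically. The sketch below assumes a Gaussian shape for the peak and a Gaussian test function; both are choices made for illustration (appendix II lists other admissible peak shapes), and the helper names are hypothetical.

```python
import math

def xi(x, x0, eps):
    """Gaussian peak of width ~eps and height ~1/eps with unit integral (assumed shape)."""
    return math.exp(-((x - x0) / eps) ** 2) / (eps * math.sqrt(math.pi))

def psi(x):
    # Example square-integrable wave function.
    return math.exp(-0.5 * x * x)

def integrate(f, a, b, steps=2000):
    """Midpoint-rule approximation of the integral of f over [a, b]."""
    h = (b - a) / steps
    return h * sum(f(a + (k + 0.5) * h) for k in range(steps))

X0 = 0.5
results = []
for eps in (0.5, 0.1, 0.02):
    a, b = X0 - 8.0 * eps, X0 + 8.0 * eps    # the peak is negligible outside
    norm_sq = integrate(lambda x: xi(x, X0, eps) ** 2, a, b)       # grows like 1/eps
    bracket = integrate(lambda x: xi(x, X0, eps) * psi(x), a, b)   # tends to psi(X0)
    results.append((eps, norm_sq, bracket))
```

For this Gaussian shape the square of the norm equals $1/(\varepsilon\sqrt{2\pi})$, so it grows without bound as $\varepsilon$ decreases, while the bracket settles on $\psi(x_0)$: the limiting object is well defined as a bra although it is not a ket.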

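In the same way, let us consider a plane wave which is truncated outside an interval of
width $L$:

$$v^{(L)}_{p_0}(x) = \frac{1}{\sqrt{2\pi\hbar}} \, e^{ip_0x/\hbar} \quad \text{if} \quad -\frac{L}{2} \leq x \leq +\frac{L}{2} \tag{B-18}$$

with the function going rapidly to zero outside this interval (while remaining continuous
and differentiable). We shall denote by $| v^{(L)}_{p_0} \rangle$ the ket associated with $v^{(L)}_{p_0}(x)$:

$$v^{(L)}_{p_0}(x) \in \mathcal{F}_x \iff | v^{(L)}_{p_0} \rangle \in \mathcal{E}_x \tag{B-19}$$

The square of the norm of $| v^{(L)}_{p_0} \rangle$, which is practically equal to $L/2\pi\hbar$, diverges if $L \to \infty$.
Therefore:

$$\lim_{L \to \infty} | v^{(L)}_{p_0} \rangle \notin \mathcal{E}_x \tag{B-20}$$

Now let us consider the bra $\langle v^{(L)}_{p_0} |$ associated with $| v^{(L)}_{p_0} \rangle$. For every $| \psi \rangle \in \mathcal{E}_x$, we have:

$$\langle v^{(L)}_{p_0} | \psi \rangle = (v^{(L)}_{p_0}, \psi) \simeq \frac{1}{\sqrt{2\pi\hbar}} \int_{-L/2}^{+L/2} dx \, e^{-ip_0x/\hbar} \, \psi(x) \tag{B-21}$$

When $L \to \infty$, $\langle v^{(L)}_{p_0} | \psi \rangle$ has a limit: the value $\bar{\psi}(p_0)$ of the Fourier transform $\bar{\psi}(p)$ of
$\psi(x)$ for $p = p_0$. Therefore, when $L \to \infty$, $\langle v^{(L)}_{p_0} |$ tends towards a perfectly well-defined
bra $\langle v_{p_0} |$:

$$\lim_{L \to \infty} \langle v^{(L)}_{p_0} | = \langle v_{p_0} | \in \mathcal{E}_x^*$$

$$\text{if } | \psi \rangle \in \mathcal{E}_x, \quad \langle v_{p_0} | \psi \rangle = \bar{\psi}(p_0) \tag{B-22}$$

Here again, no ket corresponds to the bra $\langle v_{p_0} |$.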

(ii) Physical resolution of the preceding difficulties

This dissymmetry of the correspondence between kets and bras is related, as the preceding
examples show, to the existence of “continuous bases” for $\mathcal{F}_x$. Since the functions constituting
these “bases” do not belong to $\mathcal{F}_x$, we cannot associate a ket of $\mathcal{E}_x$ with them. However, their
scalar product with an arbitrary function of $\mathcal{F}_x$ is defined, and this permits us to associate
with them a linear functional in $\mathcal{E}_x$, that is, a bra belonging to $\mathcal{E}_x^*$. The reason for using such
“continuous bases” lies in their usefulness in certain practical calculations. The same reason
(which will become more apparent in what follows) leads us here to reestablish the symmetry
between kets and bras by introducing “generalized kets”, defined using functions which are not
square-integrable, but whose scalar product with every function of $\mathcal{F}_x$ exists. In what follows,
we shall work with “kets” such as $| \xi_{x_0} \rangle$ or $| v_{p_0} \rangle$, associated with $\xi_{x_0}(x)$ or $v_{p_0}(x)$. It must not
be forgotten that these generalized “kets” cannot, strictly speaking, represent physical states.
They are merely intermediaries, useful in calculations involving certain operations which we
shall have to perform on the true kets of the space $\mathcal{E}_x$, which actually characterize realizable
quantum states.

This method poses a certain number of mathematical problems, which can be avoided
by adopting the following physical point of view: $| \xi_{x_0} \rangle$ (or $| v_{p_0} \rangle$) actually denotes $| \xi^{(\varepsilon)}_{x_0} \rangle$
(or $| v^{(L)}_{p_0} \rangle$) where $\varepsilon$ is very small (or $L$ is very large) compared to all the other lengths in the
problem we are considering. In all the intermediary calculations where $| \xi^{(\varepsilon)}_{x_0} \rangle$ (or $| v^{(L)}_{p_0} \rangle$)
appears, the limit $\varepsilon = 0$ (or $L \to \infty$) is never attained, so that one is always working in $\mathcal{E}_x$.
The physical result obtained at the end of the calculation depends very little on the value of $\varepsilon$,
as long as $\varepsilon$ is sufficiently small with respect to all the other lengths: it is then possible to neglect $\varepsilon$,
that is, to set $\varepsilon = 0$, in the final result (the procedure to be used for $L$ is analogous).

The objection could be raised that, unlike $\{\xi_{x_0}(x)\}$ and $\{v_{p_0}(x)\}$, $\{\xi^{(\varepsilon)}_{x_0}(x)\}$ and $\{v^{(L)}_{p_0}(x)\}$
are not orthonormal bases, insofar as they do not rigorously satisfy the closure relation. In
fact, they fulfill it approximately. For example, the expression $\int dx_0 \, \xi^{(\varepsilon)}_{x_0}(x) \, \xi^{(\varepsilon)}_{x_0}(x')$ is a function
of $(x - x')$ which can serve as an excellent approximation for $\delta(x - x')$. Its graphical
representation is practically a triangle of base $2\varepsilon$ and height $1/\varepsilon$, centered at $x - x' = 0$ (appendix II,
§ 1-c-iv). If $\varepsilon$ is negligible compared to all the other lengths in the problem, the difference between
this expression and $\delta(x - x')$ is physically inappreciable.
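
The triangular shape mentioned above can be reproduced with a short calculation. The sketch below assumes rectangular peaks of width $\varepsilon$ and height $1/\varepsilon$ for the functions $\xi^{(\varepsilon)}_{x_0}$ (one admissible choice among others), and evaluates the integral over $x_0$ by a midpoint rule; the function names and the value of `EPS` are hypothetical.

```python
def xi(x, x0, eps):
    """Rectangular peak of width eps and height 1/eps centered at x0 (assumed shape)."""
    return 1.0 / eps if abs(x - x0) < 0.5 * eps else 0.0

def approx_delta(x, xp, eps, steps=4000):
    """Midpoint-rule value of the integral over x0 of xi_{x0}(x) * xi_{x0}(x')."""
    a, b = min(x, xp) - eps, max(x, xp) + eps
    h = (b - a) / steps
    return h * sum(xi(x, a + (k + 0.5) * h, eps) * xi(xp, a + (k + 0.5) * h, eps)
                   for k in range(steps))

def triangle(sep, eps):
    """Triangle of base 2*eps and height 1/eps centered at sep = x - x' = 0."""
    return max(0.0, (eps - abs(sep)) / eps ** 2)

EPS = 0.1
SEPARATIONS = (0.0, 0.05, 0.1, 0.15)
# Each entry: (separation, numerical closure integral, triangle value).
values = [(sep, approx_delta(0.0, sep, EPS), triangle(sep, EPS))
          for sep in SEPARATIONS]
```

The area under this triangle is $\frac{1}{2} \times 2\varepsilon \times \frac{1}{\varepsilon} = 1$ for every $\varepsilon$, which is why it behaves like $\delta(x - x')$ when integrated against functions that vary slowly on the scale of $\varepsilon$.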


In general, the dual space $\mathcal{E}^*$ and the state space $\mathcal{E}$ are not isomorphic,
except, of course, if $\mathcal{E}$ is finite-dimensional*: although to each ket $| \psi \rangle$ of $\mathcal{E}$
there corresponds a bra $\langle \psi |$ in $\mathcal{E}^*$, the converse is not true. Nevertheless, we
shall agree to use, in addition to vectors belonging to $\mathcal{E}$ (whose norm is finite),
generalized kets with infinite norms but whose scalar product with every ket
of $\mathcal{E}$ is finite. Thus, to each bra $\langle \varphi |$ of $\mathcal{E}^*$, there will correspond a ket.
But generalized kets do not represent physical states of the system.


3. Linear operators 

a. DEFINITIONS 

These are the same as those of § A-1-c.

A linear operator $A$ associates with every ket $| \psi \rangle \in \mathcal{E}$ another ket
$| \psi' \rangle \in \mathcal{E}$, the correspondence being linear:

$$| \psi' \rangle = A \, | \psi \rangle \tag{B-23}$$

$$A(\lambda_1 | \psi_1 \rangle + \lambda_2 | \psi_2 \rangle) = \lambda_1 A \, | \psi_1 \rangle + \lambda_2 A \, | \psi_2 \rangle \tag{B-24}$$

The product of two linear operators $A$ and $B$, written $AB$, is defined in
the following way:

$$(AB) \, | \psi \rangle = A(B \, | \psi \rangle) \tag{B-25}$$

$B$ first acts on $| \psi \rangle$ to give the ket $B \, | \psi \rangle$; $A$ then acts on the ket $B \, | \psi \rangle$.
In general, $AB \neq BA$. The commutator $[A, B]$ of $A$ and $B$ is, by definition:

$$[A, B] = AB - BA \tag{B-26}$$

Let $| \varphi \rangle$ and $| \psi \rangle$ be two kets. We call the matrix element of $A$ between
$| \varphi \rangle$ and $| \psi \rangle$ the scalar product:

$$\langle \varphi | \, (A \, | \psi \rangle) \tag{B-27}$$

Consequently, this is a number which depends linearly on $| \psi \rangle$ and antilinearly on
$| \varphi \rangle$.
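
These definitions can be tried out in a two-dimensional model in which operators are $2 \times 2$ matrices and kets are lists of two complex components. This is an assumption made for illustration only; the Pauli matrices used as examples, the helper names and the numerical values do not come from the text. The sketch checks (B-25), evaluates a commutator as in (B-26), and tests the linear and antilinear dependence of the matrix element (B-27).

```python
def matmul(A, B):
    """Matrix of the product AB: B acts first, then A."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def apply_op(A, ket):
    """Components of the ket A|psi>."""
    return [sum(row[k] * ket[k] for k in range(len(ket))) for row in A]

def commutator(A, B):
    AB, BA = matmul(A, B), matmul(B, A)
    return [[AB[i][j] - BA[i][j] for j in range(len(A))] for i in range(len(A))]

def bracket(phi, psi):
    """Scalar product <phi|psi>."""
    return sum(a.conjugate() * b for a, b in zip(phi, psi))

# Example operators (Pauli matrices), chosen because they do not commute.
SX = [[0, 1], [1, 0]]
SY = [[0, -1j], [1j, 0]]
SZ = [[1, 0], [0, -1]]

psi = [1 + 1j, 2 - 1j]
phi = [0.5j, 1 + 0j]
lam = 2 - 3j

# (AB)|psi> minus A(B|psi>): should vanish component by component.
product_check = [x - y for x, y in zip(apply_op(matmul(SX, SY), psi),
                                       apply_op(SX, apply_op(SY, psi)))]
comm = commutator(SX, SY)                       # equals 2i * SZ for these matrices
element = bracket(phi, apply_op(SY, psi))       # matrix element <phi|(A|psi>)
element_scaled_ket = bracket(phi, apply_op(SY, [lam * a for a in psi]))
element_scaled_bra = bracket([lam * a for a in phi], apply_op(SY, psi))
```

Scaling the ket by `lam` multiplies the matrix element by `lam`, while scaling the ket that defines the bra multiplies it by the complex conjugate of `lam`.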


b. EXAMPLES OF LINEAR OPERATORS: PROJECTORS 

α. Important comment about Dirac notation

We have begun to sense, in the preceding, the simplicity and convenience
of the Dirac formalism. For example, $\langle \varphi |$ denotes a linear functional (a bra),
and $\langle \psi_1 | \psi_2 \rangle$, the scalar product of two kets $| \psi_1 \rangle$ and $| \psi_2 \rangle$. The number
associated by the linear functional $\langle \varphi |$ with an arbitrary ket $| \psi \rangle$ is then
written simply by juxtaposing the symbols $\langle \varphi |$ and $| \psi \rangle$: $\langle \varphi | \psi \rangle$. This is
the scalar product of $| \psi \rangle$ by the ket $| \varphi \rangle$ corresponding to $\langle \varphi |$ (which is
why it is useful to have a one-to-one correspondence between kets and bras).

* It is true that the Hilbert space $L^2$ and its dual space are isomorphic; however, we have taken for
the wave function space $\mathcal{F}$ a subspace of $L^2$, which explains why $\mathcal{F}^*$ is “larger” than $\mathcal{F}$.

Now assume that we write $\langle \varphi |$ and $| \psi \rangle$ in the opposite order:

$$| \psi \rangle \langle \varphi | \tag{B-28}$$

We shall see that if we abide by the rule of juxtaposition of symbols, this
expression represents an operator. Choose an arbitrary ket $| \chi \rangle$ and consider:

$$| \psi \rangle \langle \varphi | \chi \rangle \tag{B-29}$$

We already know that $\langle \varphi | \chi \rangle$ is a complex number; consequently, (B-29)
is a ket, obtained by multiplying $| \psi \rangle$ by the scalar $\langle \varphi | \chi \rangle$. $| \psi \rangle \langle \varphi |$,
applied to an arbitrary ket, gives another ket: it is an operator.

Thus we see that the order of the symbols is of critical importance. Only
complex numbers can be moved about with impunity, because of the linearity of
the space $\mathcal{E}$ and of the operators which we shall use. Indeed, if $\lambda$ is a number:

$$\begin{aligned} | \psi \rangle \lambda &= \lambda \, | \psi \rangle \\ \langle \psi | \lambda &= \lambda \, \langle \psi | \\ A \lambda \, | \psi \rangle &= \lambda A \, | \psi \rangle \quad \text{(where } A \text{ is a linear operator)} \\ \langle \varphi | \lambda | \psi \rangle &= \lambda \langle \varphi | \psi \rangle = \langle \varphi | \psi \rangle \lambda \end{aligned} \tag{B-30}$$

But, for kets, bras and operators, the order must always be carefully respected
in writing the formulas: this is the price that must be paid for the simplicity of
the Dirac formalism.

β. The projector $P_\psi$ onto a ket $| \psi \rangle$

Let $| \psi \rangle$ be a ket which is normalized to one:

$\langle \psi | \psi \rangle = 1$    (B-31)

Consider the operator $P_\psi$ defined by:

$P_\psi = | \psi \rangle \langle \psi |$    (B-32)

and apply it to an arbitrary ket $| \varphi \rangle$:

$P_\psi | \varphi \rangle = | \psi \rangle \langle \psi | \varphi \rangle$    (B-33)

$P_\psi$, acting on an arbitrary ket $| \varphi \rangle$, gives a ket proportional to $| \psi \rangle$. The
coefficient of proportionality $\langle \psi | \varphi \rangle$ is the scalar product of $| \varphi \rangle$ by $| \psi \rangle$.

The "geometrical" significance of $P_\psi$ is therefore clear: it is the "orthogonal
projection" operator onto the ket $| \psi \rangle$.

This interpretation is confirmed by the fact that $P_\psi^2 = P_\psi$ (projecting twice
in succession onto a given vector is equivalent to projecting a single time).
To see this, we write:

$P_\psi^2 = P_\psi P_\psi = | \psi \rangle \langle \psi | \psi \rangle \langle \psi |$    (B-34)

In this expression, $\langle \psi | \psi \rangle$ is a number, which is equal to 1 [formula (B-31)].
Therefore:

$P_\psi^2 = | \psi \rangle \langle \psi | = P_\psi$    (B-35)
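γ. Projector onto a subspace

Let $| \varphi_1 \rangle, | \varphi_2 \rangle, \ldots, | \varphi_q \rangle$ be $q$ normalized vectors which are orthogonal
to each other:

$\langle \varphi_i | \varphi_j \rangle = \delta_{ij}$ ;  $i, j = 1, 2, \ldots, q$    (B-36)

We denote by $\mathcal{E}_q$ the subspace of $\mathcal{E}$ spanned by these $q$ vectors.

Let $P_q$ be the linear operator defined by:

$P_q = \sum_{i=1}^{q} | \varphi_i \rangle \langle \varphi_i |$    (B-37)

Calculating $P_q^2$:

$P_q^2 = \sum_{i=1}^{q} \sum_{j=1}^{q} | \varphi_i \rangle \langle \varphi_i | \varphi_j \rangle \langle \varphi_j |$    (B-38)

we get, using (B-36):

$P_q^2 = \sum_{i=1}^{q} \sum_{j=1}^{q} | \varphi_i \rangle \langle \varphi_j | \, \delta_{ij} = \sum_{i=1}^{q} | \varphi_i \rangle \langle \varphi_i | = P_q$    (B-39)

$P_q$ is therefore a projector. It is easy to see that $P_q$ projects onto the
subspace $\mathcal{E}_q$, since for any $| \psi \rangle \in \mathcal{E}$:

$P_q | \psi \rangle = \sum_{i=1}^{q} | \varphi_i \rangle \langle \varphi_i | \psi \rangle$    (B-40)

$P_q$ acting on $| \psi \rangle$ gives the linear superposition of the projections of $| \psi \rangle$
onto the various $| \varphi_i \rangle$, that is, the projection of $| \psi \rangle$ onto the subspace $\mathcal{E}_q$.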
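The idempotence relations (B-35) and (B-39) are easy to verify in a three-dimensional example. This is a minimal sketch, assuming component lists in a fixed orthonormal basis; the vectors `phi1`, `phi2`, `psi` and the helpers `outer`, `matmul`, `apply`, `close` are hypothetical names chosen for illustration.

```python
import math

def outer(ket, other):
    """Matrix of |ket><other|: element (i, j) is ket_i * conj(other_j)."""
    return [[a * b.conjugate() for b in other] for a in ket]

def matmul(X, Y):
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def apply(X, ket):
    return [sum(X[i][k] * ket[k] for k in range(len(ket))) for i in range(len(X))]

def close(X, Y, tol=1e-12):
    """True if two matrices agree element by element within tol."""
    return all(abs(x - y) < tol for rx, ry in zip(X, Y) for x, y in zip(rx, ry))

s = 1 / math.sqrt(2)
phi1 = [s, 1j * s, 0]   # normalized ket
phi2 = [0, 0, 1]        # normalized ket, orthogonal to phi1

P1 = outer(phi1, phi1)  # projector onto phi1, as in (B-32)
P2 = outer(phi2, phi2)
Pq = [[P1[i][j] + P2[i][j] for j in range(3)] for i in range(3)]  # (B-37), q = 2

psi = [1, 0, 2]
projected = apply(Pq, psi)  # projection of psi onto the span of phi1 and phi2
```

Applying `Pq` a second time to `projected` leaves it unchanged, which is the content of (B-39).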


4. Hermitian conjugation 

a. ACTION OF A LINEAR OPERATOR ON A BRA 

Until now, we have only defined the action of a linear operator A on 
kets. We are now going to see that it is also possible to define the action of A 
on bras. 

Let $\langle \varphi |$ be a well-defined bra, and consider the set of all kets $| \psi \rangle$.
With each of these kets can be associated the complex number $\langle \varphi | (A | \psi \rangle)$,
already defined above as the matrix element of $A$ between $| \varphi \rangle$ and $| \psi \rangle$.
Since $A$ is linear and the scalar product depends linearly on the ket, the number
$\langle \varphi | (A | \psi \rangle)$ depends linearly on $| \psi \rangle$. Thus, for fixed $\langle \varphi |$ and $A$, we can
associate with every ket $| \psi \rangle$ a number which depends linearly on $| \psi \rangle$. The
specification of $\langle \varphi |$ and $A$ therefore defines a new linear functional on the
kets of $\mathcal{E}$, that is, a new bra belonging to $\mathcal{E}^*$. We shall denote this new bra
by $\langle \varphi | A$. The relation which defines $\langle \varphi | A$ can thus be written:

$(\langle \varphi | A) | \psi \rangle = \langle \varphi | (A | \psi \rangle)$    (B-41)
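The operator $A$ associates with every bra $\langle \varphi |$ a new bra $\langle \varphi | A$. Let us
show that the correspondence is linear. In order to do this, consider a linear
combination of bras $\langle \varphi_1 |$ and $\langle \varphi_2 |$:

$\langle \chi | = \lambda_1 \langle \varphi_1 | + \lambda_2 \langle \varphi_2 |$    (B-42)

(which means that $\langle \chi | \psi \rangle = \lambda_1 \langle \varphi_1 | \psi \rangle + \lambda_2 \langle \varphi_2 | \psi \rangle$). From (B-41), we
have:

$(\langle \chi | A) | \psi \rangle = \langle \chi | (A | \psi \rangle)$
$= \lambda_1 \langle \varphi_1 | (A | \psi \rangle) + \lambda_2 \langle \varphi_2 | (A | \psi \rangle)$
$= \lambda_1 (\langle \varphi_1 | A) | \psi \rangle + \lambda_2 (\langle \varphi_2 | A) | \psi \rangle$    (B-43)

Since $| \psi \rangle$ is arbitrary, it follows that:

$\langle \chi | A = (\lambda_1 \langle \varphi_1 | + \lambda_2 \langle \varphi_2 |) A$
$= \lambda_1 \langle \varphi_1 | A + \lambda_2 \langle \varphi_2 | A$    (B-44)

Equation (B-41) therefore defines a linear operation on bras. The bra
$\langle \varphi | A$ is the bra which results from the action of the linear operator $A$ on
the bra $\langle \varphi |$.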

COMMENTS:

(i) From definition (B-41) of $\langle \varphi | A$, we see that the place of the parenthesis
in the symbol defining the matrix element of $A$ between $| \varphi \rangle$ and $| \psi \rangle$
is of no importance. Therefore, we shall henceforth designate this matrix
element by the notation $\langle \varphi | A | \psi \rangle$:

$\langle \varphi | A | \psi \rangle = (\langle \varphi | A) | \psi \rangle = \langle \varphi | (A | \psi \rangle)$    (B-45)

(ii) The relative order of $\langle \varphi |$ and $A$ is very important in the notation
$\langle \varphi | A$ (cf. § 3-b-α above). One must write $\langle \varphi | A$ and not $A \langle \varphi |$: $\langle \varphi | A$
acting on a ket $| \psi \rangle$ gives a number $\langle \varphi | A | \psi \rangle$; $\langle \varphi | A$ is therefore indeed
a bra. On the other hand, $A \langle \varphi |$, acting on a ket $| \psi \rangle$, would give
$A \langle \varphi | \psi \rangle$, that is, an operator (the operator $A$ multiplied by the number
$\langle \varphi | \psi \rangle$). We have not defined any mathematical object of this sort:
$A \langle \varphi |$ therefore has no meaning.

b. THE ADJOINT OPERATOR $A^\dagger$ OF A LINEAR OPERATOR $A$

We are now going to see that the correspondence between kets and bras,
studied in § B-2-c, enables us to associate with every linear operator $A$ another
linear operator $A^\dagger$, called the adjoint operator (or Hermitian conjugate) of $A$.

Let $| \psi \rangle$ then be an arbitrary ket of $\mathcal{E}$. The operator $A$ associates with it
another ket $| \psi' \rangle = A | \psi \rangle$ of $\mathcal{E}$ (fig. 2).

To the ket $| \psi \rangle$ corresponds a bra $\langle \psi |$; in the same way, to $| \psi' \rangle$
corresponds $\langle \psi' |$. This correspondence between kets and bras thus permits us to define
the action of the operator $A^\dagger$ on the bras: the operator $A^\dagger$ associates with the
bra $\langle \psi |$ corresponding to the ket $| \psi \rangle$, the bra $\langle \psi' |$ corresponding to the
ket $| \psi' \rangle = A | \psi \rangle$. We write: $\langle \psi' | = \langle \psi | A^\dagger$.

FIGURE 2

Definition of the adjoint operator $A^\dagger$ of an operator $A$ using the
correspondence between kets and bras: $A$ takes $| \psi \rangle$ to $| \psi' \rangle = A | \psi \rangle$, and $A^\dagger$ takes the
corresponding bra $\langle \psi |$ to $\langle \psi' | = \langle \psi | A^\dagger$.

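Let us show that the relation $\langle \psi' | = \langle \psi | A^\dagger$ is linear. We know that, to the
bra $\lambda_1 \langle \psi_1 | + \lambda_2 \langle \psi_2 |$, corresponds the ket $\lambda_1^* | \psi_1 \rangle + \lambda_2^* | \psi_2 \rangle$ (the
correspondence between a bra and a ket is antilinear). The operator $A$
transforms $\lambda_1^* | \psi_1 \rangle + \lambda_2^* | \psi_2 \rangle$ into $\lambda_1^* A | \psi_1 \rangle + \lambda_2^* A | \psi_2 \rangle = \lambda_1^* | \psi_1' \rangle + \lambda_2^* | \psi_2' \rangle$.

Finally, to this ket corresponds the bra:

$\lambda_1 \langle \psi_1' | + \lambda_2 \langle \psi_2' | = \lambda_1 \langle \psi_1 | A^\dagger + \lambda_2 \langle \psi_2 | A^\dagger$

From this we conclude that:

$(\lambda_1 \langle \psi_1 | + \lambda_2 \langle \psi_2 |) A^\dagger = \lambda_1 \langle \psi_1 | A^\dagger + \lambda_2 \langle \psi_2 | A^\dagger$    (B-46)

$A^\dagger$ is therefore a linear operator, defined by the formula:

$| \psi' \rangle = A | \psi \rangle \iff \langle \psi' | = \langle \psi | A^\dagger$    (B-47)

From (B-47), it is easy to deduce another important relationship satisfied
by the operator $A^\dagger$. Using the properties of the scalar product, one can always
write:

$\langle \psi' | \varphi \rangle = \langle \varphi | \psi' \rangle^*$    (B-48)

where $| \varphi \rangle$ is an arbitrary ket of $\mathcal{E}$. Using expressions (B-47) for $| \psi' \rangle$ and $\langle \psi' |$,
we obtain:

$\langle \psi | A^\dagger | \varphi \rangle = \langle \varphi | A | \psi \rangle^*$    (B-49)

a relation which is valid for all $| \varphi \rangle$ and $| \psi \rangle$.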

COMMENT ABOUT NOTATION

We have already mentioned a notation which can lead to confusion:
$| \lambda \psi \rangle$ and $\langle \lambda \psi |$, where $\lambda$ is a scalar [formulas (B-7) and (B-8)]. The same
problem arises with the expressions $| A \psi \rangle$ and $\langle A \psi |$, where $A$ is a linear
operator. $| A \psi \rangle$ is another way of designating the ket $A | \psi \rangle$:

$| A \psi \rangle = A | \psi \rangle$    (B-50)

$\langle A \psi |$ is the bra associated with the ket $| A \psi \rangle$. Using (B-50) and (B-47), we
see that:

$\langle A \psi | = \langle \psi | A^\dagger$    (B-51)

When a linear operator $A$ is taken outside the bra symbol, it must be replaced
by its adjoint $A^\dagger$ (and placed to the right of the bra).
c. CORRESPONDENCE BETWEEN AN OPERATOR AND ITS ADJOINT

By using (B-47) or (B-49), it is easy to show that

$(A^\dagger)^\dagger = A$    (B-52)

$(\lambda A)^\dagger = \lambda^* A^\dagger$ (where $\lambda$ is a number)    (B-53)

$(A + B)^\dagger = A^\dagger + B^\dagger$    (B-54)

Now let us calculate $(AB)^\dagger$. To do this, consider the ket $| \varphi \rangle = AB | \psi \rangle$.
Write it in the form $| \varphi \rangle = A | \chi \rangle$, setting $| \chi \rangle = B | \psi \rangle$. Then:

$\langle \varphi | = \langle \psi | (AB)^\dagger = \langle \chi | A^\dagger = \langle \psi | B^\dagger A^\dagger$

since $\langle \chi | = \langle \psi | B^\dagger$. From this, we deduce that:

$(AB)^\dagger = B^\dagger A^\dagger$    (B-55)

Note that the order changes when one takes the adjoint of a product of operators.

COMMENT:

Since $(A^\dagger)^\dagger = A$, we can write, using (B-51):

$\langle A^\dagger \varphi | = \langle \varphi | (A^\dagger)^\dagger = \langle \varphi | A$

Thus the left-hand side of (B-41) can be rewritten in the form $\langle A^\dagger \varphi | \psi \rangle$. In
the same way, the right-hand side of this same equation can be put, with
the notation of (B-50), into the form $\langle \varphi | A \psi \rangle$. From this results the
following equation, sometimes used to define the adjoint operator $A^\dagger$ of $A$:

$\langle A^\dagger \varphi | \psi \rangle = \langle \varphi | A \psi \rangle$    (B-56)
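Relations (B-53), (B-55) and (B-56) can be illustrated with 2 x 2 matrices, for which the adjoint is the conjugate transpose (a fact established later, in § C-4). The sketch below assumes that matrix picture; the matrices, kets and helper names are arbitrary illustrative choices.

```python
def dagger(X):
    """Conjugate transpose of a square matrix given as nested lists."""
    n = len(X)
    return [[X[j][i].conjugate() for j in range(n)] for i in range(n)]

def matmul(X, Y):
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def apply(X, ket):
    return [sum(X[i][k] * ket[k] for k in range(len(ket))) for i in range(len(X))]

def braket(phi, psi):
    """Scalar product <phi|psi>."""
    return sum(p.conjugate() * q for p, q in zip(phi, psi))

A = [[1, 2j], [3, 4]]
B = [[0, 1], [1j, 1]]
lam = 2 + 3j

adjoint_of_product = dagger(matmul(A, B))          # (AB)^dagger
reversed_product = matmul(dagger(B), dagger(A))    # B^dagger A^dagger, (B-55)

lam_A = [[lam * a for a in row] for row in A]
adjoint_of_scaled = dagger(lam_A)                  # (lam A)^dagger
scaled_adjoint = [[lam.conjugate() * a for a in row] for row in dagger(A)]  # (B-53)

phi = [1, 1j]
psi = [2, 1 - 1j]
left = braket(apply(dagger(A), phi), psi)   # <A^dagger phi | psi>
right = braket(phi, apply(A, psi))          # <phi | A psi>, (B-56)
```

Note that `matmul(dagger(A), dagger(B))` would generally differ from `adjoint_of_product`: the reversal of order in (B-55) matters.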


d. HERMITIAN CONJUGATION IN DIRAC NOTATION 

In the preceding section, we introduced the concept of an adjoint operator
by using the correspondence between kets and bras. A ket $| \psi \rangle$ and its
corresponding bra $\langle \psi |$ are said to be "Hermitian conjugates" of each other. The
operation of Hermitian conjugation is represented by the wavy arrows in figure 2; we
see that it associates $A^\dagger$ with $A$. This is the reason why $A^\dagger$ is also called the
Hermitian conjugate operator of $A$.

The operation of Hermitian conjugation changes the order of the objects
to which it is applied. Thus we see in figure 2 that $A | \psi \rangle$ becomes $\langle \psi | A^\dagger$. The
ket $| \psi \rangle$ is changed into $\langle \psi |$, and $A$ into $A^\dagger$. Moreover, the order is reversed.
In the same way, we saw in (B-55) that the Hermitian conjugate of a product of
two operators is equal to the product of the Hermitian conjugates taken in the
opposite order. Finally, let us show that:

$(| u \rangle \langle v |)^\dagger = | v \rangle \langle u |$    (B-57)

($| u \rangle$ is replaced by $\langle u |$, and $\langle v |$ by $| v \rangle$, and the order is changed). Applying
relation (B-49) to the operator $| u \rangle \langle v |$, we find:

$\langle \psi | (| u \rangle \langle v |)^\dagger | \varphi \rangle = [\langle \varphi | (| u \rangle \langle v |) | \psi \rangle]^*$    (B-58)

Now, if we use property (B-9) of the scalar product:

$[\langle \varphi | (| u \rangle \langle v |) | \psi \rangle]^* = \langle \varphi | u \rangle^* \langle v | \psi \rangle^* = \langle \psi | v \rangle \langle u | \varphi \rangle$
$= \langle \psi | (| v \rangle \langle u |) | \varphi \rangle$    (B-59)

By comparing (B-58) and (B-59), we can derive (B-57).

The result of the operation of Hermitian conjugation on a constant
remains to be found. We see from (B-6) and (B-53) that this operation
simply transforms $\lambda$ into $\lambda^*$ (complex conjugation). This is in agreement with
the fact that $\langle \varphi | \psi \rangle^* = \langle \psi | \varphi \rangle$.

Therefore, the Hermitian conjugate of a ket is a bra, and vice versa; that 
of an operator is its adjoint; that of a number, its complex conjugate. In Dirac 
notation, the operation of Hermitian conjugation is very simple to perform; it 
suffices to apply the following rule: 


RULE

To obtain the Hermitian conjugate (or the adjoint) of any expression
composed of constants, kets, bras and operators, one must:

- Replace:
    the constants by their complex conjugates
    the kets by the bras associated with them
    the bras by the kets associated with them
    the operators by their adjoints

- Reverse the order of the factors (the position of the constants, nevertheless,
is of no importance).


EXAMPLES

$\lambda \langle u | A | v \rangle | w \rangle \langle \psi |$ is an operator ($\lambda$ and $\langle u | A | v \rangle$ are numbers). The
adjoint of this operator is obtained by using the preceding rule:
$| \psi \rangle \langle w | \langle v | A^\dagger | u \rangle \lambda^*$, which can also be written $\lambda^* \langle v | A^\dagger | u \rangle | \psi \rangle \langle w |$,
changing the position of the numbers $\lambda^*$ and $\langle v | A^\dagger | u \rangle$.

In the same way, $\lambda | u \rangle \langle v | w \rangle$ is a ket ($\lambda$ and $\langle v | w \rangle$ are constants). The
conjugate bra is $\langle w | v \rangle \langle u | \lambda^*$, which can also be written $\lambda^* \langle w | v \rangle \langle u |$.
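The rule and the second example above can be checked on components. In this minimal sketch, a bra is assumed to be represented by the complex conjugates of the components of its ket (the convention adopted later in § C-3); the vectors `u`, `v`, `w` and the constant `lam` are arbitrary.

```python
def outer(ket, other):
    """Matrix of |ket><other|: element (i, j) is ket_i * conj(other_j)."""
    return [[a * b.conjugate() for b in other] for a in ket]

def dagger(X):
    n = len(X)
    return [[X[j][i].conjugate() for j in range(n)] for i in range(n)]

def braket(phi, psi):
    """Scalar product <phi|psi>."""
    return sum(p.conjugate() * q for p, q in zip(phi, psi))

u = [1, 2j]
v = [1j, 1]
w = [2, 1 + 1j]
lam = 1 - 2j

# (B-57): the adjoint of |u><v| is |v><u|.
uv_adjoint = dagger(outer(u, v))
vu = outer(v, u)

# Second example: the ket lam |u><v|w> and its conjugate bra.
ket = [lam * braket(v, w) * c for c in u]
bra_from_ket = [c.conjugate() for c in ket]
# Rule: conjugate the constants, turn |u> into <u|, <v|w> into <w|v>.
bra_from_rule = [lam.conjugate() * braket(w, v) * c.conjugate() for c in u]
```

Both routes give the same row of components, as the rule predicts.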

e. HERMITIAN OPERATORS 

An operator $A$ is said to be Hermitian if it is equal to its adjoint, that is, if:

$A = A^\dagger$    (B-60)

Combining (B-60) and (B-49), we see that a Hermitian operator satisfies
the relation:

$\langle \psi | A | \varphi \rangle = \langle \varphi | A | \psi \rangle^*$    (B-61)

which is valid for all $| \varphi \rangle$ and $| \psi \rangle$.

Finally, for a Hermitian operator, (B-56) becomes

$\langle A \varphi | \psi \rangle = \langle \varphi | A \psi \rangle$    (B-62)

We shall treat Hermitian operators in more detail later, when we consider
the problem of eigenvalues and eigenvectors. Moreover, we shall see in chapter III
that Hermitian operators play a fundamental role in quantum mechanics.

If formula (B-57) is applied to the case where $| u \rangle = | v \rangle = | \psi \rangle$, we see that
the projector $P_\psi = | \psi \rangle \langle \psi |$ is Hermitian:

$P_\psi^\dagger = | \psi \rangle \langle \psi | = P_\psi$    (B-63)

COMMENT:

The product of two Hermitian operators $A$ and $B$ is Hermitian only if $[A, B] = 0$.
Indeed, if $A = A^\dagger$ and $B = B^\dagger$, it can be shown using (B-55) that $(AB)^\dagger = B^\dagger A^\dagger = BA$,
which is equal to $AB$ only if $[A, B] = 0$.
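The comment on products of Hermitian operators can be illustrated with 2 x 2 Hermitian matrices. This is a sketch under the assumption that operators are represented by matrices and the adjoint by the conjugate transpose; the matrices `X`, `Z` and `C` are illustrative choices (the first two happen to be Pauli matrices).

```python
def dagger(M):
    n = len(M)
    return [[M[j][i].conjugate() for j in range(n)] for i in range(n)]

def matmul(M, N):
    n = len(M)
    return [[sum(M[i][k] * N[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def is_hermitian(M):
    return M == dagger(M)

X = [[0, 1], [1, 0]]    # Hermitian
Z = [[1, 0], [0, -1]]   # Hermitian, does not commute with X
C = [[2, 1], [1, 2]]    # Hermitian, commutes with X (C = 2*identity + X)

XZ = matmul(X, Z)   # product of non-commuting Hermitian operators
XC = matmul(X, C)   # product of commuting Hermitian operators
```

`XZ` fails the Hermiticity test while `XC` passes it, in line with $(AB)^\dagger = BA$.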


C. REPRESENTATIONS IN STATE SPACE 


1. Introduction 

a. DEFINITION OF A REPRESENTATION 

Choosing a representation means choosing an orthonormal basis, either discrete
or continuous, in the state space $\mathcal{E}$. Vectors and operators are then represented
in this basis by numbers: components for the vectors, matrix elements for
the operators. The vectorial calculus introduced in § B then becomes a matrix
calculus with these numbers. The choice of a representation is, in theory, arbitrary.
Actually, it obviously depends on the particular problem being studied: in each
case, one chooses the representation which leads to the simplest calculations.

b. AIM OF SECTION C

Using the Dirac notation, and for any arbitrary $\mathcal{E}$ space, we are going to
treat again all the concepts introduced in §§ A-2 and A-3 for discrete and
continuous bases of $\mathcal{F}$.

We shall write the two characteristic relations of a basis in Dirac notation:
the orthonormalization and closure relations. Then we shall show how, using
these two relations, it is possible to solve all specific problems involving a
representation and the transformation from one representation to another.


2. Relations characteristic of an orthonormal basis 


a. ORTHONORMALIZATION RELATION 


A set of kets, discrete ($\{ | u_i \rangle \}$) or continuous ($\{ | w_\alpha \rangle \}$), is said to be
orthonormal if the kets of this set satisfy the orthonormalization relation:

$\langle u_i | u_j \rangle = \delta_{ij}$    (C-1)

or

$\langle w_\alpha | w_{\alpha'} \rangle = \delta(\alpha - \alpha')$    (C-2)

It can be seen that, for a continuous set, $\langle w_\alpha | w_\alpha \rangle$ does not exist: the $| w_\alpha \rangle$
have an infinite norm and therefore do not belong to $\mathcal{E}$. Nevertheless, the vectors
of $\mathcal{E}$ can be expanded on the $| w_\alpha \rangle$. It is useful, consequently, to accept the $| w_\alpha \rangle$
as generalized kets (see the discussions in §§ A-3 and B-2-c).


b. CLOSURE RELATION 

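A discrete set, $\{ | u_i \rangle \}$, or a continuous one, $\{ | w_\alpha \rangle \}$, constitutes a basis
if every ket $| \psi \rangle$ belonging to $\mathcal{E}$ has a unique expansion on the $| u_i \rangle$ or the $| w_\alpha \rangle$:

$| \psi \rangle = \sum_i c_i | u_i \rangle$    (C-3)

$| \psi \rangle = \int d\alpha \, c(\alpha) | w_\alpha \rangle$    (C-4)

Let us assume, moreover, that the basis is orthonormal. Then perform the
scalar multiplication on both sides of (C-3) with $\langle u_j |$, and on both sides of (C-4)
with $\langle w_{\alpha'} |$. We obtain, using (C-1) or (C-2), expressions for the components $c_j$
or $c(\alpha')$:

$\langle u_j | \psi \rangle = c_j$    (C-5)

$\langle w_{\alpha'} | \psi \rangle = c(\alpha')$    (C-6)

Then replace in (C-3) $c_i$ by $\langle u_i | \psi \rangle$, and in (C-4) $c(\alpha)$ by $\langle w_\alpha | \psi \rangle$:

$| \psi \rangle = \sum_i c_i | u_i \rangle = \sum_i \langle u_i | \psi \rangle | u_i \rangle$
$= \sum_i | u_i \rangle \langle u_i | \psi \rangle = \left( \sum_i | u_i \rangle \langle u_i | \right) | \psi \rangle$    (C-7)

$| \psi \rangle = \int d\alpha \, c(\alpha) | w_\alpha \rangle = \int d\alpha \, \langle w_\alpha | \psi \rangle | w_\alpha \rangle$
$= \int d\alpha \, | w_\alpha \rangle \langle w_\alpha | \psi \rangle = \left( \int d\alpha \, | w_\alpha \rangle \langle w_\alpha | \right) | \psi \rangle$    (C-8)

[since, in (C-7), we can place the number $\langle u_i | \psi \rangle$ after the ket $| u_i \rangle$; in the
same way, in (C-8), we can place the number $\langle w_\alpha | \psi \rangle$ after the ket $| w_\alpha \rangle$].

Thus, we see two operators appear, $\sum_i | u_i \rangle \langle u_i |$ and $\int d\alpha \, | w_\alpha \rangle \langle w_\alpha |$. They
act on every ket $| \psi \rangle$ belonging to $\mathcal{E}$ to give the same ket $| \psi \rangle$. Since $| \psi \rangle$ is
arbitrary, it follows that:

$P_{\{u_i\}} = \sum_i | u_i \rangle \langle u_i | = \mathbb{1}$    (C-9)

$P_{\{w_\alpha\}} = \int d\alpha \, | w_\alpha \rangle \langle w_\alpha | = \mathbb{1}$    (C-10)

where $\mathbb{1}$ denotes the identity operator in $\mathcal{E}$. Relation (C-9), or (C-10), is called the
closure relation. Conversely, let us show that relations (C-9) and (C-10) express
the fact that the sets $\{ | u_i \rangle \}$ and $\{ | w_\alpha \rangle \}$ constitute bases. For every $| \psi \rangle$
belonging to $\mathcal{E}$, we can write:

$| \psi \rangle = \mathbb{1} | \psi \rangle = P_{\{u_i\}} | \psi \rangle = \sum_i | u_i \rangle \langle u_i | \psi \rangle$
$= \sum_i c_i | u_i \rangle$    (C-11)

with:

$c_i = \langle u_i | \psi \rangle$    (C-12)

In the same way:

$| \psi \rangle = \mathbb{1} | \psi \rangle = P_{\{w_\alpha\}} | \psi \rangle = \int d\alpha \, | w_\alpha \rangle \langle w_\alpha | \psi \rangle$
$= \int d\alpha \, c(\alpha) | w_\alpha \rangle$    (C-13)

with:

$c(\alpha) = \langle w_\alpha | \psi \rangle$    (C-14)

Thus, every ket $| \psi \rangle$ has a unique expansion on the $| u_i \rangle$ or on the $| w_\alpha \rangle$.
Each of these two sets therefore forms a basis, a discrete one or a continuous one.
We also see that relation (C-9), or (C-10), spares us the need of memorizing
expressions (C-12) and (C-14) for the components $c_i$ and $c(\alpha)$.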


COMMENTS:

(i) We shall see later (§ E) that, in the case of the $\mathcal{F}$-space, relations (A-32) and (A-57)
can easily be deduced from (C-9) and (C-10).

(ii) Geometrical interpretation of the closure relation.

From the discussion of § B-3-b, we see that $\sum_i | u_i \rangle \langle u_i |$ is a projector: the
projector onto the subspace $\mathcal{E}'$ spanned by $| u_1 \rangle, | u_2 \rangle, \ldots, | u_i \rangle, \ldots$ If the $| u_i \rangle$ form
a basis, every ket of $\mathcal{E}$ can be expanded on the $| u_i \rangle$; the subspace $\mathcal{E}'$ is then identical
with the $\mathcal{E}$-space itself. Consequently, it is reasonable for $\sum_i | u_i \rangle \langle u_i |$ to be equal to the
identity operator: projecting onto $\mathcal{E}$ a ket which belongs to $\mathcal{E}$ does not modify this ket.

The same argument can be applied to $\int d\alpha \, | w_\alpha \rangle \langle w_\alpha |$.

We can now find an equivalent of the closure relation for the three-dimensional
space of ordinary geometry, $R^3$. If $\mathbf{e}_1$, $\mathbf{e}_2$ and $\mathbf{e}_3$ are three orthonormal vectors of this
space, and $P_1$, $P_2$ and $P_3$ are the projectors onto these three vectors, the fact that
$\{ \mathbf{e}_1, \mathbf{e}_2, \mathbf{e}_3 \}$ forms a basis in $R^3$ is expressed by the relation

$P_1 + P_2 + P_3 = \mathbb{1}$    (C-15)

On the other hand, $\{ \mathbf{e}_1, \mathbf{e}_2 \}$ constitutes an orthonormal set but not a basis of $R^3$.
This is expressed by the fact that the projector $P_1 + P_2$ (which projects onto the plane
spanned by $\mathbf{e}_1$ and $\mathbf{e}_2$) is not equal to $\mathbb{1}$; for example: $(P_1 + P_2) \mathbf{e}_3 = 0$.


Table (II-7) summarizes the only fundamental formulas which are required
for any calculation to be performed in the $\{ | u_i \rangle \}$ or $\{ | w_\alpha \rangle \}$ representation.


$\{ | u_i \rangle \}$ representation                                   $\{ | w_\alpha \rangle \}$ representation

$\langle u_i | u_j \rangle = \delta_{ij}$                                   $\langle w_\alpha | w_{\alpha'} \rangle = \delta(\alpha - \alpha')$

$P_{\{u_i\}} = \sum_i | u_i \rangle \langle u_i | = \mathbb{1}$             $P_{\{w_\alpha\}} = \int d\alpha \, | w_\alpha \rangle \langle w_\alpha | = \mathbb{1}$


Table (II-7)
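The $R^3$ illustration of comment (ii) can be reproduced numerically. The sketch below assumes one particular orthonormal triple (a hypothetical choice, rotated so that the projectors are not trivially diagonal) and builds the projectors as outer products of real vectors.

```python
import math

s = 1 / math.sqrt(2)
e1 = [s, s, 0.0]
e2 = [s, -s, 0.0]
e3 = [0.0, 0.0, 1.0]

def projector(e):
    """Matrix of the orthogonal projector onto the unit vector e."""
    return [[a * b for b in e] for a in e]

def add(*matrices):
    return [[sum(M[i][j] for M in matrices) for j in range(3)] for i in range(3)]

def apply(M, v):
    return [sum(M[i][k] * v[k] for k in range(3)) for i in range(3)]

P1, P2, P3 = projector(e1), projector(e2), projector(e3)

full = add(P1, P2, P3)   # closure relation (C-15): should be the identity
partial = add(P1, P2)    # projector onto the plane spanned by e1 and e2

identity = [[1.0 if i == j else 0.0 for j in range(3)] for i in range(3)]
image_of_e3 = apply(partial, e3)   # (P1 + P2) e3
```

`full` agrees with the identity to rounding error, whereas `partial` annihilates `e3`, which is how the missing basis vector shows up.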


3. Representation of kets and bras 

a. REPRESENTATION OF KETS 

In the $\{ | u_i \rangle \}$ basis, the ket $| \psi \rangle$ is represented by the set of its components,
that is, by the set of numbers $c_i = \langle u_i | \psi \rangle$. These numbers can be arranged
vertically to form a one-column matrix (with, in general, a countable infinity
of rows):

$\begin{pmatrix} \langle u_1 | \psi \rangle \\ \langle u_2 | \psi \rangle \\ \vdots \\ \langle u_i | \psi \rangle \\ \vdots \end{pmatrix}$    (C-16)

In the continuous $\{ | w_\alpha \rangle \}$ basis, the ket $| \psi \rangle$ is represented by a continuous
infinity of numbers, $c(\alpha) = \langle w_\alpha | \psi \rangle$, that is, by a function of $\alpha$. It is then possible
to draw a vertical axis, along which are placed the various possible values of $\alpha$.
To each of these values corresponds a number, $\langle w_\alpha | \psi \rangle$:

$\alpha \downarrow \begin{pmatrix} \vdots \\ \langle w_\alpha | \psi \rangle \\ \vdots \end{pmatrix}$    (C-17)

b. REPRESENTATION OF BRAS 

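Let $\langle \varphi |$ be an arbitrary bra. In the $\{ | u_i \rangle \}$ basis, we can write:

$\langle \varphi | = \langle \varphi | \mathbb{1} = \langle \varphi | P_{\{u_i\}} = \sum_i \langle \varphi | u_i \rangle \langle u_i |$    (C-18)

$\langle \varphi |$ has a unique expansion on the bras $\langle u_i |$. The components of $\langle \varphi |$, $\langle \varphi | u_i \rangle$,
are the complex conjugates of the components $b_i = \langle u_i | \varphi \rangle$ of the ket $| \varphi \rangle$
associated with $\langle \varphi |$.

In the same way, we obtain, in the $\{ | w_\alpha \rangle \}$ basis:

$\langle \varphi | = \langle \varphi | \mathbb{1} = \langle \varphi | P_{\{w_\alpha\}} = \int d\alpha \, \langle \varphi | w_\alpha \rangle \langle w_\alpha |$    (C-19)

The components of $\langle \varphi |$, $\langle \varphi | w_\alpha \rangle$, are the complex conjugates of the
components $b(\alpha) = \langle w_\alpha | \varphi \rangle$ of the ket $| \varphi \rangle$ associated with $\langle \varphi |$.

We have agreed to arrange the components of a ket vertically. Before describing
how to arrange the components of a bra, let us show how the closure relation
enables us to find simply the expression for the scalar product of two kets in terms
of their components. We know that we can always place $\mathbb{1}$ between $\langle \varphi |$ and $| \psi \rangle$
in the expression for the scalar product:

$\langle \varphi | \psi \rangle = \langle \varphi | \mathbb{1} | \psi \rangle = \langle \varphi | P_{\{u_i\}} | \psi \rangle$
$= \sum_i \langle \varphi | u_i \rangle \langle u_i | \psi \rangle = \sum_i b_i^* c_i$    (C-20)

In the same way:

$\langle \varphi | \psi \rangle = \langle \varphi | \mathbb{1} | \psi \rangle = \langle \varphi | P_{\{w_\alpha\}} | \psi \rangle$
$= \int d\alpha \, \langle \varphi | w_\alpha \rangle \langle w_\alpha | \psi \rangle = \int d\alpha \, b^*(\alpha) \, c(\alpha)$    (C-21)

Let us arrange the components $\langle \varphi | u_i \rangle$ of the bra $\langle \varphi |$ horizontally, to form
a row matrix (having one row and an infinite number of columns):

$( \langle \varphi | u_1 \rangle \;\; \langle \varphi | u_2 \rangle \;\; \cdots \;\; \langle \varphi | u_i \rangle \;\; \cdots )$    (C-22)

Using this convention, $\langle \varphi | \psi \rangle$ is the matrix product of the column matrix which
represents $| \psi \rangle$ and the row matrix which represents $\langle \varphi |$. The result is a matrix
having one row and one column, that is, a number.

In the $\{ | w_\alpha \rangle \}$ basis, $\langle \varphi |$ has a continuous infinity of components $\langle \varphi | w_\alpha \rangle$.
The various values of $\alpha$ are placed along a horizontal axis. To each of these values
corresponds a component $\langle \varphi | w_\alpha \rangle$ of $\langle \varphi |$:

$( \cdots \;\; \langle \varphi | w_\alpha \rangle \;\; \cdots )$, with $\alpha$ running along the horizontal axis    (C-23)


COMMENT:

In a given representation, the matrices which represent a ket $| \psi \rangle$ and the
associated bra $\langle \psi |$ are Hermitian conjugates of each other (in the matrix sense): one
passes from one matrix to the other by interchanging rows and columns and taking the
complex conjugate of each element.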
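Formula (C-20) and the expansion (C-11) can be checked in a two-dimensional complex space. This is a minimal sketch: the orthonormal basis `basis` and the kets `phi` and `psi` are arbitrary illustrative choices, and kets are given by their components in some underlying orthonormal basis.

```python
import math

def braket(phi, psi):
    """Scalar product <phi|psi>."""
    return sum(p.conjugate() * q for p, q in zip(phi, psi))

s = 1 / math.sqrt(2)
u1 = [s, 1j * s]
u2 = [s, -1j * s]
basis = [u1, u2]    # orthonormal: <u_i|u_j> = delta_ij

phi = [1 + 1j, 2]
psi = [3, 1j]

b = [braket(u, phi) for u in basis]   # components b_i = <u_i|phi>
c = [braket(u, psi) for u in basis]   # components c_i = <u_i|psi>

# Row matrix (b_1*, b_2*) times column matrix (c_1, c_2), as in (C-20).
via_components = sum(bi.conjugate() * ci for bi, ci in zip(b, c))
direct = braket(phi, psi)

# Expansion (C-11): psi = sum_i c_i |u_i>.
rebuilt = [sum(ci * u[k] for ci, u in zip(c, basis)) for k in range(2)]
```

The scalar product computed from the components in the `basis` representation agrees with the one computed directly, independently of which orthonormal basis is used.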


4. Representation of operators 


a. REPRESENTATION OF A BY A "SQUARE" MATRIX

Given a linear operator $A$, we can, in a $\{ | u_i \rangle \}$ or $\{ | w_\alpha \rangle \}$ basis, associate
with it a series of numbers defined by:

$A_{ij} = \langle u_i | A | u_j \rangle$    (C-24)

or

$A(\alpha, \alpha') = \langle w_\alpha | A | w_{\alpha'} \rangle$    (C-25)

These numbers depend on two indices and can therefore be arranged in
a "square" matrix having a countable or continuous infinity of rows and columns.
The usual convention is to have the first index fix the rows and the second, the
columns. Thus, in the $\{ | u_i \rangle \}$ basis, the operator $A$ is represented by the matrix:

$\begin{pmatrix} A_{11} & A_{12} & \cdots & A_{1j} & \cdots \\ A_{21} & A_{22} & \cdots & A_{2j} & \cdots \\ \vdots & \vdots & & \vdots & \\ A_{i1} & A_{i2} & \cdots & A_{ij} & \cdots \\ \vdots & \vdots & & \vdots & \end{pmatrix}$    (C-26)

We see that the $j$th column is made up of the components in the $\{ | u_i \rangle \}$ basis of
the transform $A | u_j \rangle$ of the basis vector $| u_j \rangle$.

For a continuous basis, we draw two perpendicular axes. To a point which
has for its abscissa $\alpha'$ and for its ordinate $\alpha$ there corresponds the number $A(\alpha, \alpha')$:

[Diagram: horizontal axis $\alpha'$, vertical axis $\alpha$; the number $A(\alpha, \alpha')$ is placed at the point of abscissa $\alpha'$ and ordinate $\alpha$.]    (C-27)

Let us use the closure relation to calculate the matrix which represents the
operator $AB$ in the $\{ | u_i \rangle \}$ basis:

$\langle u_i | AB | u_j \rangle = \langle u_i | A \mathbb{1} B | u_j \rangle$
$= \langle u_i | A P_{\{u_k\}} B | u_j \rangle$
$= \sum_k \langle u_i | A | u_k \rangle \langle u_k | B | u_j \rangle$    (C-28)

The convention chosen above for the arrangement of the elements $A_{ij}$ [or $A(\alpha, \alpha')$]
is therefore consistent with the one relating to the product of two matrices:
(C-28) expresses the fact that the matrix representing the operator $AB$ is the
product of the matrices associated with $A$ and $B$.


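b. MATRIX REPRESENTATION OF THE KET $| \psi' \rangle = A | \psi \rangle$

The problem is the following: knowing the components of $| \psi \rangle$ and the
matrix elements of $A$ in a given representation, how can we calculate the components
of $| \psi' \rangle = A | \psi \rangle$ in the same representation?

In the $\{ | u_i \rangle \}$ basis, the coordinates $c_i'$ of $| \psi' \rangle$ are given by:

$c_i' = \langle u_i | \psi' \rangle = \langle u_i | A | \psi \rangle$    (C-29)

If we simply insert the closure relation between $A$ and $| \psi \rangle$, we obtain:

$c_i' = \langle u_i | A \mathbb{1} | \psi \rangle = \langle u_i | A P_{\{u_j\}} | \psi \rangle$
$= \sum_j \langle u_i | A | u_j \rangle \langle u_j | \psi \rangle$
$= \sum_j A_{ij} c_j$    (C-30)

For the $\{ | w_\alpha \rangle \}$ basis, we obtain, in the same way:

$c'(\alpha) = \langle w_\alpha | \psi' \rangle = \langle w_\alpha | A | \psi \rangle$
$= \int d\alpha' \, \langle w_\alpha | A | w_{\alpha'} \rangle \langle w_{\alpha'} | \psi \rangle$
$= \int d\alpha' \, A(\alpha, \alpha') \, c(\alpha')$    (C-31)

The matrix expression for $| \psi' \rangle = A | \psi \rangle$ is therefore very simple. We see, for
example from (C-30), that the column matrix representing $| \psi' \rangle$ is equal to the
product of the column matrix representing $| \psi \rangle$ and the square matrix
representing $A$:

$\begin{pmatrix} c_1' \\ c_2' \\ \vdots \\ c_i' \\ \vdots \end{pmatrix} = \begin{pmatrix} A_{11} & A_{12} & \cdots & A_{1j} & \cdots \\ A_{21} & A_{22} & \cdots & A_{2j} & \cdots \\ \vdots & \vdots & & \vdots & \\ A_{i1} & A_{i2} & \cdots & A_{ij} & \cdots \\ \vdots & \vdots & & \vdots & \end{pmatrix} \begin{pmatrix} c_1 \\ c_2 \\ \vdots \\ c_j \\ \vdots \end{pmatrix}$    (C-32)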
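c. EXPRESSION FOR THE NUMBER $\langle \varphi | A | \psi \rangle$

By inserting the closure relation between $\langle \varphi |$ and $A$ and again between $A$
and $| \psi \rangle$, we obtain:

- for the $\{ | u_i \rangle \}$ basis:

$\langle \varphi | A | \psi \rangle = \langle \varphi | P_{\{u_i\}} A P_{\{u_j\}} | \psi \rangle$
$= \sum_{i,j} \langle \varphi | u_i \rangle \langle u_i | A | u_j \rangle \langle u_j | \psi \rangle$
$= \sum_{i,j} b_i^* A_{ij} c_j$    (C-33)

- for the $\{ | w_\alpha \rangle \}$ basis:

$\langle \varphi | A | \psi \rangle = \langle \varphi | P_{\{w_\alpha\}} A P_{\{w_{\alpha'}\}} | \psi \rangle$
$= \iint d\alpha \, d\alpha' \, \langle \varphi | w_\alpha \rangle \langle w_\alpha | A | w_{\alpha'} \rangle \langle w_{\alpha'} | \psi \rangle$
$= \iint d\alpha \, d\alpha' \, b^*(\alpha) \, A(\alpha, \alpha') \, c(\alpha')$    (C-34)

The interpretation of these formulas in the matrix formalism is as follows:
$\langle \varphi | A | \psi \rangle$ is a number, that is, a matrix with one row and one column,
obtained by multiplying the column matrix representing $| \psi \rangle$ first by the square
matrix representing $A$ and then by the row matrix representing $\langle \varphi |$. For example,
in the $\{ | u_i \rangle \}$ basis:

$\langle \varphi | A | \psi \rangle = ( b_1^* \;\; b_2^* \;\; \cdots \;\; b_i^* \;\; \cdots ) \begin{pmatrix} A_{11} & A_{12} & \cdots & A_{1j} & \cdots \\ A_{21} & A_{22} & \cdots & A_{2j} & \cdots \\ \vdots & \vdots & & \vdots & \\ A_{i1} & A_{i2} & \cdots & A_{ij} & \cdots \\ \vdots & \vdots & & \vdots & \end{pmatrix} \begin{pmatrix} c_1 \\ c_2 \\ \vdots \\ c_j \\ \vdots \end{pmatrix}$    (C-35)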
COMMENTS:

(i) It can be shown in the same way that the bra $\langle \varphi | A$ is represented by a row matrix, the
product of the square matrix representing $A$ by the row matrix representing $\langle \varphi |$ [the
first two matrices of the right-hand side of (C-35)]. Again we see the importance of the
order of the symbols: the expression $A \langle \varphi |$ would lead to a matrix operation which is
undefined (the product of a row matrix by a square matrix).

(ii) From a matrix point of view, equation (B-41) which defines $\langle \varphi | A$ merely expresses
the associativity of the product of the three matrices which appear in (C-35).

(iii) Using the preceding conventions, we express $| \psi \rangle \langle \psi |$ by a square matrix:

$\begin{pmatrix} c_1 c_1^* & c_1 c_2^* & \cdots & c_1 c_j^* & \cdots \\ c_2 c_1^* & c_2 c_2^* & \cdots & c_2 c_j^* & \cdots \\ \vdots & \vdots & & \vdots & \\ c_i c_1^* & c_i c_2^* & \cdots & c_i c_j^* & \cdots \\ \vdots & \vdots & & \vdots & \end{pmatrix}$    (C-36)

This is indeed an operator, while $\langle \psi | \psi \rangle$, the product of a column matrix by a row matrix,
is a number.


d. MATRIX REPRESENTATION OF THE ADJOINT $A^\dagger$ OF $A$

Using (B-49), we obtain easily:

$(A^\dagger)_{ij} = \langle u_i | A^\dagger | u_j \rangle = \langle u_j | A | u_i \rangle^* = A_{ji}^*$    (C-37)

or

$A^\dagger(\alpha, \alpha') = \langle w_\alpha | A^\dagger | w_{\alpha'} \rangle = \langle w_{\alpha'} | A | w_\alpha \rangle^* = A^*(\alpha', \alpha)$    (C-38)

Therefore, the matrices representing $A$ and $A^\dagger$ in a given representation are
Hermitian conjugates of each other, in the matrix sense: one passes from one
to the other by interchanging rows and columns and then taking the complex
conjugate.

If $A$ is Hermitian, $A^\dagger = A$, and we can then replace $(A^\dagger)_{ij}$ by $A_{ij}$ in (C-37),
and $A^\dagger(\alpha, \alpha')$ by $A(\alpha, \alpha')$ in (C-38):

$A_{ij} = A_{ji}^*$    (C-39)

$A(\alpha, \alpha') = A^*(\alpha', \alpha)$    (C-40)

A Hermitian operator is therefore represented by a Hermitian matrix, that is, one
in which any two elements which are symmetric with respect to the principal diagonal
are complex conjugates of each other. In particular, for $i = j$ or $\alpha = \alpha'$, (C-39)
and (C-40) become:

$A_{ii} = A_{ii}^*$    (C-41)

$A(\alpha, \alpha) = A^*(\alpha, \alpha)$    (C-42)

The diagonal elements of a Hermitian matrix are therefore always real numbers.
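Relations (C-37), (C-39) and (C-41) hold in any orthonormal basis, not only in the one used to write the operator down. The sketch below checks this by computing matrix elements $\langle u_i | X | u_j \rangle$ in a rotated basis; the operators `A`, `H` and the basis are arbitrary illustrative choices, and `elements` is a hypothetical helper name.

```python
import math

def braket(phi, psi):
    return sum(p.conjugate() * q for p, q in zip(phi, psi))

def apply(X, ket):
    return [sum(X[i][k] * ket[k] for k in range(len(ket))) for i in range(len(X))]

def dagger(X):
    n = len(X)
    return [[X[j][i].conjugate() for j in range(n)] for i in range(n)]

s = 1 / math.sqrt(2)
basis = [[s, 1j * s], [s, -1j * s]]   # orthonormal basis {|u_1>, |u_2>}

def elements(X):
    """Matrix elements <u_i|X|u_j> of X in the chosen basis, as in (C-24)."""
    return [[braket(ui, apply(X, uj)) for uj in basis] for ui in basis]

A = [[1, 2j], [3, 4]]             # generic (non-Hermitian) operator
H = [[2, 1 - 1j], [1 + 1j, 3]]    # Hermitian operator

A_rep = elements(A)
A_dag_rep = elements(dagger(A))
H_rep = elements(H)
```

In the rotated basis, `A_dag_rep[i][j]` equals the complex conjugate of `A_rep[j][i]`, and the diagonal of `H_rep` is real up to rounding.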
5. Change of representations

a. OUTLINE OF THE PROBLEM

In a given representation, a ket (or a bra, or an operator) is represented by a
matrix. If we change representations, that is, bases, the same ket (or bra, or
operator) will be represented by a different matrix. How are these two matrices
related?

For the sake of simplicity, we shall assume here that we are going from one
discrete orthonormal basis $\{ | u_i \rangle \}$ to another discrete orthonormal basis $\{ | t_k \rangle \}$.
In § E, we shall study an example of changing from one continuous basis to another
continuous basis.

The change of basis is defined by specifying the components $\langle u_i | t_k \rangle$ of
each of the kets of the new basis in terms of each of the kets of the old one. We shall
set:

$S_{ik} = \langle u_i | t_k \rangle$    (C-43)

$S$ is the matrix of the basis change (transformation matrix). If $S^\dagger$ denotes its
Hermitian conjugate:

$(S^\dagger)_{ki} = (S_{ik})^* = \langle t_k | u_i \rangle$    (C-44)

The following calculations can be performed very easily, and without
memorization, by using the two closure relations:

$P_{\{u_i\}} = \sum_i | u_i \rangle \langle u_i | = \mathbb{1}$    (C-45)

$P_{\{t_k\}} = \sum_k | t_k \rangle \langle t_k | = \mathbb{1}$    (C-46)

and the two orthonormalization relations:

$\langle u_i | u_j \rangle = \delta_{ij}$    (C-47)

$\langle t_k | t_l \rangle = \delta_{kl}$    (C-48)


COMMENT:

The transformation matrix, $S$, is unitary (complement $C_{II}$). That is, it satisfies:

$S^\dagger S = S S^\dagger = I$    (C-49)

where $I$ is the unit matrix. Indeed, we see that:

$(S^\dagger S)_{kl} = \sum_i S^\dagger_{ki} S_{il} = \sum_i \langle t_k | u_i \rangle \langle u_i | t_l \rangle$
$= \langle t_k | t_l \rangle = \delta_{kl}$    (C-50)

In the same way:

$(S S^\dagger)_{ij} = \sum_k S_{ik} S^\dagger_{kj} = \sum_k \langle u_i | t_k \rangle \langle t_k | u_j \rangle$
$= \langle u_i | u_j \rangle = \delta_{ij}$    (C-51)
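b. TRANSFORMATION OF THE COMPONENTS OF A KET

To obtain the components $\langle t_k | \psi \rangle$ of a ket $| \psi \rangle$ in the new basis from its
components $\langle u_i | \psi \rangle$ in the old basis, one simply inserts (C-45) between $\langle t_k |$
and $| \psi \rangle$:

$\langle t_k | \psi \rangle = \langle t_k | \mathbb{1} | \psi \rangle = \langle t_k | P_{\{u_i\}} | \psi \rangle$
$= \sum_i \langle t_k | u_i \rangle \langle u_i | \psi \rangle$
$= \sum_i S^\dagger_{ki} \langle u_i | \psi \rangle$    (C-52)

The inverse expressions can be derived in the same way, using (C-46):

$\langle u_i | \psi \rangle = \langle u_i | \mathbb{1} | \psi \rangle = \langle u_i | P_{\{t_k\}} | \psi \rangle$
$= \sum_k \langle u_i | t_k \rangle \langle t_k | \psi \rangle$
$= \sum_k S_{ik} \langle t_k | \psi \rangle$    (C-53)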

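c. TRANSFORMATION OF THE COMPONENTS OF A BRA

The principle of the calculation is exactly the same. For example:

$\langle \psi | t_k \rangle = \langle \psi | \mathbb{1} | t_k \rangle = \langle \psi | P_{\{u_i\}} | t_k \rangle$
$= \sum_i \langle \psi | u_i \rangle \langle u_i | t_k \rangle$
$= \sum_i \langle \psi | u_i \rangle S_{ik}$    (C-54)

d. TRANSFORMATION OF THE MATRIX ELEMENTS OF AN OPERATOR

If, in $\langle t_k | A | t_l \rangle$, we insert (C-45) between $\langle t_k |$ and $A$, and again between $A$
and $| t_l \rangle$, we obtain:

$\langle t_k | A | t_l \rangle = \langle t_k | P_{\{u_i\}} A P_{\{u_j\}} | t_l \rangle$
$= \sum_{i,j} \langle t_k | u_i \rangle \langle u_i | A | u_j \rangle \langle u_j | t_l \rangle$    (C-55)

that is:

$A_{kl} = \sum_{i,j} S^\dagger_{ki} A_{ij} S_{jl}$    (C-56)

In the same way:

$A_{ij} = \langle u_i | A | u_j \rangle = \langle u_i | P_{\{t_k\}} A P_{\{t_l\}} | u_j \rangle$
$= \sum_{k,l} \langle u_i | t_k \rangle \langle t_k | A | t_l \rangle \langle t_l | u_j \rangle$
$= \sum_{k,l} S_{ik} A_{kl} S^\dagger_{lj}$    (C-57)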
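The change-of-basis formulas (C-43), (C-49), (C-52) and (C-56) can be tried out in two dimensions. This is a minimal sketch, assuming the old basis is the one in which components are written and taking a hypothetical new basis `t`; the operator `A_old` and the ket components `c_old` are illustrative.

```python
import math

def braket(phi, psi):
    return sum(p.conjugate() * q for p, q in zip(phi, psi))

def dagger(X):
    n = len(X)
    return [[X[j][i].conjugate() for j in range(n)] for i in range(n)]

def matmul(X, Y):
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def apply(X, ket):
    return [sum(X[i][k] * ket[k] for k in range(len(ket))) for i in range(len(X))]

s = 1 / math.sqrt(2)
u = [[1, 0], [0, 1]]     # old basis {|u_i>}
t = [[s, s], [s, -s]]    # new basis {|t_k>}, written in the old one

S = [[braket(ui, tk) for tk in t] for ui in u]   # S_ik = <u_i|t_k>, (C-43)
S_dag = dagger(S)

A_old = [[1, 2], [2, 1]]                     # elements A_ij in the old basis
A_new = matmul(matmul(S_dag, A_old), S)      # elements A_kl in the new basis, (C-56)

c_old = [1, 0]                               # components <u_i|psi>
c_new = apply(S_dag, c_old)                  # components <t_k|psi>, (C-52)
c_back = apply(S, c_new)                     # inverse transformation, (C-53)
```

With this choice, `A_new` comes out diagonal (approximately diag(3, -1)), which previews the use of a change of representation to simplify an operator.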
D. EIGENVALUE EQUATIONS. OBSERVABLES

1. Eigenvalues and eigenvectors of an operator

a. DEFINITIONS

$| \psi \rangle$ is said to be an eigenvector (or eigenket) of the linear operator $A$ if:

$A | \psi \rangle = \lambda | \psi \rangle$    (D-1)

where $\lambda$ is a complex number. We are going to study a certain number of properties
of equation (D-1), the eigenvalue equation of the linear operator $A$. In general, this
equation possesses solutions only when $\lambda$ takes on certain values, called eigenvalues
of $A$. The set of the eigenvalues is called the spectrum of $A$.

Note that, if $| \psi \rangle$ is an eigenvector of $A$ with the eigenvalue $\lambda$, $\alpha | \psi \rangle$ (where $\alpha$
is an arbitrary complex number) is also an eigenvector of $A$ with the same
eigenvalue:

$A(\alpha | \psi \rangle) = \alpha A | \psi \rangle = \alpha \lambda | \psi \rangle = \lambda (\alpha | \psi \rangle)$    (D-2)

To rid ourselves of this ambiguity, we could agree to normalize the eigenvectors to 1:

$\langle \psi | \psi \rangle = 1$    (D-3)

But this does not completely remove the ambiguity, since $e^{i\theta} | \psi \rangle$, where $\theta$ is an arbitrary real
number, has the same norm as $| \psi \rangle$. We shall see below that, in quantum mechanics, the
physical predictions obtained using $| \psi \rangle$ or $e^{i\theta} | \psi \rangle$ are the same.

The eigenvalue $\lambda$ is called nondegenerate (or simple) when its corresponding
eigenvector is unique to within a constant factor, that is, when all its associated
eigenkets are collinear. On the other hand, if there exist at least two linearly
independent kets which are eigenvectors of $A$ with the same eigenvalue, this
eigenvalue is said to be degenerate. Its degree (or order) of degeneracy is then the
number of linearly independent eigenvectors which are associated with it (the
degree of degeneracy of an eigenvalue can be finite or infinite). For example, if $\lambda$
is $g$-fold degenerate, there correspond to it $g$ independent kets $| \psi^i \rangle$ ($i = 1, 2, \ldots, g$)
such that:

$A | \psi^i \rangle = \lambda | \psi^i \rangle$    (D-4)

But then every ket $| \psi \rangle$ of the form:

$| \psi \rangle = \sum_{i=1}^{g} c_i | \psi^i \rangle$    (D-5)

is an eigenvector of $A$ with the eigenvalue $\lambda$, whatever the coefficients $c_i$, since:

$A | \psi \rangle = \sum_{i=1}^{g} c_i A | \psi^i \rangle = \lambda \sum_{i=1}^{g} c_i | \psi^i \rangle = \lambda | \psi \rangle$    (D-6)

Consequently, the set of eigenkets of $A$ associated with $\lambda$ constitutes a $g$-dimensional
vector space (which can be infinite-dimensional), called the "eigensubspace" of the
eigenvalue $\lambda$. In particular, it is equivalent to say that $\lambda$ is nondegenerate or to say
that its degree of degeneracy is $g = 1$.

To illustrate these definitions, let us choose the example of a projector (§ B-3-b):
$P_\psi = |\psi\rangle\langle\psi|$ (with $\langle\psi|\psi\rangle = 1$). Its eigenvalue equation is written:

$$P_\psi|\varphi\rangle = \lambda|\varphi\rangle$$

that is,

$$|\psi\rangle\langle\psi|\varphi\rangle = \lambda|\varphi\rangle \tag{D-7}$$

The ket on the left-hand side is always collinear with $|\psi\rangle$ or zero. Consequently, the
eigenvectors of $P_\psi$ are: on the one hand, $|\psi\rangle$ itself, with an eigenvalue of $\lambda = 1$; on the other
hand, all the kets $|\varphi\rangle$ orthogonal to $|\psi\rangle$, for which the associated eigenvalue is $\lambda = 0$.
The spectrum of $P_\psi$ therefore includes only two values: 1 and 0. The first one is simple, the
second, infinitely degenerate (if the state space considered is infinite-dimensional). The
eigensubspace associated with $\lambda = 0$ is the supplement* of $|\psi\rangle$ (see § D-2-c).
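
The projector example can be checked numerically in a small space. The following is only a sketch: the three-dimensional space, the components of $|\psi\rangle$ and of the orthogonal ket, and the helper names `inner` and `apply_projector` are all choices made here for illustration and do not come from the text.

```python
# Projector P_psi = |psi><psi| acting in a 3-dimensional complex space.
# The dimension and the components below are illustrative assumptions.

def inner(u, v):
    """Scalar product <u|v>, antilinear in the first argument."""
    return sum(a.conjugate() * b for a, b in zip(u, v))

def apply_projector(psi, phi):
    """Return P_psi |phi> = |psi> <psi|phi> for a normalized |psi>."""
    overlap = inner(psi, phi)
    return [overlap * c for c in psi]

# A normalized ket |psi> and a ket |chi> orthogonal to it.
psi = [complex(0.6, 0.0), complex(0.0, 0.8), complex(0.0, 0.0)]
chi = [complex(0.0, 0.8), complex(0.6, 0.0), complex(0.0, 0.0)]

p_psi = apply_projector(psi, psi)   # should reproduce |psi> (eigenvalue 1)
p_chi = apply_projector(psi, chi)   # should be the null ket (eigenvalue 0)
```

Any other ket orthogonal to `psi` would behave like `chi`, which is the numerical counterpart of the degeneracy of the eigenvalue 0.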

COMMENTS:

(i) Taking the Hermitian conjugate of both sides of equation (D-1), we obtain:

$$\langle\psi|A^\dagger = \lambda^*\langle\psi| \tag{D-8}$$

Therefore, if $|\psi\rangle$ is an eigenket of $A$ with an eigenvalue $\lambda$, it can also be said
that $\langle\psi|$ is an eigenbra of $A^\dagger$ with an eigenvalue $\lambda^*$. However, let us stress
the fact that, except in the case where $A$ is Hermitian (§ D-2-a), nothing can
be said a priori about $\langle\psi|A$.

(ii) To be completely rigorous, one should solve the eigenvalue equation (D-1) in the space $\mathscr{E}$.
That is, one should consider only those eigenvectors $|\psi\rangle$ which have a finite norm.
In fact, we shall be obliged to use operators for which the eigenkets do not satisfy this
condition (§ E). Therefore, we shall grant that vectors which are solutions of (D-1) can
be "generalized kets".

b. FINDING THE EIGENVALUES AND EIGENVECTORS OF AN OPERATOR 

Given a linear operator A, how does one find all its eigenvalues and the 
corresponding eigenvectors? We are concerned with this question from a purely 
practical point of view. We shall consider the case where the state space is of finite 
dimension N, and we shall grant that the results can be generalized to an
infinite-dimensional state space.

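Let us choose a representation, for example, $\{|u_i\rangle\}$, and let us project
the vector equation (D-1) onto the various orthonormal basis vectors $|u_i\rangle$:

$$\langle u_i|A|\psi\rangle = \lambda\langle u_i|\psi\rangle \tag{D-9}$$

Inserting the closure relation between $A$ and $|\psi\rangle$, we obtain:

$$\sum_j \langle u_i|A|u_j\rangle\langle u_j|\psi\rangle = \lambda\langle u_i|\psi\rangle \tag{D-10}$$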

* In a vector space $\mathscr{E}$, two subspaces $\mathscr{E}_1$ and $\mathscr{E}_2$ are said to be supplementary if all kets $|\psi\rangle$ of $\mathscr{E}$
can be written $|\psi\rangle = |\psi_1\rangle + |\psi_2\rangle$, where $|\psi_1\rangle$ and $|\psi_2\rangle$ belong, respectively, to $\mathscr{E}_1$ and $\mathscr{E}_2$, and if
$\mathscr{E}_1$ and $\mathscr{E}_2$ are disjoint (no common non-zero ket; the expansion $|\psi\rangle = |\psi_1\rangle + |\psi_2\rangle$ is then unique).

Actually, there exists an infinity of subspaces $\mathscr{E}_2$ supplementary to a given subspace $\mathscr{E}_1$. One can fix $\mathscr{E}_2$
by forcing it to be orthogonal to $\mathscr{E}_1$. This shall be done throughout this book, even though the word
"orthogonal" will not be explicitly written before supplement.

Example: In ordinary three-dimensional space, if $\mathscr{E}_1$ is a plane P, $\mathscr{E}_2$ can be any arbitrary straight
line, not contained in P. The orthogonal supplement of $\mathscr{E}_1$ is the straight line passing through the origin
and orthogonal to P.

With the usual notation:

$$\langle u_i|\psi\rangle = c_i, \qquad \langle u_i|A|u_j\rangle = A_{ij} \tag{D-11}$$

equations (D-10) can be written:

$$\sum_j A_{ij}\,c_j = \lambda c_i \tag{D-12}$$

or

$$\sum_j \left[A_{ij} - \lambda\delta_{ij}\right] c_j = 0 \tag{D-13}$$



(D-13) can be considered to be a system of equations where the unknowns are
the $c_j$, the components of the eigenvector in the chosen representation. This system
is linear and homogeneous.

α. The characteristic equation

The system (D-13) consists of $N$ equations ($i = 1, 2, \ldots, N$) in $N$ unknowns $c_j$
($j = 1, 2, \ldots, N$). Since it is linear and homogeneous, it has a non-trivial solution
(the trivial solution is the one for which all the $c_j$ are zero) if and only if the
determinant of the coefficients is zero. This condition is written:

$$\mathrm{Det}\left[\mathscr{A} - \lambda I\right] = 0 \tag{D-14}$$

where $\mathscr{A}$ is the $N \times N$ matrix of elements $A_{ij}$ and $I$ is the unit matrix.

Equation (D-14), called the characteristic equation (or secular equation),
enables us to determine all the eigenvalues of the operator $A$, that is, its spectrum.
(D-14) can be written explicitly in the form:

$$\begin{vmatrix}
A_{11} - \lambda & A_{12} & A_{13} & \cdots & A_{1N} \\
A_{21} & A_{22} - \lambda & A_{23} & \cdots & A_{2N} \\
\vdots & \vdots & \vdots & & \vdots \\
A_{N1} & A_{N2} & A_{N3} & \cdots & A_{NN} - \lambda
\end{vmatrix} = 0 \tag{D-15}$$

This is an $N$th-order equation in $\lambda$; consequently, it has $N$ roots, real or complex,
distinct or identical. It is easy to show, by performing an arbitrary change of basis,
that the characteristic equation is independent of the representation chosen.
Therefore, the eigenvalues of an operator are the roots of its characteristic equation.
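
For $N = 2$ the characteristic equation can be solved in closed form, which makes a quick numerical check possible. The sketch below is illustrative only: the function name `eigenvalues_2x2` and the sample matrix are assumptions made here, and plain Python is used rather than a linear-algebra library to keep the example self-contained.

```python
import cmath

def eigenvalues_2x2(m):
    """Roots of Det[M - lambda I] = 0 for a 2 x 2 matrix M.

    Expanding the determinant gives
    lambda^2 - (trace) lambda + (determinant) = 0.
    """
    (a, b), (c, d) = m
    trace = a + d
    det = a * d - b * c
    disc = cmath.sqrt(trace * trace - 4 * det)
    return (trace + disc) / 2, (trace - disc) / 2

# A real but non-Hermitian matrix: its two roots are not real.
sample = [[0, -1], [1, 0]]
roots = eigenvalues_2x2(sample)
```

For this sample matrix the trace is 0 and the determinant is 1, so the equation reads $\lambda^2 + 1 = 0$ and the two roots are $\pm i$. This illustrates that the roots of (D-14) need not be real when the operator is not Hermitian.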

β. Determination of the eigenvectors

Now let us choose an eigenvalue $\lambda_0$, a solution of the characteristic
equation (D-14), and let us look for the corresponding eigenvectors. We are going to
distinguish between two cases:

(i) First, assume that $\lambda_0$ is a simple root of the characteristic equation. We
can then show that the system (D-13), when $\lambda = \lambda_0$, consists of $(N - 1)$
independent equations, the $N$th one following from the preceding ones and hence
redundant. But we have $N$ unknowns; there is therefore an infinite number of
solutions, but all the $c_j$ can be determined in a unique way in terms of one of them,
say $c_1$. If we fix $c_1$, we obtain for the $(N - 1)$ other $c_j$ a system of $(N - 1)$ linear,
inhomogeneous equations (the "right-hand side" of each equation is the term in $c_1$)
with a non-zero determinant [the $(N - 1)$ equations are independent]. The solution
of this system is of the form:

$$c_j = \alpha_j^0\, c_1 \tag{D-16}$$

since the initial system (D-13) is linear and homogeneous. $\alpha_1^0$ is, of course, equal
to 1 by definition, and the $(N - 1)$ coefficients $\alpha_j^0$ for $j \neq 1$ are determined from
the matrix elements $A_{ij}$ and $\lambda_0$. The eigenvectors associated with $\lambda_0$ differ only
by the value chosen for $c_1$. They are therefore all given by:

$$|\psi_0(c_1)\rangle = \sum_j \alpha_j^0\, c_1 |u_j\rangle = c_1 |\psi_0\rangle \tag{D-17}$$

with:

$$|\psi_0\rangle = \sum_j \alpha_j^0 |u_j\rangle \tag{D-18}$$

Therefore, when $\lambda_0$ is a simple root of the characteristic equation, only one
eigenvector corresponds to it (to within a constant factor): it is a non-degenerate
eigenvalue.

(ii) When $\lambda_0$ is a multiple root of order $q > 1$ of the characteristic equation,
there are two possibilities:

- in general, when $\lambda = \lambda_0$, the system (D-13) is still composed of $(N - 1)$
independent equations. Only one eigenvector then corresponds to the eigenvalue $\lambda_0$.
The operator $A$ cannot be diagonalized in this case: the eigenvectors of $A$ are not
sufficiently numerous for one to be able to construct with them alone a basis of the
state space.

- nevertheless, when $\lambda = \lambda_0$, it may happen that the system (D-13) has
only $(N - p)$ independent equations (where $p$ is a number greater than 1 but not
larger than $q$). To the eigenvalue $\lambda_0$ there then corresponds an eigensubspace of
dimension $p$, and $\lambda_0$ is a $p$-fold degenerate eigenvalue. Let us assume, for example,
that, for $\lambda = \lambda_0$, (D-13) is composed of $(N - 2)$ linearly independent equations.
These equations enable us to calculate the coefficients $c_j$ in terms of any two of
them, for example $c_1$ and $c_2$:

$$c_j = \beta_j^0\, c_1 + \gamma_j^0\, c_2 \tag{D-19}$$

(obviously: $\beta_1^0 = \gamma_2^0 = 1$; $\gamma_1^0 = \beta_2^0 = 0$). All the eigenvectors associated with $\lambda_0$
are then of the form:

$$|\psi_0(c_1, c_2)\rangle = c_1 |\psi_0^1\rangle + c_2 |\psi_0^2\rangle \tag{D-20}$$

with:

$$|\psi_0^1\rangle = \sum_j \beta_j^0 |u_j\rangle, \qquad |\psi_0^2\rangle = \sum_j \gamma_j^0 |u_j\rangle \tag{D-21}$$

The vectors $|\psi_0(c_1, c_2)\rangle$ do indeed constitute a two-dimensional vector space,
this being characteristic of a two-fold degenerate eigenvalue.
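
The two possibilities of case (ii) can be seen on the smallest non-trivial example, $N = 2$ with a double root ($q = 2$). The two matrices below are standard illustrations chosen here; they are not taken from the text.

```latex
\mathscr{A}_1 = \begin{pmatrix} \lambda_0 & 1 \\ 0 & \lambda_0 \end{pmatrix},
\qquad
\mathscr{A}_2 = \begin{pmatrix} \lambda_0 & 0 \\ 0 & \lambda_0 \end{pmatrix},
\qquad
\mathrm{Det}\left[\mathscr{A}_{1,2} - \lambda I\right] = (\lambda_0 - \lambda)^2 .

% System (D-13) at \lambda = \lambda_0:
%   for \mathscr{A}_1:  c_2 = 0  and  0 = 0
%       -> N - 1 = 1 independent equation, eigenvectors c_1 (1, 0): p = 1 < q
%   for \mathscr{A}_2:  0 = 0  and  0 = 0
%       -> N - 2 = 0 independent equations, every ket is an eigenvector: p = 2 = q
```

$\mathscr{A}_1$ is not Hermitian and cannot be diagonalized, whereas $\mathscr{A}_2$ (Hermitian when $\lambda_0$ is real) has as many independent eigenvectors as the multiplicity of its root.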

When an operator is Hermitian, it can be shown that the degree of degeneracy
$p$ of an eigenvalue $\lambda$ is always equal to the multiplicity $q$ of the corresponding
root in the characteristic equation. Since, in most cases, we shall be studying only
Hermitian operators, we shall only need to know the multiplicity of each root
of (D-14) to obtain immediately the dimension of the corresponding eigensubspace.
Thus, in a space of finite dimension $N$, a Hermitian operator always has $N$ linearly
independent eigenvectors (we shall see later that they can be chosen to be
orthonormal): this operator can therefore be diagonalized (§ D-2-b).


2. Observables 

a. PROPERTIES OF THE EIGENVALUES AND EIGENVECTORS 
OF A HERMITIAN OPERATOR 

We shall now consider the very important case in which the operator $A$ is
Hermitian:

$$A^\dagger = A \tag{D-22}$$

(i) The eigenvalues of a Hermitian operator are real

Taking the scalar product of the eigenvalue equation (D-1) by $|\psi\rangle$, we obtain:

$$\langle\psi|A|\psi\rangle = \lambda\langle\psi|\psi\rangle \tag{D-23}$$

But $\langle\psi|A|\psi\rangle$ is a real number if $A$ is Hermitian, as we see from:

$$\langle\psi|A|\psi\rangle^* = \langle\psi|A^\dagger|\psi\rangle = \langle\psi|A|\psi\rangle \tag{D-24}$$

where the last equation follows from hypothesis (D-22). Since $\langle\psi|A|\psi\rangle$ and
$\langle\psi|\psi\rangle$ are real, equation (D-23) implies that $\lambda$ must also be real.

If $A$ is Hermitian, we can, in (D-8), replace $A^\dagger$ by $A$ and $\lambda^*$ by $\lambda$, since we have
just shown that $\lambda$ is real. Thus we obtain:

$$\langle\psi|A = \lambda\langle\psi| \tag{D-25}$$

which shows that $\langle\psi|$ is also an eigenbra of $A$ with the real eigenvalue $\lambda$. Therefore,
whatever the ket $|\varphi\rangle$:

$$\langle\psi|A|\varphi\rangle = \lambda\langle\psi|\varphi\rangle \tag{D-26}$$

The Hermitian operator $A$ is said to act on the left in (D-26).

(ii) Two eigenvectors of a Hermitian operator corresponding to two different
eigenvalues are orthogonal.

Consider two eigenvectors $|\psi\rangle$ and $|\varphi\rangle$ of the Hermitian operator $A$:

$$A|\psi\rangle = \lambda|\psi\rangle \tag{D-27-a}$$

$$A|\varphi\rangle = \mu|\varphi\rangle \tag{D-27-b}$$

Since $A$ is Hermitian, (D-27-b) can be written in the form:

$$\langle\varphi|A = \mu\langle\varphi| \tag{D-28}$$

Then multiply (D-27-a) by $\langle\varphi|$ on the left and (D-28) by $|\psi\rangle$ on the right:

$$\langle\varphi|A|\psi\rangle = \lambda\langle\varphi|\psi\rangle \tag{D-29-a}$$

$$\langle\varphi|A|\psi\rangle = \mu\langle\varphi|\psi\rangle \tag{D-29-b}$$

Subtracting (D-29-b) from (D-29-a), we find:

$$(\lambda - \mu)\langle\varphi|\psi\rangle = 0 \tag{D-30}$$

Consequently, if $(\lambda - \mu) \neq 0$, $|\varphi\rangle$ and $|\psi\rangle$ are orthogonal.
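
Both properties (real eigenvalues, orthogonal eigenvectors) can be verified on a small Hermitian matrix. The matrix `H` and the helper names below are assumptions chosen for this sketch; the shortcut used in `eigenvector` relies on the off-diagonal element `H[0][1]` being non-zero.

```python
import math

# Hermitian 2 x 2 matrix chosen for illustration: H[i][j] == conj(H[j][i]).
H = [[2 + 0j, 1 - 1j],
     [1 + 1j, 3 + 0j]]

def inner(u, v):
    """Scalar product <u|v>, antilinear in the first argument."""
    return sum(a.conjugate() * b for a, b in zip(u, v))

def matvec(m, v):
    """Matrix-vector product m v."""
    return [sum(m[i][j] * v[j] for j in range(len(v))) for i in range(len(m))]

# Characteristic equation: lambda^2 - (trace) lambda + det = 0.
trace = (H[0][0] + H[1][1]).real
det = (H[0][0] * H[1][1] - H[0][1] * H[1][0]).real
disc = trace * trace - 4 * det           # non-negative for a Hermitian matrix
lam = (trace + math.sqrt(disc)) / 2      # larger root
mu = (trace - math.sqrt(disc)) / 2       # smaller root

def eigenvector(value):
    """Solve the first row of (H - value I) c = 0 (valid when H[0][1] != 0)."""
    return [H[0][1], value - H[0][0]]

psi = eigenvector(lam)
phi = eigenvector(mu)
overlap = inner(phi, psi)   # vanishes because lam != mu, as in (D-30)
```

The eigenvectors are left unnormalized here, since orthogonality does not depend on the normalization.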


b. DEFINITION OF AN OBSERVABLE 


When $\mathscr{E}$ is finite-dimensional, we have seen (§ D-1-b) that it is always possible
to form a basis with the eigenvectors of a Hermitian operator. When $\mathscr{E}$ is
infinite-dimensional, this is no longer necessarily the case. This is why it is useful to introduce
a new concept, that of an observable.

Consider a Hermitian operator $A$. For simplicity, we shall assume that the
set of its eigenvalues forms a discrete spectrum $\{a_n;\ n = 1, 2, \ldots\}$, and we shall
indicate later the modifications that must be made when all or part of this spectrum
is continuous. The degree of degeneracy of the eigenvalue $a_n$ will be denoted
by $g_n$ (if $g_n = 1$, $a_n$ is non-degenerate). We shall denote by $|\psi_n^i\rangle$ ($i = 1, 2, \ldots, g_n$)
$g_n$ linearly independent vectors chosen in the eigensubspace $\mathscr{E}_n$ of $a_n$:

$$A|\psi_n^i\rangle = a_n|\psi_n^i\rangle; \qquad i = 1, 2, \ldots, g_n \tag{D-31}$$

We have just shown that every vector belonging to $\mathscr{E}_n$ is orthogonal to every
vector of another subspace $\mathscr{E}_{n'}$, associated with $a_{n'} \neq a_n$; therefore:

$$\langle\psi_n^i|\psi_{n'}^j\rangle = 0 \quad \text{for } n \neq n' \text{ and arbitrary } i \text{ and } j \tag{D-32}$$

Inside each subspace $\mathscr{E}_n$, the $|\psi_n^i\rangle$ can always be chosen orthonormal, that is,
such that:

$$\langle\psi_n^i|\psi_n^j\rangle = \delta_{ij} \tag{D-33}$$

If such a choice is made, the result is an orthonormal system of eigenvectors
of $A$: the $|\psi_n^i\rangle$ satisfy the relations:

$$\langle\psi_n^i|\psi_{n'}^{i'}\rangle = \delta_{nn'}\,\delta_{ii'} \tag{D-34}$$

obtained by regrouping (D-32) and (D-33).

By definition, the Hermitian operator $A$ is an observable if this orthonormal
system of vectors forms a basis in the state space. This can be expressed by the
closure relation:

$$\sum_{n=1}^{\infty}\sum_{i=1}^{g_n} |\psi_n^i\rangle\langle\psi_n^i| = \mathbb{1} \tag{D-35}$$

COMMENTS:

(i) Since the $g_n$ vectors $|\psi_n^i\rangle$ ($i = 1, 2, \ldots, g_n$) which span the eigensubspace $\mathscr{E}_n$
of $a_n$ are orthonormal, the projector $P_n$ onto this subspace $\mathscr{E}_n$ can be
written (cf. § B-3-b-γ):

$$P_n = \sum_{i=1}^{g_n} |\psi_n^i\rangle\langle\psi_n^i| \tag{D-36-a}$$

The observable $A$ is then given by:

$$A = \sum_n a_n P_n \tag{D-36-b}$$

(it is easy to verify that the action of both sides of this equation on all the
kets $|\psi_n^i\rangle$ gives the same result).

(ii) Relation (D-35) can be generalized to include cases where the spectrum of eigenvalues
is continuous by using the rules given in table (II-3). For example, consider a Hermitian
operator whose spectrum is composed of a discrete part $\{a_n\}$ (degree of degeneracy $g_n$)
and a continuous part $a(\nu)$ (assumed to be non-degenerate):

$$A|\psi_n^i\rangle = a_n|\psi_n^i\rangle; \qquad n = 1, 2, \ldots;\quad i = 1, 2, \ldots, g_n \tag{D-37-a}$$

$$A|\psi_\nu\rangle = a(\nu)|\psi_\nu\rangle; \qquad \nu_1 < \nu < \nu_2 \tag{D-37-b}$$

These vectors can always be chosen in such a way that they form an "orthonormal"
system:

$$\begin{aligned}
\langle\psi_n^i|\psi_{n'}^{i'}\rangle &= \delta_{nn'}\,\delta_{ii'} \\
\langle\psi_\nu|\psi_{\nu'}\rangle &= \delta(\nu - \nu') \\
\langle\psi_n^i|\psi_\nu\rangle &= 0
\end{aligned} \tag{D-38}$$

$A$ will be said to be an observable if this system forms a basis, that is, if:

$$\sum_n \sum_{i=1}^{g_n} |\psi_n^i\rangle\langle\psi_n^i| + \int_{\nu_1}^{\nu_2} d\nu\, |\psi_\nu\rangle\langle\psi_\nu| = \mathbb{1} \tag{D-39}$$
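
In finite dimension, the closure relation (D-35) and the decomposition (D-36-b) can be checked directly. The sketch below assumes an illustrative $2 \times 2$ Hermitian matrix with non-degenerate eigenvalues 4 and 1 and hand-computed, unnormalized eigenvectors; none of these values come from the text.

```python
# Illustrative Hermitian matrix with eigenvalues 4 and 1 (both non-degenerate).
H = [[2 + 0j, 1 - 1j],
     [1 + 1j, 3 + 0j]]
spectrum = {
    4.0: [1 - 1j, 2 + 0j],    # unnormalized eigenvector for the eigenvalue 4
    1.0: [1 - 1j, -1 + 0j],   # unnormalized eigenvector for the eigenvalue 1
}

def projector(v):
    """P = |v><v| / <v|v>, i.e. (D-36-a) for a non-degenerate eigenvalue."""
    norm2 = sum(abs(c) ** 2 for c in v)
    return [[v[i] * v[j].conjugate() / norm2 for j in range(len(v))]
            for i in range(len(v))]

def add(m1, m2):
    """Element-wise sum of two matrices."""
    return [[x + y for x, y in zip(r1, r2)] for r1, r2 in zip(m1, m2)]

def scale(s, m):
    """Multiply every element of m by the number s."""
    return [[s * x for x in row] for row in m]

zero = [[0j, 0j], [0j, 0j]]
closure = zero    # accumulates sum_n P_n, expected to be the unit matrix
rebuilt = zero    # accumulates sum_n a_n P_n, expected to reproduce H
for a_n, v in spectrum.items():
    p_n = projector(v)
    closure = add(closure, p_n)
    rebuilt = add(rebuilt, scale(a_n, p_n))
```

For a degenerate eigenvalue, `projector` would have to sum over an orthonormal set of $g_n$ eigenvectors, as in (D-36-a).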


c. EXAMPLE: THE PROJECTOR $P_\psi$

Let us show that $P_\psi = |\psi\rangle\langle\psi|$ (with $\langle\psi|\psi\rangle = 1$) is an observable. We
have already pointed out (§ B-4-e) that it is Hermitian, and that its eigenvalues
are 1 and 0 (§ D-1-a); the first one is simple (associated eigenvector: $|\psi\rangle$), the
second one is infinitely degenerate (associated eigenvectors: all kets orthogonal
to $|\psi\rangle$).

Consider an arbitrary ket $|\varphi\rangle$ in the state space. It can always be written in
the form:

$$|\varphi\rangle = P_\psi|\varphi\rangle + (\mathbb{1} - P_\psi)|\varphi\rangle \tag{D-40}$$

$P_\psi|\varphi\rangle$ is an eigenket of $P_\psi$ with the eigenvalue 1. Indeed, since $P_\psi^2 = P_\psi$:

$$P_\psi\left(P_\psi|\varphi\rangle\right) = P_\psi^2|\varphi\rangle = P_\psi|\varphi\rangle \tag{D-41}$$

$(\mathbb{1} - P_\psi)|\varphi\rangle$ is also an eigenket of $P_\psi$, but with the eigenvalue 0, as we see from:

$$P_\psi(\mathbb{1} - P_\psi)|\varphi\rangle = (P_\psi - P_\psi^2)|\varphi\rangle = 0 \tag{D-42}$$

Every ket $|\varphi\rangle$ can thus be expanded on these eigenkets of $P_\psi$; therefore,
$P_\psi$ is an observable.

We shall see in § E-2 two other important examples of observables. 


3. Sets of commuting observables 
a. IMPORTANT THEOREMS 

α. Theorem I

If two operators $A$ and $B$ commute, and if $|\psi\rangle$ is an eigenvector of $A$, $B|\psi\rangle$ is
also an eigenvector of $A$, with the same eigenvalue.

We know that, if $|\psi\rangle$ is an eigenvector of $A$, we have:

$$A|\psi\rangle = a|\psi\rangle \tag{D-43}$$

Applying $B$ to both sides of this equation, we obtain:

$$BA|\psi\rangle = aB|\psi\rangle \tag{D-44}$$

Since we assumed that $A$ and $B$ commute, we also have, replacing $BA$ on the left-hand
side by $AB$:

$$A\left(B|\psi\rangle\right) = a\left(B|\psi\rangle\right) \tag{D-45}$$

This equation expresses the fact that $B|\psi\rangle$ is an eigenvector of $A$, with the
eigenvalue $a$; the theorem is therefore proved.

Two cases may then arise:

(i) If $a$ is a nondegenerate eigenvalue, all the eigenvectors associated with
it are by definition collinear, and $B|\psi\rangle$ is necessarily proportional to $|\psi\rangle$.
Therefore $|\psi\rangle$ is also an eigenvector of $B$.

(ii) If $a$ is a degenerate eigenvalue, it can only be said that $B|\psi\rangle$ belongs
to the eigensubspace $\mathscr{E}_a$ of $A$, corresponding to the eigenvalue $a$. Therefore,
for any $|\psi\rangle \in \mathscr{E}_a$, we have:

$$B|\psi\rangle \in \mathscr{E}_a \tag{D-46}$$

$\mathscr{E}_a$ is said to be globally invariant (or stable) under the action of $B$. Theorem I can
therefore be stated in another form:

Theorem I': If two operators $A$ and $B$ commute, every eigensubspace of $A$
is globally invariant under the action of $B$.

β. Theorem II

If two observables $A$ and $B$ commute, and if $|\psi_1\rangle$ and $|\psi_2\rangle$ are two
eigenvectors of $A$ with different eigenvalues, the matrix element $\langle\psi_1|B|\psi_2\rangle$ is zero.

If $|\psi_1\rangle$ and $|\psi_2\rangle$ are eigenvectors of $A$, we can write:

$$\begin{aligned}
A|\psi_1\rangle &= a_1|\psi_1\rangle \\
A|\psi_2\rangle &= a_2|\psi_2\rangle
\end{aligned} \tag{D-47}$$

According to theorem I, the fact that $A$ and $B$ commute means that $B|\psi_2\rangle$ is an
eigenvector of $A$, with the eigenvalue $a_2$. $B|\psi_2\rangle$ is therefore (cf. § D-2-a)
orthogonal to $|\psi_1\rangle$ (eigenvector of eigenvalue $a_1 \neq a_2$), which can be written:

$$\langle\psi_1|B|\psi_2\rangle = 0 \tag{D-48}$$

The theorem is therefore proved. Another proof can be given, which does not
involve theorem I: since the operator $[A, B]$ is zero, we have:

$$\langle\psi_1|(AB - BA)|\psi_2\rangle = 0 \tag{D-49}$$

Using (D-47) and the Hermiticity of $A$ [cf. equation (D-25)], we obtain:

$$\begin{aligned}
\langle\psi_1|AB|\psi_2\rangle &= a_1\langle\psi_1|B|\psi_2\rangle \\
\langle\psi_1|BA|\psi_2\rangle &= a_2\langle\psi_1|B|\psi_2\rangle
\end{aligned} \tag{D-50}$$

and (D-49) can be rewritten in the form:

$$(a_1 - a_2)\langle\psi_1|B|\psi_2\rangle = 0 \tag{D-51}$$

Since, by hypothesis, $(a_1 - a_2)$ is not zero, we can deduce (D-48) from this.


γ. Theorem III (fundamental)

If two observables $A$ and $B$ commute, one can construct an orthonormal basis
of the state space with eigenvectors common to $A$ and $B$.

Consider two commuting observables, $A$ and $B$. In order to simplify the
notation, we shall assume that their spectra are entirely discrete. Since $A$ is an
observable, there exists at least one orthonormal system of eigenvectors of $A$ which
forms a basis in the state space. We shall denote these vectors by $|u_n^i\rangle$:

$$A|u_n^i\rangle = a_n|u_n^i\rangle; \qquad n = 1, 2, \ldots;\quad i = 1, 2, \ldots, g_n \tag{D-52}$$

$g_n$ is the degree of degeneracy of the eigenvalue $a_n$, that is, the dimension of the
corresponding eigensubspace $\mathscr{E}_n$. We have:

$$\langle u_n^i|u_{n'}^{i'}\rangle = \delta_{nn'}\,\delta_{ii'} \tag{D-53}$$

What does the matrix look like which represents $B$ in the $\{|u_n^i\rangle\}$ basis? We
know (cf. theorem II) that the matrix elements $\langle u_n^i|B|u_{n'}^{i'}\rangle$ are zero when $n \neq n'$
(on the other hand, we can say nothing a priori about what happens for $n = n'$
and $i \neq i'$). Let us arrange the basis vectors $|u_n^i\rangle$ in the order:

$$|u_1^1\rangle, |u_1^2\rangle, \ldots, |u_1^{g_1}\rangle;\ |u_2^1\rangle, \ldots, |u_2^{g_2}\rangle;\ |u_3^1\rangle, \ldots$$

We then obtain for $B$ a "block-diagonal" matrix, that is, of the form:

$$\begin{pmatrix}
\boxed{g_1 \times g_1} & 0 & 0 & \cdots \\
0 & \boxed{g_2 \times g_2} & 0 & \cdots \\
0 & 0 & \boxed{g_3 \times g_3} & \cdots \\
\vdots & \vdots & \vdots & \ddots
\end{pmatrix} \tag{D-54}$$

(only the boxed diagonal blocks, of sizes $g_n \times g_n$, contain non-zero matrix elements). The fact that the
eigensubspaces $\mathscr{E}_n$ are globally invariant under the action of $B$ (cf. § α) is evident
from this matrix.

Two cases can then arise: 

(i) When $a_n$ is a nondegenerate eigenvalue of $A$, there exists only one
eigenvector $|u_n\rangle$ of $A$, of eigenvalue $a_n$ (the index $i$ in $|u_n^i\rangle$ is then unnecessary): the
dimension $g_n$ of $\mathscr{E}_n$ is then equal to 1. In the matrix (D-54), the corresponding "block"
then reduces to a $1 \times 1$ matrix, that is, to a simple number. In the column associated
with $|u_n\rangle$, all the other matrix elements are zero. This expresses the fact (cf. § α-i)
that $|u_n\rangle$ is an eigenvector common to $A$ and $B$.

(ii) When $a_n$ is a degenerate eigenvalue of $A$ ($g_n > 1$), the "block" which
represents $B$ in $\mathscr{E}_n$ is not, in general, diagonal: the $|u_n^i\rangle$ are not, in general,
eigenvectors of $B$.

It can be seen, nevertheless, that, since the action of $A$ on each of the $g_n$
vectors $|u_n^i\rangle$ reduces to a simple multiplication by $a_n$, the matrix representing
the restriction of $A$ to within $\mathscr{E}_n$ is equal to $a_n I$ (where $I$ is the $g_n \times g_n$ unit matrix).
This expresses the fact that an arbitrary ket of $\mathscr{E}_n$ is an eigenvector of $A$ with the
eigenvalue $a_n$. The choice, in $\mathscr{E}_n$, of a basis such as $\{|u_n^i\rangle;\ i = 1, 2, \ldots, g_n\}$ is
therefore arbitrary. Whatever this basis, the matrix representing $A$ in $\mathscr{E}_n$ is always
diagonal and equal to $a_n I$. We shall use this property to obtain a basis of $\mathscr{E}_n$ composed
of vectors which are also eigenvectors of $B$.

The matrix representing $B$ in $\mathscr{E}_n$, when the basis chosen is
$\{|u_n^i\rangle;\ i = 1, 2, \ldots, g_n\}$, has for its elements:

$$\beta_{ij}^{(n)} = \langle u_n^i|B|u_n^j\rangle \tag{D-55}$$

This matrix is Hermitian ($\beta_{ji}^{(n)*} = \beta_{ij}^{(n)}$), since $B$ is a Hermitian operator. It is therefore
diagonalizable, that is, one can find in $\mathscr{E}_n$ a new basis $\{|v_n^i\rangle;\ i = 1, 2, \ldots, g_n\}$ in
which $B$ is represented by a diagonal matrix:

$$\langle v_n^i|B|v_n^j\rangle = \beta_i^{(n)}\,\delta_{ij} \tag{D-56}$$



This means that the new basis vectors in $\mathscr{E}_n$ are eigenvectors of $B$:

$$B|v_n^i\rangle = \beta_i^{(n)}|v_n^i\rangle \tag{D-57}$$

As we saw above, these vectors are automatically eigenvectors of $A$ with an
eigenvalue $a_n$ since they belong to $\mathscr{E}_n$. Let us stress the fact that the eigenvectors of $A$
associated with degenerate eigenvalues are not necessarily eigenvectors of $B$. What
we have just shown is that it is always possible to choose, in every eigensubspace
of $A$, a basis of eigenvectors common to $A$ and $B$.

If we perform this operation in all the subspaces $\mathscr{E}_n$, we obtain a basis of $\mathscr{E}$,
formed by eigenvectors common to $A$ and $B$. The theorem is therefore proved.
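
The construction used in the proof can be followed on a three-dimensional example. The matrices `A` and `B` below are illustrative assumptions: `A` has a two-fold degenerate eigenvalue, and `B` is block-diagonal in the eigenbasis of `A`, with a $2 \times 2$ block that has been diagonalized by hand.

```python
import math

A = [[1, 0, 0],
     [0, 1, 0],
     [0, 0, 2]]          # the eigenvalue 1 is two-fold degenerate
B = [[0, 1, 0],
     [1, 0, 0],
     [0, 0, 5]]          # block-diagonal in the eigenbasis of A

def matmul(m1, m2):
    """Product of two square matrices of the same size."""
    n = len(m1)
    return [[sum(m1[i][k] * m2[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def matvec(m, v):
    """Matrix-vector product m v."""
    return [sum(m[i][j] * v[j] for j in range(len(v))) for i in range(len(m))]

def is_eigenvector(m, v, value, tol=1e-12):
    """True if m v equals value * v to within the tolerance."""
    return all(abs(x - value * c) < tol for x, c in zip(matvec(m, v), v))

commute = matmul(A, B) == matmul(B, A)

s = 1 / math.sqrt(2)
# Diagonalizing the 2 x 2 block of B inside the degenerate eigensubspace of A
# gives the common eigenvectors below, listed with their eigenvalue pairs.
common_basis = [
    ((1, 1), [s, s, 0.0]),
    ((1, -1), [s, -s, 0.0]),
    ((2, 5), [0.0, 0.0, 1.0]),
]
all_common = all(is_eigenvector(A, v, a) and is_eigenvector(B, v, b)
                 for (a, b), v in common_basis)

# The original basis vector (1, 0, 0) is an eigenvector of A but not of B.
e1_is_eigenvector_of_B = any(is_eigenvector(B, [1.0, 0.0, 0.0], b)
                             for b in (1, -1, 5))
```

The last check illustrates the remark made above: inside a degenerate eigensubspace of `A`, an arbitrary eigenvector of `A` is not necessarily an eigenvector of `B`.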


COMMENTS:

(i) From now on, we shall denote by $|u_{n,p}^i\rangle$ the eigenvectors common to $A$ and $B$:

$$\begin{aligned}
A|u_{n,p}^i\rangle &= a_n|u_{n,p}^i\rangle \\
B|u_{n,p}^i\rangle &= b_p|u_{n,p}^i\rangle
\end{aligned} \tag{D-58}$$

The indices $n$ and $p$ which appear in $|u_{n,p}^i\rangle$ enable us to specify the eigenvalues $a_n$
and $b_p$ of $A$ and $B$. The additional index $i$ will be used, where needed, to distinguish
between the different basis vectors which correspond to the same eigenvalues $a_n$
and $b_p$ (§ b below).

(ii) The converse of theorem III is very simple to prove: if there exists a basis
of eigenvectors common to $A$ and $B$, these two observables commute.

From (D-58), it is easy to deduce:

$$\begin{aligned}
AB|u_{n,p}^i\rangle &= b_p A|u_{n,p}^i\rangle = b_p a_n|u_{n,p}^i\rangle \\
BA|u_{n,p}^i\rangle &= a_n B|u_{n,p}^i\rangle = a_n b_p|u_{n,p}^i\rangle
\end{aligned} \tag{D-59}$$

and, subtracting these equations:

$$[A, B]\,|u_{n,p}^i\rangle = 0 \tag{D-60}$$

This relation is valid for all $i$, $n$ and $p$. Since, by hypothesis, the vectors $|u_{n,p}^i\rangle$
form a basis, (D-60) entails $[A, B] = 0$.

(iii) We shall occasionally solve the eigenvalue equation of an observable $C$ such
that:

$$C = A + B \quad \text{with} \quad [A, B] = 0 \tag{D-61}$$

where $A$ and $B$ are also observables.

When one has found a basis $\{|u_{n,p}^i\rangle\}$ of eigenvectors common to $A$
and $B$, the problem is solved, since we see immediately that $|u_{n,p}^i\rangle$ is also an
eigenvector of $C$, with an eigenvalue $a_n + b_p$. The fact that $\{|u_{n,p}^i\rangle\}$
constitutes a basis is obviously essential: this allows us, for example, to show simply
that all the eigenvalues of $C$ are of the form $a_n + b_p$.
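
The argument of comment (iii) can be spelled out as follows. This is a sketch of the reasoning; the expansion coefficients $c_{n,p}^i$ and the symbols $|\chi\rangle$ and $c$ are notation introduced here for the purpose.

```latex
% Direct part: each common eigenvector is an eigenvector of C.
C\,|u_{n,p}^i\rangle = (A + B)\,|u_{n,p}^i\rangle = (a_n + b_p)\,|u_{n,p}^i\rangle

% Converse: let C|\chi\rangle = c|\chi\rangle and expand |\chi\rangle on the basis,
%   |\chi\rangle = \sum_{n,p,i} c_{n,p}^i \,|u_{n,p}^i\rangle .
% Then
\sum_{n,p,i} \left(a_n + b_p - c\right) c_{n,p}^i \,|u_{n,p}^i\rangle = 0
\quad\Longrightarrow\quad
\left(a_n + b_p - c\right) c_{n,p}^i = 0 \ \text{ for all } n, p, i .
```

Since $|\chi\rangle$ is not the null ket, at least one coefficient $c_{n,p}^i$ is non-zero, and for it $c = a_n + b_p$. The second step uses the linear independence of the $|u_{n,p}^i\rangle$, which is where the basis property enters.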

b. COMPLETE* SETS OF COMMUTING OBSERVABLES (C.S.C.O.)**

Consider an observable $A$ and a basis of $\mathscr{E}$ composed of eigenvectors $|u_n^i\rangle$
of $A$. If none of the eigenvalues of $A$ is degenerate, the various basis vectors of $\mathscr{E}$
can be labelled by the eigenvalue $a_n$ (the index $i$ in $|u_n^i\rangle$ being in this case
unnecessary). All the eigensubspaces $\mathscr{E}_n$ are then one-dimensional. Therefore, specifying
the eigenvalue determines in a unique way the corresponding eigenvector (to
within a constant factor). In other words, there exists only one basis of $\mathscr{E}$ formed
by the eigenvectors of $A$ (we shall not consider here as distinct two bases whose
vectors are proportional). It is then said that the observable $A$ constitutes, by
itself, a C.S.C.O.

If, on the other hand, one or several eigenvalues of $A$ are degenerate, the
situation is different. Specifying $a_n$ is no longer always sufficient to characterize
a basis vector, since there correspond several independent vectors to degenerate
eigenvalues. In this case, the basis of eigenvectors of $A$ is obviously not unique. One can
choose any basis inside each of the eigensubspaces $\mathscr{E}_n$ of dimension greater than 1.

Let us then choose another observable $B$ which commutes with $A$, and let us
construct an orthonormal basis of eigenvectors common to $A$ and $B$. By definition,
$A$ and $B$ form a C.S.C.O. if this basis is unique (to within a phase factor for each of
the basis vectors), that is, if, to each of the possible pairs of eigenvalues $\{a_n, b_p\}$,
there corresponds only one basis vector.


COMMENT:

In § a, we constructed a basis of eigenvectors common to $A$ and $B$ by solving
the eigenvalue equation of $B$ inside each eigensubspace $\mathscr{E}_n$. For $A$ and $B$ to constitute
a C.S.C.O., it is necessary and sufficient that, inside each of these subspaces, all the $g_n$
eigenvalues of $B$ be distinct. Since all the vectors of $\mathscr{E}_n$ correspond to the same
eigenvalue $a_n$ of $A$, the $g_n$ vectors $|v_n^i\rangle$ can then be distinguished by the eigenvalue $\beta_i^{(n)}$
of $B$ which is associated with them. Note that it is not necessary that all the eigenvalues of $B$
be non-degenerate. Vectors $|v_n^i\rangle$ belonging to two distinct subspaces $\mathscr{E}_n$ can have the
same eigenvalue for $B$. Moreover, if all the eigenvalues of $B$ were non-degenerate, $B$ alone
would constitute a C.S.C.O.

* The word "complete" is used here in a sense which is totally unrelated to those referred to in
the note of § A-2-a, p. 97. This use of the word "complete" is customary in quantum mechanics.

** To have a good understanding of the important concepts introduced in this section, the
reader should apply them to a concrete example such as the one discussed in complement $H_{II}$ (solved
exercises 11 and 12).

If, for at least one of the possible pairs $\{a_n, b_p\}$, there exist several
independent vectors which are eigenvectors of $A$ and $B$ with these eigenvalues,
the set $\{A, B\}$ is not complete. Let us add to it, then, a third observable $C$,
which commutes with both $A$ and $B$. We can then use the same argument
as in § a above, generalizing it in the following way. When to a pair $\{a_n, b_p\}$
there corresponds only one vector, this vector is necessarily an eigenvector of $C$.
If there are several vectors, they form an eigensubspace $\mathscr{E}_{n,p}$, in which it is possible
to choose a basis formed by vectors which are also eigenvectors of $C$. One can
thus construct, in the state space, an orthonormal basis formed by eigenvectors
common to $A$, $B$ and $C$. $A$, $B$ and $C$ form a C.S.C.O. if this basis is unique
(to within multiplicative factors). Specifying a possible set of eigenvalues
$\{a_n, b_p, c_r\}$ of $A$, $B$, $C$ then characterizes only one of the vectors of this basis.
If this is not the case, one adds to $A$, $B$, $C$ an observable $D$ which commutes
with each of these three operators, and so on. In general, we are thus led to
the following:

By definition, a set of observables A, B, C... is called a complete set of 
commuting observables if 

(i) all the observables A, B, C... commute by pairs,

(ii) specifying the eigenvalues of all the operators A, B, C... determines 
a unique (to within a multiplicative factor) common eigenvector. 

An equivalent way of saying this is the following: 

A set of observables A, B, C... is a complete set of commuting observables 
if there exists a unique orthonormal basis of common eigenvectors (to within 
phase factors). 

C.S.C.O.’s play an important role in quantum mechanics. We shall see 
numerous examples of them (see, in particular, § E-2-d). 
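
Once a basis of common eigenvectors is known, condition (ii) of the definition amounts to checking that no set of eigenvalues is repeated. The sketch below assumes that such a basis has already been constructed and is described only by its eigenvalue labels; the function name `is_complete` and the numerical labels are hypothetical.

```python
from collections import Counter

def is_complete(labels):
    """True if every tuple of eigenvalues labels exactly one basis vector."""
    return all(count == 1 for count in Counter(labels).values())

# Eigenvalue pairs (a_n, b_p) of a common eigenbasis, one tuple per basis vector
# (hypothetical values chosen for illustration).
labels_ab = [(1, 1), (1, -1), (2, 1)]
a_alone = [(a,) for a, _ in labels_ab]   # A by itself: the value 1 appears twice
b_alone = [(b,) for _, b in labels_ab]   # B by itself: the value 1 appears twice
```

With these labels neither `A` nor `B` alone is complete, while the pair is. Both operators have a degenerate eigenvalue, which shows that a C.S.C.O. does not require any single observable to be non-degenerate.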

COMMENTS:

(i) If $\{A, B\}$ is a C.S.C.O., another C.S.C.O. can be obtained by adding to it
any observable $C$, on the condition, of course, that it commutes with $A$
and $B$. However, it is generally understood that one is confined to
"minimal" sets, that is, those which cease to be complete when any
one of the observables is omitted.

(ii) Let $\{A, B, C, \ldots\}$ be a complete set of commuting observables. Since the
specification of the eigenvalues $a_n, b_p, c_r, \ldots$ determines a ket of the
corresponding basis (to within a constant factor), this ket is sometimes denoted
by $|a_n, b_p, c_r, \ldots\rangle$.

(iii) For a given physical system, there exist several complete sets of commuting
observables. We shall see a particular example of this in § E-2-d.


E. TWO IMPORTANT EXAMPLES OF REPRESENTATIONS 
AND OBSERVABLES 


In this section, we shall return to the space $\mathscr{F}$ of wave functions of
a particle, or, more exactly, to the state space $\mathscr{E}_{\mathbf{r}}$ which is associated with
it and which we shall define in the following way. Let there correspond to
every wave function $\psi(\mathbf{r})$ a ket $|\psi\rangle$ belonging to $\mathscr{E}_{\mathbf{r}}$; this correspondence is
linear. Moreover, the scalar product of two kets coincides with that of the
functions which are associated with them:

$$\langle\varphi|\psi\rangle = \int d^3r\, \varphi^*(\mathbf{r})\,\psi(\mathbf{r}) \tag{E-1}$$

$\mathscr{E}_{\mathbf{r}}$ is thus the state space of a (spinless) particle.

We are going to define and study, in this space, two representations and
two operators which are particularly important. In chapter III we shall associate
them with the position and the momentum of the particle under consideration.
They will enable us, moreover, to apply and illustrate the concepts which we have
introduced in the preceding sections.


1. The $\{|\mathbf{r}\rangle\}$ and $\{|\mathbf{p}\rangle\}$ representations

a. DEFINITION

In §§ A-3-a and A-3-b, we introduced two particular "bases" of $\mathscr{F}$:
$\{\xi_{\mathbf{r}_0}(\mathbf{r})\}$ and $\{v_{\mathbf{p}_0}(\mathbf{r})\}$. They are not composed of functions belonging to $\mathscr{F}$:

$$\xi_{\mathbf{r}_0}(\mathbf{r}) = \delta(\mathbf{r} - \mathbf{r}_0) \tag{E-2-a}$$

$$v_{\mathbf{p}_0}(\mathbf{r}) = (2\pi\hbar)^{-3/2}\, e^{\frac{i}{\hbar}\mathbf{p}_0\cdot\mathbf{r}} \tag{E-2-b}$$

However, every sufficiently regular square-integrable function can be expanded
in one or the other of these "bases".

This is why we shall remove the quotation marks and associate a ket with
each of the functions of these bases (cf. § B-2-c). The ket associated with
$\xi_{\mathbf{r}_0}(\mathbf{r})$ will be denoted simply by $|\mathbf{r}_0\rangle$, and that associated with $v_{\mathbf{p}_0}(\mathbf{r})$, by $|\mathbf{p}_0\rangle$:

$$\xi_{\mathbf{r}_0}(\mathbf{r}) \Longleftrightarrow |\mathbf{r}_0\rangle \tag{E-3-a}$$

$$v_{\mathbf{p}_0}(\mathbf{r}) \Longleftrightarrow |\mathbf{p}_0\rangle \tag{E-3-b}$$

Using the bases $\{\xi_{\mathbf{r}_0}(\mathbf{r})\}$ and $\{v_{\mathbf{p}_0}(\mathbf{r})\}$ of $\mathscr{F}$, we thus define in $\mathscr{E}_{\mathbf{r}}$ two
representations: the $\{|\mathbf{r}_0\rangle\}$ representation and the $\{|\mathbf{p}_0\rangle\}$ representation.
A basis vector of the first one is characterized by three "continuous indices"
$x_0$, $y_0$ and $z_0$, which are the coordinates of a point in three-dimensional space;
for the second, the three indices are also the components of an ordinary
vector.


b. ORTHONORMALIZATION AND CLOSURE RELATIONS 

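Let us calculate $\langle\mathbf{r}_0|\mathbf{r}_0'\rangle$. Using the definition of the scalar product in
$\mathscr{E}_{\mathbf{r}}$:

$$\langle\mathbf{r}_0|\mathbf{r}_0'\rangle = \int d^3r\, \xi^*_{\mathbf{r}_0}(\mathbf{r})\,\xi_{\mathbf{r}_0'}(\mathbf{r}) = \delta(\mathbf{r}_0 - \mathbf{r}_0') \tag{E-4-a}$$

where relation (A-55) has been used. In the same way:

$$\langle\mathbf{p}_0|\mathbf{p}_0'\rangle = \int d^3r\, v^*_{\mathbf{p}_0}(\mathbf{r})\,v_{\mathbf{p}_0'}(\mathbf{r}) = \delta(\mathbf{p}_0 - \mathbf{p}_0') \tag{E-4-b}$$

using (A-47). The bases which we have just defined are therefore orthonormal
in the extended sense.

The fact that the set of the $|\mathbf{r}_0\rangle$ or that of the $|\mathbf{p}_0\rangle$ constitutes a
basis in $\mathscr{E}_{\mathbf{r}}$ can be expressed by a closure relation in $\mathscr{E}_{\mathbf{r}}$. This is written in an
analogous manner to (C-10), integrating here, however, over three indices instead
of one.

We therefore have the fundamental relations:

$$\begin{aligned}
\langle\mathbf{r}_0|\mathbf{r}_0'\rangle &= \delta(\mathbf{r}_0 - \mathbf{r}_0') &\text{(a)} \qquad \langle\mathbf{p}_0|\mathbf{p}_0'\rangle &= \delta(\mathbf{p}_0 - \mathbf{p}_0') &\text{(c)} \\
\int d^3r_0\, |\mathbf{r}_0\rangle\langle\mathbf{r}_0| &= \mathbb{1} &\text{(b)} \qquad \int d^3p_0\, |\mathbf{p}_0\rangle\langle\mathbf{p}_0| &= \mathbb{1} &\text{(d)}
\end{aligned} \tag{E-5}$$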

c. COMPONENTS OF A KET 

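Consider an arbitrary ket $|\psi\rangle$, corresponding to the wave function $\psi(\mathbf{r})$.
The preceding closure relations enable us to write it in either of these two
forms:

$$|\psi\rangle = \int d^3r_0\, |\mathbf{r}_0\rangle\langle\mathbf{r}_0|\psi\rangle \tag{E-6-a}$$

$$|\psi\rangle = \int d^3p_0\, |\mathbf{p}_0\rangle\langle\mathbf{p}_0|\psi\rangle \tag{E-6-b}$$

The coefficients $\langle\mathbf{r}_0|\psi\rangle$ and $\langle\mathbf{p}_0|\psi\rangle$ can be calculated using the formulas:

$$\langle\mathbf{r}_0|\psi\rangle = \int d^3r\, \xi^*_{\mathbf{r}_0}(\mathbf{r})\,\psi(\mathbf{r}) \tag{E-7-a}$$

$$\langle\mathbf{p}_0|\psi\rangle = \int d^3r\, v^*_{\mathbf{p}_0}(\mathbf{r})\,\psi(\mathbf{r}) \tag{E-7-b}$$

We then find:

$$\langle\mathbf{r}_0|\psi\rangle = \psi(\mathbf{r}_0) \tag{E-8-a}$$

$$\langle\mathbf{p}_0|\psi\rangle = \bar\psi(\mathbf{p}_0) \tag{E-8-b}$$

where $\bar\psi(\mathbf{p})$ is the Fourier transform of $\psi(\mathbf{r})$.

The value $\psi(\mathbf{r}_0)$ of the wave function at the point $\mathbf{r}_0$ is thus shown to be
the component of the ket $|\psi\rangle$ on the basis vector $|\mathbf{r}_0\rangle$ of the $\{|\mathbf{r}_0\rangle\}$
representation. The "wave function in momentum space" $\bar\psi(\mathbf{p})$ can be interpreted
analogously. The possibility of characterizing $|\psi\rangle$ by $\psi(\mathbf{r})$ is thus simply a
special case of the results of § C-3-a.

For example, for $|\psi\rangle = |\mathbf{p}_0\rangle$, formula (E-8-a) gives:

$$\langle\mathbf{r}_0|\mathbf{p}_0\rangle = v_{\mathbf{p}_0}(\mathbf{r}_0) = (2\pi\hbar)^{-3/2}\, e^{\frac{i}{\hbar}\mathbf{p}_0\cdot\mathbf{r}_0} \tag{E-9}$$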
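
For $|\psi\rangle = |\mathbf{r}_0'\rangle$, the result is indeed in agreement with the orthonormalization
relation (E-5-a):

$$\langle\mathbf{r}_0|\mathbf{r}_0'\rangle = \xi_{\mathbf{r}_0'}(\mathbf{r}_0) = \delta(\mathbf{r}_0 - \mathbf{r}_0') \tag{E-10}$$

Now that we have reinterpreted the wave function $\psi(\mathbf{r})$ and its Fourier
transform $\bar\psi(\mathbf{p})$, we shall denote the basis vectors of the two representations
we are studying here by $|\mathbf{r}\rangle$ and $|\mathbf{p}\rangle$, instead of $|\mathbf{r}_0\rangle$ and $|\mathbf{p}_0\rangle$. Formulas (E-8)
can then be written:

$$\langle\mathbf{r}|\psi\rangle = \psi(\mathbf{r}) \tag{E-8-a}$$

$$\langle\mathbf{p}|\psi\rangle = \bar\psi(\mathbf{p}) \tag{E-8-b}$$

and the orthonormalization and closure relations (E-5) become:

$$\begin{aligned}
\langle\mathbf{r}|\mathbf{r}'\rangle &= \delta(\mathbf{r} - \mathbf{r}') &\text{(a)} \qquad \langle\mathbf{p}|\mathbf{p}'\rangle &= \delta(\mathbf{p} - \mathbf{p}') &\text{(c)} \\
\int d^3r\, |\mathbf{r}\rangle\langle\mathbf{r}| &= \mathbb{1} &\text{(b)} \qquad \int d^3p\, |\mathbf{p}\rangle\langle\mathbf{p}| &= \mathbb{1} &\text{(d)}
\end{aligned} \tag{E-5}$$

Of course, $\mathbf{r}$ and $\mathbf{p}$ are still considered to represent two sets of continuous
indices, $\{x, y, z\}$ and $\{p_x, p_y, p_z\}$, which fix the basis kets of the $\{|\mathbf{r}\rangle\}$
and $\{|\mathbf{p}\rangle\}$ representations respectively.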

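Now let $\{u_i(\mathbf{r})\}$ be an orthonormal basis of $\mathscr{F}$. With each $u_i(\mathbf{r})$ is associated a ket $|u_i\rangle$
of $\mathscr{E}_{\mathbf{r}}$. The set $\{|u_i\rangle\}$ forms an orthonormal basis in $\mathscr{E}_{\mathbf{r}}$; it therefore satisfies the closure
relation:

$$\sum_i |u_i\rangle\langle u_i| = \mathbb{1} \tag{E-11}$$

Evaluate the matrix element of both sides of (E-11) between $|\mathbf{r}\rangle$ and $|\mathbf{r}'\rangle$:

$$\sum_i \langle\mathbf{r}|u_i\rangle\langle u_i|\mathbf{r}'\rangle = \langle\mathbf{r}|\mathbb{1}|\mathbf{r}'\rangle = \langle\mathbf{r}|\mathbf{r}'\rangle \tag{E-12}$$

According to (E-8-a) and (E-5-a), this relation can be written:

$$\sum_i u_i(\mathbf{r})\,u_i^*(\mathbf{r}') = \delta(\mathbf{r} - \mathbf{r}') \tag{E-13}$$

The closure relation for the $\{u_i(\mathbf{r})\}$ [formula (A-32)] is therefore simply the expression in
the $\{|\mathbf{r}\rangle\}$ representation of the vectorial closure relation (E-11).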

d. THE SCALAR PRODUCT OF TWO VECTORS 

We have defined the scalar product of two kets of $\mathcal{E}_{\mathbf{r}}$ as being equal to
that of the associated wave functions in $\mathcal{F}$ [equation (E-1)]. In light of the
discussion in § c, this definition appears simply as a special case of formula (C-21).
(E-1) can, in fact, be derived by inserting the closure relation (E-5-b) between
$\langle \varphi |$ and $| \psi \rangle$:

$$\langle \varphi | \psi \rangle = \int d^3r \, \langle \varphi | \mathbf{r} \rangle \langle \mathbf{r} | \psi \rangle \tag{E-14}$$

and by interpreting the components $\langle \mathbf{r} | \psi \rangle$ and $\langle \mathbf{r} | \varphi \rangle$ as in (E-8-a).

If we place ourselves in the $\{ | \mathbf{p} \rangle \}$ representation, a well-known property
of the Fourier transform is demonstrated (appendix I, § 2-c):

$$\langle \varphi | \psi \rangle = \int d^3p \, \langle \varphi | \mathbf{p} \rangle \langle \mathbf{p} | \psi \rangle = \int d^3p \, \bar{\varphi}^*(\mathbf{p}) \, \bar{\psi}(\mathbf{p}) \tag{E-15}$$
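Relation (E-15) can be checked numerically on a finite analog. The sketch below is only an illustration under stated assumptions: the continuous Fourier transform is replaced by a unitary discrete Fourier transform on 8 sample points, and the names `dft_unitary` and `scalar_product`, as well as the sample values, are hypothetical choices made for this example.

```python
import cmath

def dft_unitary(f):
    """Unitary discrete Fourier transform, a finite stand-in for (E-18)."""
    n = len(f)
    return [
        sum(f[j] * cmath.exp(-2j * cmath.pi * j * k / n) for j in range(n)) / n ** 0.5
        for k in range(n)
    ]

def scalar_product(phi, psi):
    """Discrete analog of <phi|psi>: sum of conj(phi_j) * psi_j."""
    return sum(a.conjugate() * b for a, b in zip(phi, psi))

# Two arbitrary sampled "wave functions" (illustrative values only).
phi = [1 + 0j, 1j, 2 - 1j, 0j, -1 + 0.5j, 1j, 0.25 + 0j, 3 + 0j]
psi = [0.5 + 0j, 1 + 1j, 0j, -2 + 0j, 1j, -1j, 1 + 0j, 0.5 + 0.5j]

in_position = scalar_product(phi, psi)                              # analog of (E-14)
in_momentum = scalar_product(dft_unitary(phi), dft_unitary(psi))    # analog of (E-15)
print(abs(in_position - in_momentum) < 1e-9)
```

The two numbers agree to rounding error because the transform is unitary, which is the finite-dimensional counterpart of the statement that the scalar product does not depend on the representation.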


e. CHANGING FROM THE { | r > } REPRESENTATION 

TO THE { | p > } REPRESENTATION 

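This is accomplished using the method indicated in § C-5, the only difference
arising from the fact that we are dealing here with two continuous bases. Changing
from one basis to the other brings in the numbers:

$$\langle \mathbf{r} | \mathbf{p} \rangle = \langle \mathbf{p} | \mathbf{r} \rangle^* = (2\pi\hbar)^{-3/2} \, e^{\frac{i}{\hbar} \mathbf{p} \cdot \mathbf{r}} \tag{E-16}$$

A given ket $| \psi \rangle$ is represented by $\langle \mathbf{r} | \psi \rangle = \psi(\mathbf{r})$ in the
$\{ | \mathbf{r} \rangle \}$ representation and by $\langle \mathbf{p} | \psi \rangle = \bar{\psi}(\mathbf{p})$ in the
$\{ | \mathbf{p} \rangle \}$ representation. We already know [formula (E-7-b)] that $\psi(\mathbf{r})$ and
$\bar{\psi}(\mathbf{p})$ are related by a Fourier transform.
This is indeed what the formulas for the representation change yield:

$$\langle \mathbf{r} | \psi \rangle = \int d^3p \, \langle \mathbf{r} | \mathbf{p} \rangle \langle \mathbf{p} | \psi \rangle$$

that is:

$$\psi(\mathbf{r}) = (2\pi\hbar)^{-3/2} \int d^3p \, e^{\frac{i}{\hbar} \mathbf{p} \cdot \mathbf{r}} \, \bar{\psi}(\mathbf{p}) \tag{E-17}$$

Inversely:

$$\langle \mathbf{p} | \psi \rangle = \int d^3r \, \langle \mathbf{p} | \mathbf{r} \rangle \langle \mathbf{r} | \psi \rangle$$

that is:

$$\bar{\psi}(\mathbf{p}) = (2\pi\hbar)^{-3/2} \int d^3r \, e^{-\frac{i}{\hbar} \mathbf{p} \cdot \mathbf{r}} \, \psi(\mathbf{r}) \tag{E-18}$$

By applying the general formula (C-56), one can easily pass from the
matrix elements $\langle \mathbf{r}' | A | \mathbf{r} \rangle = A(\mathbf{r}', \mathbf{r})$ of an operator $A$ in the
$\{ | \mathbf{r} \rangle \}$ representation to the matrix elements
$\langle \mathbf{p}' | A | \mathbf{p} \rangle = A(\mathbf{p}', \mathbf{p})$ of the same operator
in the $\{ | \mathbf{p} \rangle \}$ representation:

$$A(\mathbf{p}', \mathbf{p}) = (2\pi\hbar)^{-3} \int d^3r \int d^3r' \, e^{\frac{i}{\hbar} (\mathbf{p} \cdot \mathbf{r} - \mathbf{p}' \cdot \mathbf{r}')} \, A(\mathbf{r}', \mathbf{r}) \tag{E-19}$$

An analogous formula enables one to calculate $A(\mathbf{r}', \mathbf{r})$ from $A(\mathbf{p}', \mathbf{p})$.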
2. The R and P operators

a. DEFINITION 

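Let $| \psi \rangle$ be an arbitrary ket of $\mathcal{E}_{\mathbf{r}}$ and let
$\langle \mathbf{r} | \psi \rangle = \psi(\mathbf{r}) \equiv \psi(x, y, z)$
be the corresponding wave function. Using the definition of the operator $X$,
the ket:

$$| \psi' \rangle = X | \psi \rangle \tag{E-20}$$

is represented, in the $\{ | \mathbf{r} \rangle \}$ basis, by the function
$\langle \mathbf{r} | \psi' \rangle = \psi'(\mathbf{r}) \equiv \psi'(x, y, z)$ such that:

$$\psi'(x, y, z) = x \, \psi(x, y, z) \tag{E-21}$$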

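In the $\{ | \mathbf{r} \rangle \}$ representation, the $X$ operator therefore coincides with
the operator which multiplies by $x$. Although we characterize $X$ by the way
in which it transforms the wave functions, it is an operator which acts in
the state space $\mathcal{E}_{\mathbf{r}}$. We can introduce two other operators, $Y$ and $Z$, in an
analogous manner. Thus we define $X$, $Y$ and $Z$ by the formulas:

$$\langle \mathbf{r} | X | \psi \rangle = x \, \langle \mathbf{r} | \psi \rangle \tag{E-22-a}$$

$$\langle \mathbf{r} | Y | \psi \rangle = y \, \langle \mathbf{r} | \psi \rangle \tag{E-22-b}$$

$$\langle \mathbf{r} | Z | \psi \rangle = z \, \langle \mathbf{r} | \psi \rangle \tag{E-22-c}$$

where the numbers $x$, $y$, $z$ are precisely the three indices which label the ket $| \mathbf{r} \rangle$.
$X$, $Y$ and $Z$ will be considered to be the “components” of a “vector operator” $\mathbf{R}$:
for the moment, we shall treat this simply as a condensed notation, suggested
by the fact that $x$, $y$, $z$ are the components of the ordinary vector $\mathbf{r}$.

Manipulation of the $X$, $Y$, $Z$ operators is particularly simple in the
$\{ | \mathbf{r} \rangle \}$ representation. For example, in order to calculate the matrix element
$\langle \varphi | X | \psi \rangle$, all we need to do is insert the closure relation (E-5-b) between
$\langle \varphi |$ and $X$ and use definition (E-22):

$$\langle \varphi | X | \psi \rangle = \int d^3r \, \langle \varphi | \mathbf{r} \rangle \langle \mathbf{r} | X | \psi \rangle = \int d^3r \, \varphi^*(\mathbf{r}) \, x \, \psi(\mathbf{r}) \tag{E-23}$$

Similarly, we define the vector operator $\mathbf{P}$ by its components $P_x$, $P_y$, $P_z$,
whose action, in the $\{ | \mathbf{p} \rangle \}$ representation, is given by:

$$\langle \mathbf{p} | P_x | \psi \rangle = p_x \, \langle \mathbf{p} | \psi \rangle \tag{E-24-a}$$

$$\langle \mathbf{p} | P_y | \psi \rangle = p_y \, \langle \mathbf{p} | \psi \rangle \tag{E-24-b}$$

$$\langle \mathbf{p} | P_z | \psi \rangle = p_z \, \langle \mathbf{p} | \psi \rangle \tag{E-24-c}$$

where $p_x$, $p_y$, $p_z$ are the three indices which appear in the ket $| \mathbf{p} \rangle$.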

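Let us ascertain how the $\mathbf{P}$ operator acts in the $\{ | \mathbf{r} \rangle \}$ representation.
To do so (cf. § C-5-d), we use the closure relation (E-5-d) and the transformation
matrix (E-16) to obtain:

$$\langle \mathbf{r} | P_x | \psi \rangle = \int d^3p \, \langle \mathbf{r} | \mathbf{p} \rangle \langle \mathbf{p} | P_x | \psi \rangle = (2\pi\hbar)^{-3/2} \int d^3p \, e^{\frac{i}{\hbar} \mathbf{p} \cdot \mathbf{r}} \, p_x \, \bar{\psi}(\mathbf{p}) \tag{E-25}$$

We recognize in (E-25) the Fourier transform of $p_x \bar{\psi}(\mathbf{p})$, that
is $\frac{\hbar}{i} \frac{\partial}{\partial x} \psi(\mathbf{r})$
[appendix I, relation (38-a)]. Therefore:

$$\langle \mathbf{r} | \mathbf{P} | \psi \rangle = \frac{\hbar}{i} \nabla \langle \mathbf{r} | \psi \rangle \tag{E-26}$$

In the $\{ | \mathbf{r} \rangle \}$ representation, the $\mathbf{P}$ operator coincides with the differential
operator $\frac{\hbar}{i} \nabla$ applied to the wave functions. The calculation of a matrix element
such as $\langle \varphi | P_x | \psi \rangle$ in the $\{ | \mathbf{r} \rangle \}$ representation is therefore performed in
the following manner:

$$\langle \varphi | P_x | \psi \rangle = \int d^3r \, \langle \varphi | \mathbf{r} \rangle \langle \mathbf{r} | P_x | \psi \rangle = \int d^3r \, \varphi^*(\mathbf{r}) \left[ \frac{\hbar}{i} \frac{\partial}{\partial x} \right] \psi(\mathbf{r}) \tag{E-27}$$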

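Placing ourselves in the $\{ | \mathbf{r} \rangle \}$ representation, we can also calculate
the commutators between the $X$, $Y$, $Z$, $P_x$, $P_y$, $P_z$ operators. For example:

$$\langle \mathbf{r} | [X, P_x] | \psi \rangle = \langle \mathbf{r} | (X P_x - P_x X) | \psi \rangle$$

$$= x \, \langle \mathbf{r} | P_x | \psi \rangle - \frac{\hbar}{i} \frac{\partial}{\partial x} \langle \mathbf{r} | X | \psi \rangle$$

$$= \frac{\hbar}{i} \, x \frac{\partial}{\partial x} \langle \mathbf{r} | \psi \rangle - \frac{\hbar}{i} \frac{\partial}{\partial x} \left[ x \, \langle \mathbf{r} | \psi \rangle \right]$$

$$= i\hbar \, \langle \mathbf{r} | \psi \rangle \tag{E-28}$$

This calculation is valid for all $| \psi \rangle$ and for any ket of the $| \mathbf{r} \rangle$ basis.
Thus one finds*:

$$[X, P_x] = i\hbar \tag{E-29}$$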
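The calculation (E-28) can be reproduced mechanically on polynomial wave functions. The following sketch is an illustration, not part of the original derivation: it assumes a one-dimensional problem, units in which $\hbar = 1$, and a polynomial stored as its list of coefficients; the helper names `apply_x`, `apply_p` and `commutator_on` are hypothetical.

```python
HBAR = 1.0  # assumption of this sketch: units with hbar = 1

def apply_x(coeffs):
    """Multiply the polynomial sum_k c_k x^k by x (shift the coefficients up)."""
    return [0j] + list(coeffs)

def apply_p(coeffs):
    """Apply (hbar / i) d/dx to the polynomial sum_k c_k x^k."""
    return [(HBAR / 1j) * k * c for k, c in enumerate(coeffs)][1:]

def commutator_on(coeffs):
    """Return the coefficients of (X P - P X) psi."""
    xp = apply_x(apply_p(coeffs))
    px = apply_p(apply_x(coeffs))
    return [a - b for a, b in zip(xp, px)]

# psi(x) = 2 - i x + 0.5 x^2 + (3 + i) x^3, an arbitrary test polynomial
psi = [2 + 0j, -1j, 0.5 + 0j, 3 + 1j]
result = commutator_on(psi)
expected = [1j * HBAR * c for c in psi]   # i * hbar * psi, as in (E-28)
print(all(abs(r - e) < 1e-12 for r, e in zip(result, expected)))
```

Term by term, $X P$ contributes $-i\hbar\, k\, c_k$ and $P X$ contributes $-i\hbar\,(k+1)\, c_k$ to the coefficient of $x^k$, so their difference is $i\hbar\, c_k$ whatever the polynomial.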


In the same way, we find all the other commutators between the components
of $\mathbf{R}$ and those of $\mathbf{P}$. The result can be written in the form:

$$[R_i, R_j] = 0 \qquad [P_i, P_j] = 0 \qquad [R_i, P_j] = i\hbar \, \delta_{ij} \qquad i, j = 1, 2, 3 \tag{E-30}$$

where $R_1$, $R_2$, $R_3$ and $P_1$, $P_2$, $P_3$ designate respectively $X$, $Y$, $Z$ and $P_x$, $P_y$, $P_z$.
Formulas (E-30) are called canonical commutation relations.

* The commutator $[X, P_x]$ is an operator, and it should, actually, be written $[X, P_x] = i\hbar \, \mathbb{1}$.
However, we shall often replace the identity operator $\mathbb{1}$ by the number 1, except when it is important
to make the distinction.


b. R AND P ARE HERMITIAN


In order to show that $X$, for example, is a Hermitian operator, we can
use formula (E-23):

$$\langle \varphi | X | \psi \rangle = \int d^3r \, \varphi^*(\mathbf{r}) \, x \, \psi(\mathbf{r}) = \left[ \int d^3r \, \psi^*(\mathbf{r}) \, x \, \varphi(\mathbf{r}) \right]^* = \langle \psi | X | \varphi \rangle^* \tag{E-31}$$

From § B-4-e, we know that equation (E-31) is characteristic of a Hermitian
operator.

Similar proofs show that $Y$ and $Z$ are also Hermitian. For $P_x$, $P_y$ and $P_z$,
the $\{ | \mathbf{p} \rangle \}$ representation can be used, and the calculations are then analogous
to the preceding ones.


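It is interesting to show that $\mathbf{P}$ is Hermitian by using equation (E-26), which gives its
action in the $\{ | \mathbf{r} \rangle \}$ representation. Consider, for example, formula (E-27) and integrate it
by parts:

$$\langle \varphi | P_x | \psi \rangle = \frac{\hbar}{i} \int dy \, dz \int_{-\infty}^{+\infty} dx \, \varphi^*(\mathbf{r}) \frac{\partial}{\partial x} \psi(\mathbf{r})$$

$$= \frac{\hbar}{i} \int dy \, dz \left\{ \Big[ \varphi^*(\mathbf{r}) \, \psi(\mathbf{r}) \Big]_{x=-\infty}^{x=+\infty} - \int_{-\infty}^{+\infty} dx \, \psi(\mathbf{r}) \frac{\partial}{\partial x} \varphi^*(\mathbf{r}) \right\} \tag{E-32}$$

Since the integral which yields the scalar product $\langle \varphi | \psi \rangle$ is convergent,
$\varphi^*(\mathbf{r}) \psi(\mathbf{r})$ approaches zero when $x \to \pm\infty$. The first term on the right hand side
of (E-32) is therefore equal to zero, and:

$$\langle \varphi | P_x | \psi \rangle = -\frac{\hbar}{i} \int d^3r \, \psi(\mathbf{r}) \frac{\partial}{\partial x} \varphi^*(\mathbf{r}) = \left[ \frac{\hbar}{i} \int d^3r \, \psi^*(\mathbf{r}) \frac{\partial}{\partial x} \varphi(\mathbf{r}) \right]^* = \langle \psi | P_x | \varphi \rangle^* \tag{E-33}$$

It can be seen that the presence of the imaginary number $i$ is essential. The differential
operator $\frac{\partial}{\partial x}$ acting on the functions of $\mathcal{F}$ is not Hermitian, because of
the sign change which is introduced by the integration by parts. However, $i \frac{\partial}{\partial x}$
is Hermitian, as is $\frac{\hbar}{i} \frac{\partial}{\partial x}$.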
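The role of the factor $i$ can be seen on a discretized version of $\partial / \partial x$. The sketch below is an illustration under assumptions that are not in the text: the derivative is replaced by a central-difference matrix on 6 grid points with wave functions vanishing outside the grid, $\hbar$ is set to 1, and the helper names are hypothetical.

```python
HBAR = 1.0  # assumption of this sketch: units with hbar = 1

def central_difference(n, h):
    """Matrix of the central-difference approximation of d/dx on n points."""
    d = [[0.0] * n for _ in range(n)]
    for j in range(n - 1):
        d[j][j + 1] = 1.0 / (2 * h)
        d[j + 1][j] = -1.0 / (2 * h)
    return d

def dagger(m):
    """Hermitian conjugate: transpose and complex-conjugate."""
    n = len(m)
    return [[complex(m[c][r]).conjugate() for c in range(n)] for r in range(n)]

def is_hermitian(m):
    md = dagger(m)
    n = len(m)
    return all(abs(m[r][c] - md[r][c]) < 1e-12 for r in range(n) for c in range(n))

D = central_difference(6, 0.1)                               # discrete d/dx
P = [[(HBAR / 1j) * entry for entry in row] for row in D]    # discrete (hbar/i) d/dx
print(is_hermitian(D), is_hermitian(P))   # prints: False True
```

The matrix `D` is real and antisymmetric, which is the discrete trace of the sign change produced by the integration by parts; multiplying it by $\hbar / i$ turns it into a Hermitian matrix.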


c. EIGENVECTORS OF R AND P 

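Consider the action of the $X$ operator on the ket $| \mathbf{r}_0 \rangle$; according to
(E-22-a), we have:

$$\langle \mathbf{r} | X | \mathbf{r}_0 \rangle = x \, \langle \mathbf{r} | \mathbf{r}_0 \rangle = x \, \delta(\mathbf{r} - \mathbf{r}_0) = x_0 \, \delta(\mathbf{r} - \mathbf{r}_0) = x_0 \, \langle \mathbf{r} | \mathbf{r}_0 \rangle \tag{E-34}$$

This equation expresses the fact that the components, in the $\{ | \mathbf{r} \rangle \}$
representation, of the ket $X | \mathbf{r}_0 \rangle$ are equal to those of the ket $| \mathbf{r}_0 \rangle$ multiplied
by $x_0$. We therefore have:

$$X | \mathbf{r}_0 \rangle = x_0 \, | \mathbf{r}_0 \rangle \tag{E-35}$$

An analogous argument shows that the kets $| \mathbf{r}_0 \rangle$ are also eigenvectors of
the $Y$ and $Z$ operators. Omitting the index zero which then becomes unnecessary,
we can write:

$$X | \mathbf{r} \rangle = x \, | \mathbf{r} \rangle \qquad Y | \mathbf{r} \rangle = y \, | \mathbf{r} \rangle \qquad Z | \mathbf{r} \rangle = z \, | \mathbf{r} \rangle \tag{E-36}$$

The kets $| \mathbf{r} \rangle$ are therefore the eigenkets common to $X$, $Y$ and $Z$. Thus the
notation $| \mathbf{r} \rangle$ which we chose above is justified: each eigenvector is labelled by
a vector $\mathbf{r}$, whose components $x$, $y$, $z$ represent three continuous indices which
correspond to the eigenvalues of $X$, $Y$, $Z$.

Similar arguments can be elaborated for the $\mathbf{P}$ operator, placing ourselves,
this time, in the $\{ | \mathbf{p} \rangle \}$ representation. We then obtain:

$$P_x | \mathbf{p} \rangle = p_x \, | \mathbf{p} \rangle \qquad P_y | \mathbf{p} \rangle = p_y \, | \mathbf{p} \rangle \qquad P_z | \mathbf{p} \rangle = p_z \, | \mathbf{p} \rangle \tag{E-37}$$

COMMENT:

This result can also be derived from equation (E-26), which gives the
action of $\mathbf{P}$ in the $\{ | \mathbf{r} \rangle \}$ representation. Using (E-9), we find:

$$\langle \mathbf{r} | P_x | \mathbf{p} \rangle = \frac{\hbar}{i} \frac{\partial}{\partial x} \langle \mathbf{r} | \mathbf{p} \rangle = p_x \, (2\pi\hbar)^{-3/2} \, e^{\frac{i}{\hbar} \mathbf{p} \cdot \mathbf{r}} = p_x \, \langle \mathbf{r} | \mathbf{p} \rangle \tag{E-38}$$

All the components of the ket $P_x | \mathbf{p} \rangle$ in the $\{ | \mathbf{r} \rangle \}$ representation can
be obtained by multiplying those of $| \mathbf{p} \rangle$ by the constant $p_x$: $| \mathbf{p} \rangle$ is an
eigenket of $P_x$ with the eigenvalue $p_x$.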
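The eigenvalue property (E-38) lends itself to a quick numerical check. The sketch below makes several simplifying assumptions of its own: a one-dimensional plane wave, $\hbar = 1$, a central finite difference in place of the exact derivative, and arbitrary test values `p0` and `x0`; the function names are hypothetical.

```python
import cmath
import math

HBAR = 1.0  # assumption of this sketch: units with hbar = 1

def plane_wave(p, x):
    """One-dimensional analog of <r|p> in (E-9): (2 pi hbar)^(-1/2) exp(i p x / hbar)."""
    return cmath.exp(1j * p * x / HBAR) / math.sqrt(2 * math.pi * HBAR)

def momentum_applied(f, x, h=1e-5):
    """(hbar / i) d/dx of f at x, using a central finite difference of step h."""
    return (HBAR / 1j) * (f(x + h) - f(x - h)) / (2 * h)

p0 = 1.7   # arbitrary momentum eigenvalue
x0 = 0.3   # arbitrary point at which the relation is tested

lhs = momentum_applied(lambda x: plane_wave(p0, x), x0)   # analog of <x|P|p>
rhs = p0 * plane_wave(p0, x0)                             # p <x|p>
print(abs(lhs - rhs) < 1e-6)
```

The agreement holds at any point `x0`, which is what it means for all the components of $P_x | \mathbf{p} \rangle$ to be those of $| \mathbf{p} \rangle$ multiplied by the same constant.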

d. R AND P ARE OBSERVABLES 

Relations (E-5-b) and (E-5-d) express the fact that the $\{ | \mathbf{r} \rangle \}$ vectors
and the $\{ | \mathbf{p} \rangle \}$ vectors constitute bases in $\mathcal{E}_{\mathbf{r}}$. Therefore, $\mathbf{R}$ and $\mathbf{P}$ are
observables.

Moreover, the specification of the three eigenvalues $x_0$, $y_0$, $z_0$ of $X$, $Y$, $Z$
uniquely determines the corresponding eigenvector $| \mathbf{r}_0 \rangle$: in the $\{ | \mathbf{r} \rangle \}$
representation, its coordinates are $\delta(x - x_0) \, \delta(y - y_0) \, \delta(z - z_0)$. The set of the
three operators $X$, $Y$, $Z$ therefore constitutes a C.S.C.O. in $\mathcal{E}_{\mathbf{r}}$.

It can be shown in the same way that the three components $P_x$, $P_y$, $P_z$
of $\mathbf{P}$ also constitute a C.S.C.O. in $\mathcal{E}_{\mathbf{r}}$.

Note that, in $\mathcal{E}_{\mathbf{r}}$, $X$ does not constitute a C.S.C.O. by itself. When the $x_0$
index is fixed, $y_0$ and $z_0$ can take on any real values. Thus, each eigenvalue
$x_0$ is infinitely degenerate. On the other hand, in the state space $\mathcal{E}_x$ of a
one-dimensional problem, $X$ constitutes a C.S.C.O.: the eigenvalue $x_0$ uniquely
determines the corresponding eigenket $| x_0 \rangle$, its coordinates being $\delta(x - x_0)$
in the $\{ | x \rangle \}$ representation.


COMMENT:

We have found two C.S.C.O.'s in $\mathcal{E}_{\mathbf{r}}$, $\{ X, Y, Z \}$ and $\{ P_x, P_y, P_z \}$. We shall
encounter others later. Consider, for example, the set $\{ X, P_y, P_z \}$: these three observables
commute (equations (E-30)); moreover, if the three eigenvalues $x_0$, $p_{0y}$ and $p_{0z}$ are fixed,
there corresponds to them only one ket, whose associated wave function is written:

$$\psi_{x_0, p_{0y}, p_{0z}}(x, y, z) = \delta(x - x_0) \, \frac{1}{2\pi\hbar} \, e^{\frac{i}{\hbar} (p_{0y} y + p_{0z} z)} \tag{E-39}$$


F. TENSOR PRODUCT OF STATE SPACES* 


1. Introduction 

We introduced the state space of a physical system using the concept
of a one-particle wave function. However, our reasoning has involved sometimes
one- and sometimes three-dimensional wave functions. Now it is clear that the
space of square-integrable functions is not the same for functions of one variable
$\psi(x)$ as for functions of three variables $\psi(\mathbf{r})$: $\mathcal{E}_x$ and $\mathcal{E}_{\mathbf{r}}$ are therefore different
spaces. Nevertheless, $\mathcal{E}_{\mathbf{r}}$ appears to be essentially a generalization of $\mathcal{E}_x$. Does
there exist a more precise relation between these two spaces?

In this section, we are going to define and study the operation of
taking the tensor product of vector spaces**, and apply it to state spaces.
This will answer, in particular, the question we have just asked: $\mathcal{E}_{\mathbf{r}}$ can be
constructed from $\mathcal{E}_x$ and two other spaces, $\mathcal{E}_y$ and $\mathcal{E}_z$, which are isomorphic
to it (§ F-4-a below).

In the same way, we shall be concerned later (chapters IV and IX) with
the existence, for certain particles, of an intrinsic angular momentum or spin.
In addition to the external degrees of freedom (position, momentum), which are
treated using the observables $\mathbf{R}$ and $\mathbf{P}$ defined in $\mathcal{E}_{\mathbf{r}}$, it will be necessary to
take into account the internal degrees of freedom and to introduce spin observables
which act in a spin state space $\mathcal{E}_s$. The state space $\mathcal{E}$ of a particle with spin
will then be seen to be the tensor product of $\mathcal{E}_{\mathbf{r}}$ and $\mathcal{E}_s$.

Finally, the concept of a tensor product of state spaces allows us to
solve the following problem. Let $(S_1)$ and $(S_2)$ be two isolated physical systems
(they are, for example, sufficiently far apart that their interactions are perfectly
negligible). The state spaces which correspond to $(S_1)$ and $(S_2)$ are, respectively,
$\mathcal{E}_1$ and $\mathcal{E}_2$. Now let us assume that we consider the set of these two systems to
form one physical system $(S)$ (this becomes indispensable when they are close
enough to interact). What is then the state space $\mathcal{E}$ of the global system $(S)$?

It can be seen from these examples how useful the definitions and results
of this section are in quantum mechanics.

* This section is not necessary for the understanding of chapter III. One can study it later
when it becomes necessary to use tensor products (complement D_IV or chapter IX).

** This operation is sometimes called the “Kronecker product”.


2. Definition and properties of the tensor product 

Let $\mathcal{E}_1$ and $\mathcal{E}_2$ be two* spaces, of dimension $N_1$ and $N_2$ respectively
($N_1$ and $N_2$ can be finite or infinite). Vectors and operators of these spaces
will be assigned an index, (1) or (2), depending on whether they belong to
$\mathcal{E}_1$ or $\mathcal{E}_2$.


a. TENSOR PRODUCT SPACE $\mathcal{E}$

α. Definition

By definition, the vector space $\mathcal{E}$ is called the tensor product of $\mathcal{E}_1$ and $\mathcal{E}_2$:

$$\mathcal{E} = \mathcal{E}_1 \otimes \mathcal{E}_2 \tag{F-1}$$

if there is associated with each pair of vectors, $| \varphi(1) \rangle$ belonging to $\mathcal{E}_1$ and
$| \chi(2) \rangle$ belonging to $\mathcal{E}_2$, a vector of $\mathcal{E}$, denoted by**:

$$| \varphi(1) \rangle \otimes | \chi(2) \rangle \tag{F-2}$$

which is called the tensor product of $| \varphi(1) \rangle$ and $| \chi(2) \rangle$, this correspondence
satisfying the following conditions:

(i) It is linear with respect to multiplication by complex numbers:

$$[\lambda | \varphi(1) \rangle] \otimes | \chi(2) \rangle = \lambda \, [ | \varphi(1) \rangle \otimes | \chi(2) \rangle ]$$

$$| \varphi(1) \rangle \otimes [\mu | \chi(2) \rangle] = \mu \, [ | \varphi(1) \rangle \otimes | \chi(2) \rangle ] \tag{F-3}$$

(ii) It is distributive with respect to vector addition:

$$| \varphi(1) \rangle \otimes [ | \chi_1(2) \rangle + | \chi_2(2) \rangle ] = | \varphi(1) \rangle \otimes | \chi_1(2) \rangle + | \varphi(1) \rangle \otimes | \chi_2(2) \rangle$$

$$[ | \varphi_1(1) \rangle + | \varphi_2(1) \rangle ] \otimes | \chi(2) \rangle = | \varphi_1(1) \rangle \otimes | \chi(2) \rangle + | \varphi_2(1) \rangle \otimes | \chi(2) \rangle \tag{F-4}$$

(iii) When a basis has been chosen in each of the spaces $\mathcal{E}_1$ and $\mathcal{E}_2$,
$\{ | u_i(1) \rangle \}$ for $\mathcal{E}_1$ and $\{ | v_l(2) \rangle \}$ for $\mathcal{E}_2$, the set of vectors
$| u_i(1) \rangle \otimes | v_l(2) \rangle$ constitutes a basis in $\mathcal{E}$. If $N_1$ and $N_2$ are finite,
the dimension of $\mathcal{E}$ is consequently $N_1 N_2$.

* The following definitions can easily be extended to the tensor product of a finite number of
spaces.

** This vector can be written either $| \varphi(1) \rangle \otimes | \chi(2) \rangle$ or $| \chi(2) \rangle \otimes | \varphi(1) \rangle$: the order of the
two vectors is of no importance.

β. Vectors of $\mathcal{E}$

(i) Let us first consider a tensor product vector, $| \varphi(1) \rangle \otimes | \chi(2) \rangle$. Whatever
$| \varphi(1) \rangle$ and $| \chi(2) \rangle$ may be, they can be expressed in the $\{ | u_i(1) \rangle \}$ and
$\{ | v_l(2) \rangle \}$ bases respectively:

$$| \varphi(1) \rangle = \sum_i a_i \, | u_i(1) \rangle \qquad | \chi(2) \rangle = \sum_l b_l \, | v_l(2) \rangle \tag{F-5}$$

Using the properties described in § α, the expansion of the vector
$| \varphi(1) \rangle \otimes | \chi(2) \rangle$ in the $\{ | u_i(1) \rangle \otimes | v_l(2) \rangle \}$ basis can be written:

$$| \varphi(1) \rangle \otimes | \chi(2) \rangle = \sum_{i,l} a_i b_l \, | u_i(1) \rangle \otimes | v_l(2) \rangle \tag{F-6}$$

Therefore, the components of a tensor product vector are the products
of the components of the two vectors of the product.

(ii) There exist in $\mathcal{E}$ vectors which are not tensor products of a vector
of $\mathcal{E}_1$ by a vector of $\mathcal{E}_2$. Since $\{ | u_i(1) \rangle \otimes | v_l(2) \rangle \}$ constitutes by hypothesis
a basis in $\mathcal{E}$, the most general vector of $\mathcal{E}$ is expressed by:

$$| \psi \rangle = \sum_{i,l} c_{i,l} \, | u_i(1) \rangle \otimes | v_l(2) \rangle \tag{F-7}$$

Given $N_1 N_2$ arbitrary complex numbers $c_{i,l}$, it is not always possible to put
them in the form of products, $a_i b_l$, of $N_1$ numbers $a_i$ and $N_2$ numbers $b_l$.
Therefore, in general, vectors $| \varphi(1) \rangle$ and $| \chi(2) \rangle$ of which $| \psi \rangle$ is the tensor
product do not exist. However, an arbitrary vector of $\mathcal{E}$ can always be decomposed
into a linear combination of tensor product vectors, as is shown by formula (F-7).
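The difference between (F-6) and (F-7) is easy to exhibit in the smallest case. The sketch below assumes $N_1 = N_2 = 2$ and stores the components $c_{i,l}$ as a flat list ordered $(1,1), (1,2), (2,1), (2,2)$; the function names and the sample vectors are hypothetical, and the determinant test is specific to this two-by-two case.

```python
def tensor(a, b):
    """Components a_i * b_l of a tensor product vector, as in (F-6)."""
    return [ai * bl for ai in a for bl in b]

def is_product_2x2(c):
    """For N1 = N2 = 2, c_il = a_i * b_l is possible iff c_11 c_22 - c_12 c_21 = 0."""
    return abs(c[0] * c[3] - c[1] * c[2]) < 1e-12

phi = [1 + 0j, 2j]          # components a_i of |phi(1)>
chi = [3 + 0j, -1 + 0j]     # components b_l of |chi(2)>
product_vector = tensor(phi, chi)

# A linear combination of two product vectors: |u_1 v_1> + |u_2 v_2>.
combination = [x + y for x, y in zip(tensor([1, 0], [1, 0]), tensor([0, 1], [0, 1]))]

print(is_product_2x2(product_vector), is_product_2x2(combination))   # prints: True False
```

The second vector has components $(1, 0, 0, 1)$: it is of the general form (F-7), yet no choice of $a_i$ and $b_l$ reproduces it.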

γ. The scalar product in $\mathcal{E}$

The existence of scalar products in $\mathcal{E}_1$ and $\mathcal{E}_2$ permits us to define one
in $\mathcal{E}$ as well. We first define the scalar product of
$| \varphi(1) \chi(2) \rangle = | \varphi(1) \rangle \otimes | \chi(2) \rangle$
by $| \varphi'(1) \chi'(2) \rangle = | \varphi'(1) \rangle \otimes | \chi'(2) \rangle$ by setting:

$$\langle \varphi'(1) \chi'(2) | \varphi(1) \chi(2) \rangle = \langle \varphi'(1) | \varphi(1) \rangle \, \langle \chi'(2) | \chi(2) \rangle \tag{F-8}$$

For two arbitrary vectors of $\mathcal{E}$, we simply use the fundamental properties of
the scalar product [equations (B-9), (B-10) and (B-11)], since each of these
vectors is a linear combination of tensor product vectors.

Notice, in particular, that the basis $\{ | u_i(1) v_l(2) \rangle = | u_i(1) \rangle \otimes | v_l(2) \rangle \}$ is
orthonormal if each of the bases $\{ | u_i(1) \rangle \}$ and $\{ | v_l(2) \rangle \}$ is:

$$\langle u_{i'}(1) v_{l'}(2) | u_i(1) v_l(2) \rangle = \langle u_{i'}(1) | u_i(1) \rangle \, \langle v_{l'}(2) | v_l(2) \rangle = \delta_{ii'} \, \delta_{ll'} \tag{F-9}$$
b. TENSOR PRODUCT OF OPERATORS

(i) First, consider a linear operator $A(1)$ defined in $\mathcal{E}_1$. We associate
with it a linear operator $\tilde{A}(1)$ acting in $\mathcal{E}$, which we call the extension of
$A(1)$ in $\mathcal{E}$, and which is characterized in the following way: when $\tilde{A}(1)$ is
applied to a tensor product vector $| \varphi(1) \rangle \otimes | \chi(2) \rangle$, one obtains, by definition:

$$\tilde{A}(1) \, [ | \varphi(1) \rangle \otimes | \chi(2) \rangle ] = [ A(1) | \varphi(1) \rangle ] \otimes | \chi(2) \rangle \tag{F-10}$$

The hypothesis that $\tilde{A}(1)$ is linear is then sufficient for determining it
completely. An arbitrary vector $| \psi \rangle$ of $\mathcal{E}$ can be written in the form (F-7).
Definition (F-10) then gives the action of $\tilde{A}(1)$ on $| \psi \rangle$:

$$\tilde{A}(1) | \psi \rangle = \sum_{i,l} c_{i,l} \, [ A(1) | u_i(1) \rangle ] \otimes | v_l(2) \rangle \tag{F-11}$$

We obtain in an analogous manner the extension $\tilde{B}(2)$ of an operator $B(2)$
initially defined in $\mathcal{E}_2$.

(ii) Now let $A(1)$ and $B(2)$ be two linear operators acting respectively in $\mathcal{E}_1$
and $\mathcal{E}_2$. Their tensor product $A(1) \otimes B(2)$ is the linear operator in $\mathcal{E}$, defined by the
following relation which describes its action on the tensor product vectors:

$$[ A(1) \otimes B(2) ] \, [ | \varphi(1) \rangle \otimes | \chi(2) \rangle ] = [ A(1) | \varphi(1) \rangle ] \otimes [ B(2) | \chi(2) \rangle ] \tag{F-12}$$

Here also, this definition is sufficient for characterizing $A(1) \otimes B(2)$.

COMMENTS:

(i) The extensions of operators are special cases of tensor products: if $\mathbb{1}(1)$ and $\mathbb{1}(2)$ are the
identity operators in $\mathcal{E}_1$ and $\mathcal{E}_2$ respectively, $\tilde{A}(1)$ and $\tilde{B}(2)$ can be written:

$$\tilde{A}(1) = A(1) \otimes \mathbb{1}(2) \qquad \tilde{B}(2) = \mathbb{1}(1) \otimes B(2) \tag{F-13}$$

Inversely, the tensor product $A(1) \otimes B(2)$ coincides with the ordinary product
of two operators $\tilde{A}(1)$ and $\tilde{B}(2)$ of $\mathcal{E}$:

$$A(1) \otimes B(2) = \tilde{A}(1) \, \tilde{B}(2) \tag{F-14}$$

(ii) It is easy to show that two operators such as $\tilde{A}(1)$ and $\tilde{B}(2)$ commute in $\mathcal{E}$:

$$[ \tilde{A}(1), \tilde{B}(2) ] = 0 \tag{F-15}$$

We must verify that $\tilde{A}(1) \tilde{B}(2)$ and $\tilde{B}(2) \tilde{A}(1)$ yield the same result when they act on an
arbitrary vector of the $\{ | u_i(1) \rangle \otimes | v_l(2) \rangle \}$ basis:

$$\tilde{A}(1) \tilde{B}(2) \, | u_i(1) \rangle \otimes | v_l(2) \rangle = \tilde{A}(1) \, | u_i(1) \rangle \otimes [ B(2) | v_l(2) \rangle ] = [ A(1) | u_i(1) \rangle ] \otimes [ B(2) | v_l(2) \rangle ] \tag{F-16}$$

$$\tilde{B}(2) \tilde{A}(1) \, | u_i(1) \rangle \otimes | v_l(2) \rangle = \tilde{B}(2) \, [ A(1) | u_i(1) \rangle ] \otimes | v_l(2) \rangle = [ A(1) | u_i(1) \rangle ] \otimes [ B(2) | v_l(2) \rangle ] \tag{F-17}$$

(iii) The projector onto the tensor product vector $| \varphi(1) \chi(2) \rangle = | \varphi(1) \rangle \otimes | \chi(2) \rangle$, which
is an operator acting in $\mathcal{E}$, is obtained by taking the tensor product of the projectors
onto $| \varphi(1) \rangle$ and $| \chi(2) \rangle$:

$$| \varphi(1) \chi(2) \rangle \langle \varphi(1) \chi(2) | = | \varphi(1) \rangle \langle \varphi(1) | \otimes | \chi(2) \rangle \langle \chi(2) | \tag{F-18}$$

This relation follows immediately from the definition of the scalar product in $\mathcal{E}$.

(iv) Just as with vectors, there exist operators in $\mathcal{E}$ which are not tensor products of an
operator of $\mathcal{E}_1$ and an operator of $\mathcal{E}_2$.
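Relations (F-13) to (F-15) can be verified on small matrices. The sketch below is an illustration under assumptions of our own: finite dimensions $N_1 = 2$ and $N_2 = 3$, matrices written in the product basis ordered row-major in $(i, l)$, and arbitrary integer entries; `kron`, `matmul` and `identity` are hypothetical helper names written out in plain Python.

```python
def kron(a, b):
    """Matrix of A(1) (x) B(2) in the basis |u_i(1)> (x) |v_l(2)>, ordered row-major in (i, l)."""
    return [
        [a[i][j] * b[k][l] for j in range(len(a[0])) for l in range(len(b[0]))]
        for i in range(len(a)) for k in range(len(b))
    ]

def matmul(a, b):
    """Ordinary matrix product."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
            for i in range(len(a))]

def identity(n):
    return [[1 if i == j else 0 for j in range(n)] for i in range(n)]

A = [[1, 2], [3, 4]]                      # operator of the first space (N1 = 2)
B = [[0, 1, 0], [1, 0, 2], [0, 2, 5]]     # operator of the second space (N2 = 3)

A_ext = kron(A, identity(3))   # extension of A(1), as in (F-13)
B_ext = kron(identity(2), B)   # extension of B(2), as in (F-13)

print(matmul(A_ext, B_ext) == matmul(B_ext, A_ext))   # (F-15): the extensions commute
print(matmul(A_ext, B_ext) == kron(A, B))             # (F-14): their product is A (x) B
```

Integer entries are used so that the comparisons are exact; both lines print `True`.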


c. NOTATION

In quantum mechanics, the notation generally used is a simplified version
of the one which we have defined here. This is the one we shall adopt, but it is
important to interpret it correctly in the light of the preceding discussion.

First of all, the symbol $\otimes$ which indicates the tensor product is omitted,
and the vectors or operators which are to be multiplied tensorially are simply
juxtaposed:

$$| \varphi(1) \rangle | \chi(2) \rangle \quad \text{means} \quad | \varphi(1) \rangle \otimes | \chi(2) \rangle \tag{F-19}$$

$$A(1) B(2) \quad \text{means} \quad A(1) \otimes B(2) \tag{F-20}$$

Moreover, the extension in $\mathcal{E}$ of an operator of $\mathcal{E}_1$ or $\mathcal{E}_2$ is written in the same way
as this operator itself:

$$A(1) \quad \text{means} \quad \tilde{A}(1) \text{ or } A(1) \tag{F-21}$$

No confusion is possible in (F-19): until now we have never written two
kets one after the other as we do here. Notice in particular that the expression
$| \psi \rangle | \varphi \rangle$, where $| \psi \rangle$ and $| \varphi \rangle$ belong to the same space $\mathcal{E}$, is not defined in
this space: it represents a vector of the space which is the tensor product of $\mathcal{E}$
by itself.

On the other hand, the notation in (F-20) and (F-21) is slightly ambiguous,
especially in the latter, where two different operators are represented by the same
symbol. However, it will be possible in practice to distinguish between them by
the vector to which this symbol is applied: depending on whether it is a vector of $\mathcal{E}$
or of $\mathcal{E}_1$, we shall be dealing with $\tilde{A}(1)$, or with $A(1)$ in a strict sense. As for
formula (F-20), it poses no problem when $\mathcal{E}_1$ and $\mathcal{E}_2$ are different, since we have, until
now, defined only products of operators which act in the same space. Moreover,
$A(1) B(2)$ can be considered to be an ordinary product of operators of $\mathcal{E}$,
if $A(1)$ and $B(2)$ are interpreted as designating, in fact, $\tilde{A}(1)$ and $\tilde{B}(2)$
[equation (F-14)].


3. Eigenvalue equations in the product space 

The vectors of $\mathcal{E}$ which are tensor products of a vector of $\mathcal{E}_1$ and a vector
of $\mathcal{E}_2$ play an important role in the discussion above. We shall see that this is also
the case for the extensions of operators of $\mathcal{E}_1$ and $\mathcal{E}_2$.

a. EIGENVALUES AND EIGENVECTORS OF EXTENDED OPERATORS

α. Eigenvalue equation of $A(1)$

Consider an operator $A(1)$, for which we know, in $\mathcal{E}_1$, all the eigenstates
and eigenvalues. We shall assume, for example, that the whole spectrum of $A(1)$
is discrete:

$$A(1) \, | \varphi_n^i(1) \rangle = a_n \, | \varphi_n^i(1) \rangle \; ; \quad i = 1, 2, \ldots, g_n \tag{F-22}$$

We want to solve the eigenvalue equation of the extension of $A(1)$ in $\mathcal{E}$:

$$A(1) \, | \psi \rangle = \lambda \, | \psi \rangle \tag{F-23}$$

It can immediately be seen, from (F-10), that every vector of the
form $| \varphi_n^i(1) \rangle | \chi(2) \rangle$ is an eigenvector of $A(1)$ with the eigenvalue $a_n$, whatever
$| \chi(2) \rangle$ may be, since:

$$A(1) \, | \varphi_n^i(1) \rangle | \chi(2) \rangle = [ A(1) | \varphi_n^i(1) \rangle ] \, | \chi(2) \rangle = a_n \, | \varphi_n^i(1) \rangle | \chi(2) \rangle \tag{F-24}$$

Let us show that when $A(1)$ is an observable in $\mathcal{E}_1$, all the solutions of (F-23)
can be obtained in this way. The set of the $| \varphi_n^i(1) \rangle$ then forms a basis in $\mathcal{E}_1$.
Consequently, the orthonormal system of vectors $| \psi_n^{i,l} \rangle$ such that:

$$| \psi_n^{i,l} \rangle = | \varphi_n^i(1) \rangle | v_l(2) \rangle \tag{F-25}$$

where $\{ | v_l(2) \rangle \}$ is a basis of $\mathcal{E}_2$, also forms a basis in $\mathcal{E}$. We therefore have an
orthonormal basis constituted by the eigenvectors of $A(1)$ in $\mathcal{E}$, $\{ | \psi_n^{i,l} \rangle \}$, which
means that equation (F-23) is solved.

The following conclusions can be drawn:

- If $A(1)$ is an observable in $\mathcal{E}_1$, it is also an observable in $\mathcal{E}$. This results
from the fact that the extension of $A(1)$ is Hermitian and from the fact that $\{ | \psi_n^{i,l} \rangle \}$
constitutes a basis in $\mathcal{E}$.

- The spectrum of $A(1)$ is the same in $\mathcal{E}$ as in $\mathcal{E}_1$: the same eigenvalues $a_n$
appear in (F-22) and in (F-24).

- Nevertheless, an eigenvalue $a_n$ which is $g_n$-fold degenerate in $\mathcal{E}_1$ has, in $\mathcal{E}$,
a degree of degeneracy $N_2 \times g_n$. We know that the eigensubspace associated
with $a_n$ is spanned in $\mathcal{E}$ by the kets $| \psi_n^{i,l} \rangle = | \varphi_n^i(1) \rangle | v_l(2) \rangle$ with $n$ fixed
and $i = 1, 2, \ldots, g_n$; $l = 1, 2, \ldots, N_2$. Therefore, even if $a_n$ is simple in $\mathcal{E}_1$, it
is ($N_2$-fold) degenerate in $\mathcal{E}$.
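The degeneracy count $N_2 \times g_n$ can be read off a small explicit case. In the sketch below, which is only an illustration, $A(1)$ is assumed to be already diagonal in $\mathcal{E}_1$ with a simple eigenvalue 1 and a two-fold degenerate eigenvalue 2, and $\mathcal{E}_2$ is assumed to have dimension $N_2 = 4$; the helper names are hypothetical.

```python
from collections import Counter

def kron(a, b):
    """Matrix of A(1) (x) B(2) in the product basis, ordered row-major in (i, l)."""
    return [
        [a[i][j] * b[k][l] for j in range(len(a[0])) for l in range(len(b[0]))]
        for i in range(len(a)) for k in range(len(b))
    ]

def identity(n):
    return [[1 if i == j else 0 for j in range(n)] for i in range(n)]

# A(1) in its own eigenbasis: a_1 = 1 is simple (g = 1), a_2 = 2 has g = 2.
A = [[1, 0, 0], [0, 2, 0], [0, 0, 2]]
N2 = 4

A_ext = kron(A, identity(N2))   # extension of A(1), of size 12 x 12 and still diagonal
diagonal = [A_ext[k][k] for k in range(len(A_ext))]
degeneracy = Counter(diagonal)

print(degeneracy[1], degeneracy[2])   # prints: 4 8
```

The eigenvalues themselves are unchanged; only their multiplicities are multiplied by $N_2$.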

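The projector onto the eigensubspace corresponding to an eigenvalue $a_n$ is written, in $\mathcal{E}$
[cf. (F-18)]:

$$\sum_{i,l} | \psi_n^{i,l} \rangle \langle \psi_n^{i,l} | = \sum_i \sum_l | \varphi_n^i(1) \rangle \langle \varphi_n^i(1) | \otimes | v_l(2) \rangle \langle v_l(2) | = \sum_i | \varphi_n^i(1) \rangle \langle \varphi_n^i(1) | \otimes \mathbb{1}(2) \tag{F-26}$$

using in $\mathcal{E}_2$ the closure relation relative to the $\{ | v_l(2) \rangle \}$ basis. It is therefore the extension
of the projector $P_n(1) = \sum_i | \varphi_n^i(1) \rangle \langle \varphi_n^i(1) |$ which is associated with $a_n$ in $\mathcal{E}_1$.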

β. Eigenvalue equation of $A(1) + B(2)$

We shall often need to solve, in a tensor product space such as $\mathcal{E}$, eigenvalue
equations for operators of the form:

$$C = A(1) + B(2) \tag{F-27}$$

where $A(1)$ and $B(2)$ are observables whose eigenvalues and eigenvectors are
known in $\mathcal{E}_1$ and $\mathcal{E}_2$ respectively:

$$A(1) \, | \varphi_n(1) \rangle = a_n \, | \varphi_n(1) \rangle$$

$$B(2) \, | \chi_p(2) \rangle = b_p \, | \chi_p(2) \rangle \tag{F-28}$$

[to simplify the notation, we assume the spectra of $A(1)$ and $B(2)$ to be discrete
and non-degenerate in $\mathcal{E}_1$ and $\mathcal{E}_2$].

$A(1)$ and $B(2)$ commute [formulas (F-16) and (F-17)], and the $| \varphi_n(1) \rangle | \chi_p(2) \rangle$,
which form a basis in $\mathcal{E}$, are eigenvectors common to $A(1)$ and $B(2)$:

$$A(1) \, | \varphi_n(1) \rangle | \chi_p(2) \rangle = a_n \, | \varphi_n(1) \rangle | \chi_p(2) \rangle$$

$$B(2) \, | \varphi_n(1) \rangle | \chi_p(2) \rangle = b_p \, | \varphi_n(1) \rangle | \chi_p(2) \rangle \tag{F-29}$$

They are also eigenvectors of $C$:

$$C \, | \varphi_n(1) \rangle | \chi_p(2) \rangle = (a_n + b_p) \, | \varphi_n(1) \rangle | \chi_p(2) \rangle \tag{F-30}$$

This gives us directly the solution of the eigenvalue equation of $C$.

Therefore: the eigenvalues of $C = A(1) + B(2)$ are the sums of an eigenvalue
of $A(1)$ and an eigenvalue of $B(2)$. One can find a basis of eigenvectors of $C$ which
are tensor products of an eigenvector of $A(1)$ and an eigenvector of $B(2)$.


COMMENT:

Equation (F-30) shows that the eigenvalues of $C$ are all of the form $c_{np} = a_n + b_p$.
If two different pairs of values of $n$ and $p$ which give the same value for $c_{np}$ do not exist,
$c_{np}$ is non-degenerate (recall that we have assumed $a_n$ and $b_p$ to be non-degenerate in
$\mathcal{E}_1$ and $\mathcal{E}_2$ respectively). The corresponding eigenvector of $C$ is necessarily the tensor
product $| \varphi_n(1) \rangle | \chi_p(2) \rangle$. If, on the other hand, the eigenvalue $c_{np}$ is, for example,
two-fold degenerate (there exist $m$ and $q$ such that $c_{mq} = c_{np}$), all that can be asserted
is that every eigenvector of $C$ corresponding to this eigenvalue is written:

$$\lambda \, | \varphi_n(1) \rangle | \chi_p(2) \rangle + \mu \, | \varphi_m(1) \rangle | \chi_q(2) \rangle \tag{F-31}$$

where $\lambda$ and $\mu$ are arbitrary complex numbers. In this case, therefore, there exist
eigenvectors of $C$ which are not tensor products.
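Both (F-30) and the degenerate case (F-31) can be seen on a four-dimensional example. The sketch below is an illustration built on assumptions of our own: $A(1)$ is taken to be the matrix with eigenvalues $\pm 1$ and (unnormalized) eigenvectors $(1, 1)$ and $(1, -1)$, $B(2)$ is diagonal with eigenvalues 0 and 2, and the coefficients $\lambda = 2$, $\mu = 3$ are arbitrary; all helper names are hypothetical.

```python
def kron(a, b):
    """Matrix of A(1) (x) B(2) in the product basis, ordered row-major in (i, l)."""
    return [
        [a[i][j] * b[k][l] for j in range(len(a[0])) for l in range(len(b[0]))]
        for i in range(len(a)) for k in range(len(b))
    ]

def identity(n):
    return [[1 if i == j else 0 for j in range(n)] for i in range(n)]

def mat_add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def apply(m, v):
    """Action of the matrix m on the column of components v."""
    return [sum(m[r][c] * v[c] for c in range(len(v))) for r in range(len(m))]

def tensor(a, b):
    return [x * y for x in a for y in b]

A = [[0, 1], [1, 0]]   # eigenvalues +1 and -1, eigenvectors (1, 1) and (1, -1)
B = [[0, 0], [0, 2]]   # eigenvalues 0 and 2, eigenvectors (1, 0) and (0, 1)
C = mat_add(kron(A, identity(2)), kron(identity(2), B))   # C = A(1) + B(2), as in (F-27)

v1 = tensor([1, 1], [1, 0])     # a + b = 1 + 0 = 1
v2 = tensor([1, -1], [0, 1])    # a + b = -1 + 2 = 1, the same eigenvalue
mixed = [2 * x + 3 * y for x, y in zip(v1, v2)]   # combination of the form (F-31)

print(apply(C, v1) == v1, apply(C, v2) == v2, apply(C, mixed) == mixed)
print(mixed[0] * mixed[3] - mixed[1] * mixed[2])   # non-zero: not a tensor product
```

The eigenvalue 1 is reached by two different pairs, so the combination `mixed` is still an eigenvector of $C$, although its components $(2, 3, 2, -3)$ cannot be factorized.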


b. COMPLETE SETS OF COMMUTING OBSERVABLES IN $\mathcal{E}$

We are finally going to show that if a C.S.C.O. has been chosen in both
spaces $\mathcal{E}_1$ and $\mathcal{E}_2$, obtaining one in $\mathcal{E}$ is straightforward.

As an example, let us consider the case where $A(1)$ constitutes a C.S.C.O.
by itself in $\mathcal{E}_1$ and the C.S.C.O. in $\mathcal{E}_2$ is composed of two observables, $B(2)$
and $C(2)$. This means (cf. § D-3-b) that all the eigenvalues $a_n$ of $A(1)$ are
non-degenerate in $\mathcal{E}_1$:

$$A(1) \, | \varphi_n(1) \rangle = a_n \, | \varphi_n(1) \rangle \tag{F-32}$$

the ket $| \varphi_n(1) \rangle$ being unique to within a constant factor. On the other hand, in $\mathcal{E}_2$,
some of the eigenvalues $b_p$ of $B(2)$ are degenerate, as are some of the eigenvalues $c_r$
of $C(2)$. Nevertheless, the basis of eigenvectors common to $B(2)$ and $C(2)$ is
unique in $\mathcal{E}_2$, since there exists only one ket (to within a constant factor) which is
an eigenvector of $B(2)$ and of $C(2)$ with the eigenvalues $b_p$ and $c_r$ fixed:

$$B(2) \, | \chi_{pr}(2) \rangle = b_p \, | \chi_{pr}(2) \rangle$$

$$C(2) \, | \chi_{pr}(2) \rangle = c_r \, | \chi_{pr}(2) \rangle$$

$$| \chi_{pr}(2) \rangle \text{ unique to within a constant factor} \tag{F-33}$$

In $\mathcal{E}$, each of the eigenvalues $a_n$ is $N_2$-fold degenerate (cf. § F-3-a). Therefore,
$A(1)$ no longer forms a C.S.C.O. by itself. Similarly, there exist $N_1$ linearly
independent kets which are eigenvectors of $B(2)$ and $C(2)$ with the eigenvalues $b_p$
and $c_r$ respectively, and the set $\{ B(2), C(2) \}$ is not complete either. However,
we saw in § F-3-a that the eigenvectors which are common to the three commuting
observables $A(1)$, $B(2)$ and $C(2)$ are the $| \varphi_n(1) \chi_{pr}(2) \rangle = | \varphi_n(1) \rangle | \chi_{pr}(2) \rangle$:

$$A(1) \, | \varphi_n(1) \chi_{pr}(2) \rangle = a_n \, | \varphi_n(1) \chi_{pr}(2) \rangle$$

$$B(2) \, | \varphi_n(1) \chi_{pr}(2) \rangle = b_p \, | \varphi_n(1) \chi_{pr}(2) \rangle$$

$$C(2) \, | \varphi_n(1) \chi_{pr}(2) \rangle = c_r \, | \varphi_n(1) \chi_{pr}(2) \rangle \tag{F-34}$$

The system $\{ | \varphi_n(1) \chi_{pr}(2) \rangle \}$ constitutes a basis in $\mathcal{E}$, since this is the case
for $\{ | \varphi_n(1) \rangle \}$ and $\{ | \chi_{pr}(2) \rangle \}$ in $\mathcal{E}_1$ and $\mathcal{E}_2$ respectively. Moreover, if a set of
three eigenvalues $\{ a_n, b_p, c_r \}$ is chosen, only one vector $| \varphi_n(1) \chi_{pr}(2) \rangle$ corresponds
to it. $A(1)$, $B(2)$ and $C(2)$ therefore constitute a C.S.C.O. in $\mathcal{E}$.

The preceding argument can be generalized without difficulty: by joining
two sets of commuting observables which are complete in $\mathcal{E}_1$ and $\mathcal{E}_2$ respectively,
one obtains a complete set of commuting observables in $\mathcal{E}$.


4. Applications 

a. ONE- AND THREE-DIMENSIONAL PARTICLE STATES 

α. State spaces

Consider again, in the light of the preceding discussion, the problem posed
in the introduction (§ F-1): how are $\mathcal{E}_x$ and $\mathcal{E}_{\mathbf{r}}$ related?

$\mathcal{E}_x$ is the state space of a one-dimensional particle, that is, the state space
associated with the wave functions $\varphi(x)$. In $\mathcal{E}_x$, the observable $X$ which was studied
in § E-2 constitutes a C.S.C.O. by itself (§ E-2-d); its eigenvectors are the basis
kets of the $\{ | x \rangle \}$ representation. A vector $| \varphi \rangle$ of $\mathcal{E}_x$ is characterized, in this
representation, by a wave function $\varphi(x) = \langle x | \varphi \rangle$; in particular, the basis ket $| x_0 \rangle$
corresponds to $\xi_{x_0}(x) = \delta(x - x_0)$.

In the same way, it is possible to introduce the spaces $\mathcal{E}_y$ and $\mathcal{E}_z$ associated
with the wave functions $\chi(y)$ and $\omega(z)$. The observable $Y$ forms a C.S.C.O. in $\mathcal{E}_y$,
as does $Z$ in $\mathcal{E}_z$. The corresponding eigenvectors are the basis kets of the $\{ | y \rangle \}$
and $\{ | z \rangle \}$ representations of $\mathcal{E}_y$ and $\mathcal{E}_z$ respectively. A vector $| \chi \rangle$ of $\mathcal{E}_y$
(or $| \omega \rangle$ of $\mathcal{E}_z$) is characterized in the $\{ | y \rangle \}$ (or $\{ | z \rangle \}$) representation by a
function $\chi(y) = \langle y | \chi \rangle$ (or $\omega(z) = \langle z | \omega \rangle$). The function which corresponds to the
basis ket $| y_0 \rangle$ (or $| z_0 \rangle$) is $\delta(y - y_0)$ (or $\delta(z - z_0)$).

Let us then form the tensor product:

$$\mathcal{E}_{xyz} = \mathcal{E}_x \otimes \mathcal{E}_y \otimes \mathcal{E}_z \tag{F-35}$$

We obtain a basis in $\mathcal{E}_{xyz}$ from the tensor product of the $\{ | x \rangle \}$, $\{ | y \rangle \}$
and $\{ | z \rangle \}$ bases. We shall denote it by $\{ | x, y, z \rangle \}$, with:

$$| x, y, z \rangle = | x \rangle | y \rangle | z \rangle \tag{F-36}$$

The basis kets are simultaneous eigenvectors of the $X$, $Y$ and $Z$ operators extended
into $\mathcal{E}_{xyz}$:

$$X \, | x, y, z \rangle = x \, | x, y, z \rangle$$

$$Y \, | x, y, z \rangle = y \, | x, y, z \rangle$$

$$Z \, | x, y, z \rangle = z \, | x, y, z \rangle \tag{F-37}$$

Therefore, $\mathcal{E}_{xyz}$ coincides with $\mathcal{E}_{\mathbf{r}}$, the state space of a three-dimensional particle,
and $| x, y, z \rangle$ with $| \mathbf{r} \rangle$:

$$| x, y, z \rangle \equiv | \mathbf{r} \rangle = | x \rangle | y \rangle | z \rangle \tag{F-38}$$

where $x$, $y$, $z$ are precisely the cartesian coordinates of $\mathbf{r}$.

There exist in $\mathcal{E}_{\mathbf{r}}$ kets $| \varphi \chi \omega \rangle = | \varphi \rangle | \chi \rangle | \omega \rangle$ which are the tensor
products of three kets, one of $\mathcal{E}_x$, one of $\mathcal{E}_y$ and one of $\mathcal{E}_z$. Their components
in the $\{ | \mathbf{r} \rangle \}$ representation are then [cf. formula (F-8)]:

$$\langle \mathbf{r} | \varphi \chi \omega \rangle = \langle x | \varphi \rangle \, \langle y | \chi \rangle \, \langle z | \omega \rangle \tag{F-39}$$

The associated wave functions are thus factorized: $\varphi(x) \, \chi(y) \, \omega(z)$. This is the case
for the basis vectors themselves:

$$\langle \mathbf{r} | \mathbf{r}_0 \rangle = \delta(\mathbf{r} - \mathbf{r}_0) = \delta(x - x_0) \, \delta(y - y_0) \, \delta(z - z_0) \tag{F-40}$$

Note that the most general state of $\mathcal{E}_{\mathbf{r}}$ is not such a product. It is written:

$$| \psi \rangle = \int dx \, dy \, dz \; \psi(x, y, z) \, | x, y, z \rangle \tag{F-41}$$

In $\psi(x, y, z) = \langle x, y, z | \psi \rangle$, the $x$-, $y$- and $z$-dependences cannot, in general, be
factorized: each of the wave functions associated with the kets of $\mathcal{E}_{\mathbf{r}}$ is a wave
function with three variables.

The results of § F-3 thus enable us to understand why $X$, which constitutes
a C.S.C.O. by itself in $\mathcal{E}_x$, no longer has this property in $\mathcal{E}_{\mathbf{r}}$ (cf. § E-2-d): the
eigenvalues of its extension in $\mathcal{E}_{\mathbf{r}}$ are the same as in $\mathcal{E}_x$, but they become infinitely
degenerate because $\mathcal{E}_y$ and $\mathcal{E}_z$ are infinite-dimensional. Starting with a C.S.C.O.
in $\mathcal{E}_x$, $\mathcal{E}_y$ and $\mathcal{E}_z$, we construct one for $\mathcal{E}_{\mathbf{r}}$: $\{ X, Y, Z \}$, for example, but
also $\{ P_x, Y, Z \}$ since $P_x$ forms a C.S.C.O. in $\mathcal{E}_x$, or $\{ P_x, P_y, Z \}$, etc.

β. An important application

Let us try to solve in $\mathscr{E}_{\mathbf r}$ the eigenvalue equation of an operator $H$ such that:

$$H = H_x + H_y + H_z \tag{F-42}$$

where $H_x$, $H_y$ and $H_z$ are the extensions of observables acting respectively in $\mathscr{E}_x$, $\mathscr{E}_y$ and $\mathscr{E}_z$. In practice, one recognizes that $H_x$, for example, is the extension of an observable of $\mathscr{E}_x$ because it is constructed using only the operators $X$ and $P_x$. Using the reasoning of § F-3-a-β, one first looks for the eigenvalues and eigenvectors of $H_x$ in $\mathscr{E}_x$, $H_y$ in $\mathscr{E}_y$ and $H_z$ in $\mathscr{E}_z$:

$$H_x\,|\varphi_n\rangle = E_x^{\,n}\,|\varphi_n\rangle$$
$$H_y\,|\chi_p\rangle = E_y^{\,p}\,|\chi_p\rangle$$
$$H_z\,|\omega_r\rangle = E_z^{\,r}\,|\omega_r\rangle \tag{F-43}$$

The eigenvalues of $H$ are then all of the form:

$$E^{\,n,p,r} = E_x^{\,n} + E_y^{\,p} + E_z^{\,r} \tag{F-44}$$

and an eigenvector corresponding to $E^{\,n,p,r}$ is the tensor product $|\varphi_n\rangle|\chi_p\rangle|\omega_r\rangle$; the wave function associated with this vector is the product:

$$\varphi_n(x)\,\chi_p(y)\,\omega_r(z) = \langle x|\varphi_n\rangle\,\langle y|\chi_p\rangle\,\langle z|\omega_r\rangle$$

This is the type of situation that was considered in complement F_I (§ 2) for the justification of the study of one-dimensional models. There, we were dealing with differential operators acting on wave functions:

$$H = -\frac{\hbar^2}{2m}\,\Delta + V(\mathbf r) \tag{F-45}$$

This equation can be decomposed as in (F-42) in the particular case where the potential can be written:

$$V(\mathbf r) = V_1(x) + V_2(y) + V_3(z) \tag{F-46}$$
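As a finite-dimensional illustration of (F-42) to (F-44), the following sketch uses small random Hermitian matrices as stand-ins for $H_x$ and $H_y$ (they are hypothetical, not taken from the text) and checks that the spectrum of the sum of their extensions is made up of all sums of individual eigenvalues, with tensor-product eigenvectors. It assumes NumPy is available.

```python
import numpy as np

rng = np.random.default_rng(0)

def random_hermitian(n):
    """Random n x n Hermitian matrix standing in for an observable."""
    m = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
    return (m + m.conj().T) / 2

# Hypothetical "one-dimensional" Hamiltonians acting in E_x and E_y.
Hx, Hy = random_hermitian(3), random_hermitian(4)
Ix, Iy = np.eye(3), np.eye(4)

# Extensions to the product space and their sum, as in (F-42).
H = np.kron(Hx, Iy) + np.kron(Ix, Hy)

Ex, phi = np.linalg.eigh(Hx)
Ey, chi = np.linalg.eigh(Hy)

# All sums E_x^n + E_y^p, as in (F-44), compared with the spectrum of H.
sums = np.sort((Ex[:, None] + Ey[None, :]).ravel())
spectrum = np.linalg.eigvalsh(H)

# The tensor product of two eigenvectors is an eigenvector of H.
n, p = 1, 2
ket = np.kron(phi[:, n], chi[:, p])
residual = np.linalg.norm(H @ ket - (Ex[n] + Ey[p]) * ket)
```

The Kronecker product plays the role of the tensor product here; the same check extends to three factors at the cost of larger matrices.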

b. STATES OF A TWO-PARTICLE SYSTEM 

Consider a physical system which is made up of two (spinless) particles. We 
shall distinguish between them by numbering them (1) and (2). To describe the 
system quantum mechanically, we can generalize the concept of a wave function, 
introduced for the case of one particle. A state of the system can be characterized, at a 
given time, by a function of six spatial variables $\psi(\mathbf r_1, \mathbf r_2) = \psi(x_1, y_1, z_1; x_2, y_2, z_2)$.
The probabilistic interpretation of such a two-particle wave function is the following: the probability $d\mathscr{P}(\mathbf r_1, \mathbf r_2)$, at the given time, of finding particle (1) in the volume $d^3r_1 = dx_1\,dy_1\,dz_1$ situated at the point $\mathbf r_1$, and particle (2) in the volume $d^3r_2 = dx_2\,dy_2\,dz_2$ about $\mathbf r_2$, is:

$$d\mathscr{P}(\mathbf r_1, \mathbf r_2) = C\,|\psi(\mathbf r_1, \mathbf r_2)|^2\,d^3r_1\,d^3r_2 \tag{F-47}$$

The normalization constant $C$ is obtained by imposing the condition that the total probability must be equal to 1 (conservation of the number of particles; cf. § B-2 of chapter I):

$$\frac{1}{C} = \int d^3r_1\,d^3r_2\;|\psi(\mathbf r_1, \mathbf r_2)|^2 \tag{F-48}$$

In the state space $\mathscr{E}_{\mathbf r_1}$ of particle (1), the $\{|\mathbf r_1\rangle\}$ representation and the observables $X_1, Y_1, Z_1$ can be defined. Similarly, in the state space $\mathscr{E}_{\mathbf r_2}$ of particle (2), we introduce the $\{|\mathbf r_2\rangle\}$ representation and the observables $X_2, Y_2, Z_2$. Take the tensor product:

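$$\mathscr{E}_{\mathbf r_1 \mathbf r_2} = \mathscr{E}_{\mathbf r_1} \otimes \mathscr{E}_{\mathbf r_2} \tag{F-49}$$

The set of vectors:

$$|\mathbf r_1, \mathbf r_2\rangle = |\mathbf r_1\rangle\,|\mathbf r_2\rangle \tag{F-50}$$

forms a basis in $\mathscr{E}_{\mathbf r_1 \mathbf r_2}$. Consequently, every ket $|\psi\rangle$ of this space can be written:

$$|\psi\rangle = \int d^3r_1\,d^3r_2\;\psi(\mathbf r_1, \mathbf r_2)\,|\mathbf r_1, \mathbf r_2\rangle \tag{F-51}$$

with

$$\psi(\mathbf r_1, \mathbf r_2) = \langle \mathbf r_1, \mathbf r_2|\psi\rangle \tag{F-52}$$

Moreover, the square of the norm of $|\psi\rangle$ is equal to:

$$\langle\psi|\psi\rangle = \int d^3r_1\,d^3r_2\;|\psi(\mathbf r_1, \mathbf r_2)|^2 \tag{F-53}$$

For it to be finite, $\psi(\mathbf r_1, \mathbf r_2)$ must be square-integrable. Therefore, a wave function $\psi(\mathbf r_1, \mathbf r_2)$ is associated with each ket of $\mathscr{E}_{\mathbf r_1 \mathbf r_2}$: the state space of a two-particle system is the tensor product of the spaces which correspond to each of the particles.
A C.S.C.O. is obtained in $\mathscr{E}_{\mathbf r_1 \mathbf r_2}$ by joining, for example, $X_1, Y_1, Z_1$ and $X_2, Y_2, Z_2$.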

Assume that the state of the system is described by a tensor product ket:

$$|\psi\rangle = |\varphi_1\rangle\,|\chi_2\rangle \tag{F-54}$$

The corresponding wave function can then be factorized:

$$\psi(\mathbf r_1, \mathbf r_2) = \langle \mathbf r_1, \mathbf r_2|\psi\rangle = \langle \mathbf r_1|\varphi_1\rangle\,\langle \mathbf r_2|\chi_2\rangle = \varphi_1(\mathbf r_1)\,\chi_2(\mathbf r_2) \tag{F-55}$$

In this case, one says that there is no correlation between the two particles. We shall analyze later (complement D_III) the physical consequences of such a situation.

The preceding can be generalized : when a physical system is composed of 
the union of two or several simpler systems, its state space is the tensor product 
of the spaces which correspond to each of the component systems. 


References and suggestions for further reading: 

Section 10 of the bibliography contains references to a certain number of 
mathematical texts, classified by subject. Under each heading, they are listed as much as 
possible in order of increasing difficulty. See also the quantum mechanics texts 
(sections 1 and 2 of the bibliography), which treat the mathematical problems at many 
different levels. They also contain other references. 

For a very simple approach to the fundamental mathematical concepts needed to 
understand chapter II (vector spaces, operators, diagonalization of matrices, etc.), 
the reader can consult, for example : Arfken (10.4), chap. 4; Bak and Lichtenberg (10.3), 
chap. I; Bass (10.1), vol. I, chap. II to V. A more explicit application to quantum 
mechanics can be found in Jackson (10.5) (see, in particular, chap. 5), Butkov (10.8), 
chap. 10 (finite-dimensional linear spaces) and chap. 11 (infinite-dimensional vector 
spaces, spaces of functions). See also Meijer and Bauer (2.18), chap. 1, particularly the 
table at the end of this chapter. 

COMPLEMENTS OF CHAPTER II

A_II: THE SCHWARZ INEQUALITY
B_II: REVIEW OF SOME USEFUL PROPERTIES OF LINEAR OPERATORS
C_II: UNITARY OPERATORS

A_II, B_II, C_II: review of some definitions and useful mathematical results (elementary level) intended for readers unfamiliar with these concepts; will serve as a reference later (especially B_II).

D_II: A MORE DETAILED STUDY OF THE { | r > } AND { | p > } REPRESENTATIONS
E_II: SOME GENERAL PROPERTIES OF TWO OBSERVABLES, Q AND P, WHOSE COMMUTATOR IS EQUAL TO iħ

D_II, E_II: complete § E of chapter II:
D_II: remains at the level of chapter II and can be read immediately after it.
E_II: adopts a more general and slightly more formal point of view. Introduces, in particular, the translation operator. May be reserved for later study.

F_II: THE PARITY OPERATOR

F_II: discussion of the parity operator, particularly important in quantum mechanics; at the same time, a simple illustration of the concepts of chapter II; recommended for these two reasons.

G_II: AN APPLICATION OF THE PROPERTIES OF THE TENSOR PRODUCT: THE TWO-DIMENSIONAL INFINITE WELL

G_II: a simple application of the tensor product (§ F of chapter II); can be considered as a worked exercise.

H_II: EXERCISES

H_II: solutions are given for exercises 11 and 12; their aim is to familiarize the reader with the properties of commuting observables and the concept of a C.S.C.O. in a very simple special case. It is recommended that these exercises be done during the reading of § D-3 of chapter II.
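
Complement A_II

THE SCHWARZ INEQUALITY


For any ket $|\psi\rangle$ belonging to the state space $\mathscr{E}$, we have:

$$\langle\psi|\psi\rangle \ \text{real}, \ \geq 0 \tag{1}$$

$\langle\psi|\psi\rangle$ being equal to zero only when $|\psi\rangle$ is the null vector [cf. equation (B-12) of chapter II]. Using inequality (1), we shall derive the Schwarz inequality, which states that, if $|\varphi_1\rangle$ and $|\varphi_2\rangle$ are any arbitrary vectors of $\mathscr{E}$, then:

$$|\langle\varphi_1|\varphi_2\rangle|^2 \leq \langle\varphi_1|\varphi_1\rangle\,\langle\varphi_2|\varphi_2\rangle \tag{2}$$

the equality being realized if and only if $|\varphi_1\rangle$ and $|\varphi_2\rangle$ are proportional.

Given $|\varphi_1\rangle$ and $|\varphi_2\rangle$, consider the ket $|\psi\rangle$ defined by:

$$|\psi\rangle = |\varphi_1\rangle + \lambda\,|\varphi_2\rangle \tag{3}$$

where $\lambda$ is an arbitrary parameter. Whatever $\lambda$ may be:

$$\langle\psi|\psi\rangle = \langle\varphi_1|\varphi_1\rangle + \lambda\,\langle\varphi_1|\varphi_2\rangle + \lambda^*\,\langle\varphi_2|\varphi_1\rangle + \lambda\lambda^*\,\langle\varphi_2|\varphi_2\rangle \geq 0 \tag{4}$$

Let us choose for $\lambda$ the value:

$$\lambda = -\frac{\langle\varphi_2|\varphi_1\rangle}{\langle\varphi_2|\varphi_2\rangle} \tag{5}$$

In (4), the second and third terms of the right-hand side are then equal, and opposite in value to the fourth term, so that (4) reduces to:

$$\langle\varphi_1|\varphi_1\rangle - \frac{\langle\varphi_1|\varphi_2\rangle\,\langle\varphi_2|\varphi_1\rangle}{\langle\varphi_2|\varphi_2\rangle} \geq 0 \tag{6}$$

Since $\langle\varphi_2|\varphi_2\rangle$ is positive, we can multiply this inequality by $\langle\varphi_2|\varphi_2\rangle$, to obtain:

$$\langle\varphi_1|\varphi_1\rangle\,\langle\varphi_2|\varphi_2\rangle \geq \langle\varphi_1|\varphi_2\rangle\,\langle\varphi_2|\varphi_1\rangle \tag{7}$$

which is precisely (2). In (7), the equality can only be realized if $\langle\psi|\psi\rangle = 0$, that is, according to (3), if $|\varphi_1\rangle = -\lambda\,|\varphi_2\rangle$. The kets $|\varphi_1\rangle$ and $|\varphi_2\rangle$ are then proportional.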
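The argument above can be checked numerically. This is a minimal sketch assuming NumPy, with random complex vectors standing in for the kets (the names `random_ket` and `schwarz_gap` are introduced here for illustration only). It evaluates the difference between the two sides of (2) and also verifies that the choice (5) of $\lambda$ gives the value of $\langle\psi|\psi\rangle$ stated in (6).

```python
import numpy as np

rng = np.random.default_rng(1)

def random_ket(n):
    """Random complex vector standing in for a ket of an n-dimensional space."""
    return rng.normal(size=n) + 1j * rng.normal(size=n)

def schwarz_gap(phi1, phi2):
    """<phi1|phi1><phi2|phi2> - |<phi1|phi2>|^2, which (2) says is >= 0."""
    overlap = np.vdot(phi1, phi2)  # np.vdot conjugates its first argument
    return (np.vdot(phi1, phi1) * np.vdot(phi2, phi2)).real - abs(overlap) ** 2

phi1, phi2 = random_ket(5), random_ket(5)
gap_generic = schwarz_gap(phi1, phi2)
gap_proportional = schwarz_gap(phi1, (2 - 3j) * phi1)

# With lambda chosen as in (5), <psi|psi> equals the left-hand side of (6).
lam = -np.vdot(phi2, phi1) / np.vdot(phi2, phi2)
psi = phi1 + lam * phi2
norm_sq = np.vdot(psi, psi).real
lhs_of_6 = gap_generic / np.vdot(phi2, phi2).real
```

For proportional vectors the gap vanishes up to rounding error, which is the equality case discussed in the text.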


References: 

Bass I (10.1), § 5-3; Arfken (10.4), § 9-4.




Complement B_II

REVIEW OF SOME USEFUL PROPERTIES OF LINEAR OPERATORS


1. Trace of an operator
   a. Definition
   b. The trace is invariant
   c. Important properties
2. Commutator algebra
   a. Definition
   b. Properties
3. Restriction of an operator to a subspace
4. Functions of operators
   a. Definition; simple properties
   b. An important example: the potential operator
   c. Commutators involving functions of operators
5. Differentiation of an operator
   a. Definition
   b. Differentiation rules
   c. Examples
   d. An application: a useful formula

The aim of this complement is to review a certain number of definitions and useful 
properties of linear operators. 


1. Trace of an operator 

a. DEFINITION 

The trace of an operator A, written Tr A, is the sum of its diagonal matrix 
elements. 

When a discrete orthonormal basis, $\{|u_i\rangle\}$, is chosen for the space $\mathscr{E}$, one has, by definition:

$$\mathrm{Tr}\,A = \sum_i \langle u_i|A|u_i\rangle \tag{1}$$

For the case of a continuous orthonormal basis $\{|w_\alpha\rangle\}$, one has:

$$\mathrm{Tr}\,A = \int d\alpha\;\langle w_\alpha|A|w_\alpha\rangle \tag{2}$$

When $\mathscr{E}$ is an infinite-dimensional space, the trace of the operator $A$ is defined only if expressions (1) and (2) converge.

b. THE TRACE IS INVARIANT

The sum of the diagonal elements of the matrix which represents an operator $A$ in an arbitrary basis does not depend on this basis.

Let us derive this property for the case of a change from one discrete orthonormal basis $\{|u_i\rangle\}$ to another discrete orthonormal basis $\{|t_k\rangle\}$. We have:

$$\sum_i \langle u_i|A|u_i\rangle = \sum_i \langle u_i|\Big[\sum_k |t_k\rangle\langle t_k|\Big]A|u_i\rangle \tag{3}$$

(where we have used the closure relation for the $|t_k\rangle$ states). The right-hand side of (3) is equal to:

$$\sum_{i,k} \langle u_i|t_k\rangle\,\langle t_k|A|u_i\rangle = \sum_{i,k} \langle t_k|A|u_i\rangle\,\langle u_i|t_k\rangle \tag{4}$$

(since it is possible to change the order of two numbers in a product). We can then replace $\sum_i |u_i\rangle\langle u_i|$ in (4) by $\mathbb{1}$ (closure relation for the $|u_i\rangle$ states), and we obtain, finally:

$$\sum_i \langle u_i|A|u_i\rangle = \sum_k \langle t_k|A|t_k\rangle \tag{5}$$

We have therefore demonstrated the property of invariance for this case.

COMMENT:

If the operator $A$ is an observable, $\mathrm{Tr}\,A$ can therefore be calculated in a basis of eigenvectors of $A$. The diagonal matrix elements are then the eigenvalues $a_n$ of $A$ (degree of degeneracy $g_n$) and the trace can be written:

$$\mathrm{Tr}\,A = \sum_n g_n\,a_n \tag{6}$$


c. IMPORTANT PROPERTIES 

$$\mathrm{Tr}\,AB = \mathrm{Tr}\,BA \tag{7-a}$$

$$\mathrm{Tr}\,ABC = \mathrm{Tr}\,BCA = \mathrm{Tr}\,CAB \tag{7-b}$$

In general, the trace of the product of any number of operators is invariant when a cyclic permutation is performed on these operators.

Let us prove, for example, relation (7-a):

$$\mathrm{Tr}\,AB = \sum_i \langle u_i|AB|u_i\rangle = \sum_{i,j} \langle u_i|A|u_j\rangle\,\langle u_j|B|u_i\rangle = \sum_{i,j} \langle u_j|B|u_i\rangle\,\langle u_i|A|u_j\rangle = \sum_j \langle u_j|BA|u_j\rangle = \mathrm{Tr}\,BA \tag{8}$$

(twice using the closure relation on the $\{|u_i\rangle\}$ basis). Relation (7-a) is thus proved; its generalization (7-b) presents no difficulty.
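The invariance (5) and the cyclic property (7) are easy to test on matrices. The sketch below assumes NumPy and uses random complex matrices as stand-ins for the operators; the unitary change of basis is taken from a QR decomposition, which is a convenience of this example and not part of the text.

```python
import numpy as np

rng = np.random.default_rng(2)

def random_matrix(n):
    """Random complex n x n matrix standing in for an arbitrary operator."""
    return rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))

A, B, C = (random_matrix(4) for _ in range(3))

# Columns of Q form another orthonormal basis {|t_k>}; the matrix of A
# in that basis has elements <t_k|A|t_l> = (Q^dagger A Q)_{kl}.
Q, _ = np.linalg.qr(random_matrix(4))
A_new_basis = Q.conj().T @ A @ Q

trace_old = np.trace(A)
trace_new = np.trace(A_new_basis)

# Cyclic permutations of a product leave the trace unchanged, as in (7-b).
cyclic = [np.trace(A @ B @ C), np.trace(B @ C @ A), np.trace(C @ A @ B)]
# A non-cyclic reordering generally gives a different value.
non_cyclic = np.trace(A @ C @ B)
```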

2. Commutator algebra

a. DEFINITION

The commutator $[A, B]$ of two operators is, by definition:

$$[A, B] = AB - BA \tag{9}$$

b. PROPERTIES

$$[A, B] = -[B, A] \tag{10}$$

$$[A, (B + C)] = [A, B] + [A, C] \tag{11}$$

$$[A, BC] = [A, B]\,C + B\,[A, C] \tag{12}$$

$$[A, [B, C]] + [B, [C, A]] + [C, [A, B]] = 0 \tag{13}$$

$$[A, B]^\dagger = [B^\dagger, A^\dagger] \tag{14}$$


The derivation of these properties is straightforward : it suffices to compare both 
sides of each equation after having written them out explicitly. 
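Writing out both sides explicitly can also be delegated to a short numerical check. This sketch assumes NumPy and random complex matrices as stand-ins for $A$, $B$, $C$; the helper names `comm` and `dagger` are introduced here for illustration.

```python
import numpy as np

rng = np.random.default_rng(3)

def random_matrix(n):
    return rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))

def comm(a, b):
    """Commutator [a, b] = ab - ba, definition (9)."""
    return a @ b - b @ a

def dagger(m):
    """Adjoint (conjugate transpose) of a matrix."""
    return m.conj().T

A, B, C = (random_matrix(4) for _ in range(3))

checks = {
    "antisymmetry (10)": np.allclose(comm(A, B), -comm(B, A)),
    "linearity (11)": np.allclose(comm(A, B + C), comm(A, B) + comm(A, C)),
    "product rule (12)": np.allclose(comm(A, B @ C),
                                     comm(A, B) @ C + B @ comm(A, C)),
    "Jacobi identity (13)": np.allclose(
        comm(A, comm(B, C)) + comm(B, comm(C, A)) + comm(C, comm(A, B)), 0),
    "adjoint (14)": np.allclose(dagger(comm(A, B)),
                                comm(dagger(B), dagger(A))),
}
```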


3. Restriction of an operator to a subspace 

Let $P_q$ be the projector onto the $q$-dimensional subspace $\mathscr{E}_q$ spanned by the $q$ orthonormal vectors $|\varphi_i\rangle$:

$$P_q = \sum_{i=1}^{q} |\varphi_i\rangle\langle\varphi_i| \tag{15}$$

By definition, the restriction $A_q$ of the operator $A$ to the subspace $\mathscr{E}_q$ is:

$$A_q = P_q\,A\,P_q \tag{16}$$

If $|\psi\rangle$ is an arbitrary ket, it follows from this definition that:

$$A_q\,|\psi\rangle = P_q\,A\,|\psi_q\rangle \tag{17}$$

where:

$$|\psi_q\rangle = P_q\,|\psi\rangle \tag{18}$$

is the orthogonal projection of $|\psi\rangle$ onto $\mathscr{E}_q$. Consequently, to make $A_q$ act on an arbitrary ket $|\psi\rangle$, one begins by projecting this ket onto $\mathscr{E}_q$; then one lets the operator $A$ act on this projection, retaining only the projection in $\mathscr{E}_q$ of the resulting ket. The operator $A_q$, which transforms any ket of $\mathscr{E}_q$ into a ket belonging to this same subspace, is therefore an operator whose action has been restricted to $\mathscr{E}_q$.

What can be said about the matrix which represents $A_q$? Let us choose a basis $\{|u_k\rangle\}$ whose first $q$ vectors belong to $\mathscr{E}_q$ (they are, for example, the $|\varphi_i\rangle$), the others belonging to the supplementary subspace. We have:

$$\langle u_i|A_q|u_j\rangle = \langle u_i|P_q\,A\,P_q|u_j\rangle \tag{19}$$

that is:

$$\langle u_i|A_q|u_j\rangle = \begin{cases} \langle u_i|A|u_j\rangle & \text{if } i, j \leq q \\ 0 & \text{if one of the two indices } i \text{ or } j \text{ is greater than } q \end{cases} \tag{20}$$

Therefore, the matrix which represents $A_q$ is, as it were, "cut out" of the one which represents $A$. One retains only the matrix elements of $A$ associated with basis vectors $|u_i\rangle$ and $|u_j\rangle$, both belonging to $\mathscr{E}_q$, the other matrix elements being replaced by zeros.
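The "cutting out" described above can be seen directly on a small matrix. In this sketch (NumPy assumed; the dimensions `n = 5`, `q = 2` and the random matrix are arbitrary choices for illustration), the basis is taken so that its first `q` vectors span the subspace, as in the text.

```python
import numpy as np

rng = np.random.default_rng(4)
n, q = 5, 2

# Arbitrary operator A, written in a basis whose first q vectors span E_q.
A = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))

# Projector onto E_q in this basis: ones on the first q diagonal entries.
P_q = np.zeros((n, n))
P_q[:q, :q] = np.eye(q)

# Restriction of A to E_q, definition (16).
A_q = P_q @ A @ P_q

# Acting on an arbitrary ket: project, apply A, project again, as in (17)-(18).
psi = rng.normal(size=n) + 1j * rng.normal(size=n)
psi_q = P_q @ psi
same_action = np.allclose(A_q @ psi, P_q @ (A @ psi_q))
```

The upper-left `q x q` block of `A_q` coincides with that of `A`, and every other entry is zero, which is the content of (20).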


4. Functions of operators 

a. DEFINITION; SIMPLE PROPERTIES 

Consider an arbitrary linear operator $A$. It is not difficult to define the operator $A^n$: it is the operator which corresponds to $n$ successive applications of the operator $A$. The definition of the operator $A^{-1}$, the inverse of $A$, is also well known: $A^{-1}$ is the operator (if it exists) which satisfies the relations:

$$A^{-1}A = AA^{-1} = \mathbb{1} \tag{21}$$

How can we define, in a more general way, an arbitrary function of an operator? To do this, let us consider a function $F$ of a variable $z$. Assume that, in a certain domain, $F$ can be expanded in a power series in $z$:

$$F(z) = \sum_{n=0}^{\infty} f_n\,z^n \tag{22}$$

By definition, the corresponding function of the operator $A$ is the operator $F(A)$ defined by a series which has the same coefficients $f_n$:

$$F(A) = \sum_{n=0}^{\infty} f_n\,A^n \tag{23}$$

For example, the operator $e^A$ is defined by:

$$e^A = \sum_{n=0}^{\infty} \frac{A^n}{n!} = \mathbb{1} + A + \frac{A^2}{2!} + \dots + \frac{A^n}{n!} + \dots \tag{24}$$

We shall not consider the problems concerning the convergence of the series (23), which depends on the eigenvalues of $A$ and on the radius of convergence of the series (22).

Note that if $F(z)$ is a real function, the coefficients $f_n$ are real. If, moreover, $A$ is Hermitian, we see from (23) that $F(A)$ is Hermitian.

Let $|\varphi_a\rangle$ be an eigenvector of $A$ with eigenvalue $a$:

$$A\,|\varphi_a\rangle = a\,|\varphi_a\rangle \tag{25}$$

Applying the operator $n$ times in succession, we obtain:

$$A^n\,|\varphi_a\rangle = a^n\,|\varphi_a\rangle \tag{26}$$


Now let us apply series (23) to $|\varphi_a\rangle$; we obtain:

$$F(A)\,|\varphi_a\rangle = \sum_{n=0}^{\infty} f_n\,a^n\,|\varphi_a\rangle = F(a)\,|\varphi_a\rangle \tag{27}$$

This leads to the following rule: when $|\varphi_a\rangle$ is an eigenvector of $A$ with the eigenvalue $a$, $|\varphi_a\rangle$ is also an eigenvector of $F(A)$, with the eigenvalue $F(a)$.

This property leads to a second definition of a function of an operator. Let us consider a diagonalizable operator $A$ (this is always the case if $A$ is an observable), and let us choose a basis where the matrix associated with $A$ is actually diagonal (its elements are then the eigenvalues $a_i$ of $A$). $F(A)$ is, by definition, the operator which is represented, in this same basis, by the diagonal matrix whose elements are $F(a_i)$.

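For example, if $\sigma_z$ is the matrix:

$$\sigma_z = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} \tag{28}$$

it follows directly that:

$$e^{\sigma_z} = \begin{pmatrix} e & 0 \\ 0 & 1/e \end{pmatrix} \tag{29}$$
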

COMMENT:

Care must be taken, when functions of operators are used, with respect to the order of the operators. For example, the operators $e^A e^B$, $e^B e^A$, and $e^{A+B}$ are not, in general, equal when $A$ and $B$ are operators and not numbers. Consider:

$$e^A e^B = \sum_p \frac{A^p}{p!}\,\sum_q \frac{B^q}{q!} = \sum_{p,q} \frac{A^p B^q}{p!\,q!} \tag{30}$$

$$e^B e^A = \sum_q \frac{B^q}{q!}\,\sum_p \frac{A^p}{p!} = \sum_{p,q} \frac{B^q A^p}{p!\,q!} \tag{31}$$

$$e^{A+B} = \sum_n \frac{(A+B)^n}{n!} \tag{32}$$

When $A$ and $B$ are arbitrary, the right-hand sides of (30), (31) and (32) have no reason to be equal (see exercise 7 of complement H_II). However, when $A$ and $B$ commute, we have:

$$[A, B] = 0 \;\Longrightarrow\; e^A e^B = e^B e^A = e^{A+B} \tag{33}$$
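A short numerical experiment makes the role of operator ordering concrete. This is a sketch assuming NumPy; the two non-commuting Hermitian matrices and the commuting pair are arbitrary choices made for illustration, and the exponential is computed by diagonalization (valid here because all the matrices involved are Hermitian).

```python
import numpy as np

def exp_hermitian(H):
    """e^H for a Hermitian matrix H, via its eigenvalues and eigenvectors."""
    h, V = np.linalg.eigh(H)
    return V @ np.diag(np.exp(h)) @ V.conj().T

# Two Hermitian matrices that do not commute.
A = np.array([[0, 1], [1, 0]], dtype=complex)
B = np.array([[1, 0], [0, -1]], dtype=complex)

eA_eB = exp_hermitian(A) @ exp_hermitian(B)
eB_eA = exp_hermitian(B) @ exp_hermitian(A)
e_sum = exp_hermitian(A + B)

# A commuting pair: B2 is a polynomial in B, so [B, B2] = 0.
B2 = 2 * B + np.eye(2)
commuting_product = exp_hermitian(B) @ exp_hermitian(B2)
commuting_sum = exp_hermitian(B + B2)
```

For the non-commuting pair the three operators of (30) to (32) all differ, while for the commuting pair relation (33) holds to rounding error.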

b. AN IMPORTANT EXAMPLE: THE POTENTIAL OPERATOR

In one-dimensional problems, we shall often have to consider "potential" operators $V(X)$ (so called because they correspond to the classical potential energy $V(x)$ of a particle placed in a force field), where $V(X)$ is a function of the position operator $X$.

It follows from the preceding section that $V(X)$ has as eigenvectors the eigenvectors $|x\rangle$ of $X$, and we have simply:

$$V(X)\,|x\rangle = V(x)\,|x\rangle \tag{34}$$

The matrix elements of $V(X)$ in the $\{|x\rangle\}$ representation are therefore:

$$\langle x|V(X)|x'\rangle = V(x)\,\delta(x - x') \tag{35}$$

Applying (34) and using the fact that $V(X)$ is Hermitian (the function $V(x)$ is real), we obtain:

$$\langle x|V(X)|\psi\rangle = V(x)\,\langle x|\psi\rangle = V(x)\,\psi(x) \tag{36}$$

This equation shows that in the $\{|x\rangle\}$ representation, the action of the operator $V(X)$ is simply multiplication by $V(x)$.

The generalization of (34), (35) and (36) to three-dimensional problems can be performed without difficulty; in this case, we obtain:

$$V(\mathbf R)\,|\mathbf r\rangle = V(\mathbf r)\,|\mathbf r\rangle \tag{37}$$

$$\langle \mathbf r|V(\mathbf R)|\mathbf r'\rangle = V(\mathbf r)\,\delta(\mathbf r - \mathbf r') \tag{38}$$

$$\langle \mathbf r|V(\mathbf R)|\psi\rangle = V(\mathbf r)\,\psi(\mathbf r) \tag{39}$$

c. COMMUTATORS INVOLVING FUNCTIONS OF OPERATORS 

Definition (23) shows that $A$ commutes with every function of $A$:

$$[A, F(A)] = 0 \tag{40}$$

Similarly, if $A$ and $B$ commute, so do $F(A)$ and $B$:

$$[B, A] = 0 \;\Longrightarrow\; [B, F(A)] = 0 \tag{41}$$

What will be the commutator of an operator with a function of another operator which does not commute with it? We shall restrict ourselves here to the case of the $X$ and $P$ operators, whose commutator is equal to:

$$[X, P] = i\hbar \tag{42}$$

Using relation (12), we can calculate:

$$[X, P^2] = [X, PP] = [X, P]\,P + P\,[X, P] = 2i\hbar\,P \tag{43}$$

More generally, let us show that:

$$[X, P^n] = i\hbar\,n\,P^{n-1} \tag{44}$$

If we assume that this equation is verified, we obtain:

$$[X, P^{n+1}] = [X, PP^n] = [X, P]\,P^n + P\,[X, P^n] = i\hbar\,P^n + i\hbar\,n\,P\,P^{n-1} = i\hbar\,(n+1)\,P^n \tag{45}$$

Relation (44) is therefore established by recurrence.

Now let us calculate the commutator $[X, F(P)]$:

$$[X, F(P)] = \sum_n [X, f_n P^n] = \sum_n i\hbar\,n\,f_n\,P^{n-1} \tag{46}$$

If $F'(z)$ denotes the derivative of the function $F(z)$, we recognize in (46) the definition of the operator $F'(P)$. Therefore:

$$[X, F(P)] = i\hbar\,F'(P) \tag{47}$$

An analogous argument would have enabled us to obtain the symmetric relation:

$$[P, G(X)] = -i\hbar\,G'(X) \tag{48}$$

COMMENTS:

(i) The preceding argument is based on the fact that $F(P)$ (or $G(X)$) depends only on $P$ (or on $X$). It is more difficult to calculate a commutator such as $[X, \Phi(X, P)]$, where $\Phi(X, P)$ is an operator which depends on both $X$ and $P$: the difficulties arise from the fact that $X$ and $P$ do not commute.

(ii) Equations (47) and (48) can be generalized to the case of two operators $A$ and $B$ which both commute with their commutator. An argument modeled on the preceding one shows that, if we have:

$$[A, C] = [B, C] = 0 \tag{49}$$

with

$$C = [A, B] \tag{50}$$

then:

$$[A, F(B)] = [A, B]\,F'(B) \tag{51}$$

5. Differentiation of an operator

a. DEFINITION

Let $A(t)$ be an operator which depends on an arbitrary variable $t$. By definition, the derivative $\dfrac{dA}{dt}$ of $A(t)$ with respect to $t$ is given by the limit (if it exists):

$$\frac{dA}{dt} = \lim_{\Delta t \to 0} \frac{A(t + \Delta t) - A(t)}{\Delta t} \tag{52}$$

The matrix elements of $A(t)$ in an arbitrary basis of $t$-independent vectors $|u_i\rangle$ are functions of $t$:

$$\langle u_i|A|u_j\rangle = A_{ij}(t) \tag{53}$$

Let us call $\left(\dfrac{dA}{dt}\right)_{ij} = \langle u_i|\dfrac{dA}{dt}|u_j\rangle$ the matrix elements of $\dfrac{dA}{dt}$. It is easy to verify the relation:

$$\left(\frac{dA}{dt}\right)_{ij} = \frac{d}{dt}\,A_{ij} \tag{54}$$

Thus we obtain a very simple rule: to obtain the matrix elements representing $\dfrac{dA}{dt}$, all we must do is take the matrix representing $A$ and differentiate each of its elements (without changing their places).


b. DIFFERENTIATION RULES 

They are analogous to the ones for ordinary functions:

$$\frac{d}{dt}(F + G) = \frac{dF}{dt} + \frac{dG}{dt} \tag{55}$$

$$\frac{d}{dt}(FG) = \frac{dF}{dt}\,G + F\,\frac{dG}{dt} \tag{56}$$

Nevertheless, care must be taken not to modify the order of the operators in formula (56).

Let us prove, for example, the second of these equations. The matrix elements of $FG$ are:

$$\langle u_i|FG|u_j\rangle = \sum_k \langle u_i|F|u_k\rangle\,\langle u_k|G|u_j\rangle \tag{57}$$

We have seen that the matrix elements of $d(FG)/dt$ are the derivatives with respect to $t$ of those of $(FG)$. Thus we have, differentiating the right-hand side of (57):

$$\langle u_i|\frac{d}{dt}(FG)|u_j\rangle = \sum_k \left[\langle u_i|\frac{dF}{dt}|u_k\rangle\,\langle u_k|G|u_j\rangle + \langle u_i|F|u_k\rangle\,\langle u_k|\frac{dG}{dt}|u_j\rangle\right] = \langle u_i|\frac{dF}{dt}\,G + F\,\frac{dG}{dt}|u_j\rangle \tag{58}$$

This equation is valid for any $i$ and $j$. Formula (56) is thus established.

c. EXAMPLES

Let us calculate the derivative of the operator $e^{At}$. By definition, we have:

$$e^{At} = \sum_{n=0}^{\infty} \frac{(At)^n}{n!} \tag{59}$$

Differentiating the series term by term, we obtain:

$$\frac{d}{dt}\,e^{At} = \sum_{n=0}^{\infty} n\,\frac{t^{n-1}A^n}{n!} = A\left[\sum_{n=1}^{\infty} \frac{(At)^{n-1}}{(n-1)!}\right] = \left[\sum_{n=1}^{\infty} \frac{(At)^{n-1}}{(n-1)!}\right]A \tag{60}$$

We recognize inside the brackets the series which defines $e^{At}$ (taking as the summation index $p = n - 1$). The result is therefore:

$$\frac{d}{dt}\,e^{At} = A\,e^{At} = e^{At}\,A \tag{61}$$

In this simple case involving only one operator, it is unnecessary to pay attention to the order of the factors: $e^{At}$ and $A$ commute.

This is not the case if one is interested in differentiating an operator such as $e^{At}e^{Bt}$. Applying (56) and (61), we obtain:

$$\frac{d}{dt}\left(e^{At}e^{Bt}\right) = A\,e^{At}e^{Bt} + e^{At}\,B\,e^{Bt} \tag{62}$$

The right-hand side of this equation can be transformed into $e^{At}A\,e^{Bt} + e^{At}B\,e^{Bt}$ or $A\,e^{At}e^{Bt} + e^{At}e^{Bt}B$, for example. However, we can never obtain (unless, of course, $A$ and $B$ commute) an expression such as $(A + B)\,e^{At}e^{Bt}$. In this case, the order of the operators is therefore important.


COMMENT:

Even when the function involves only one operator, differentiation cannot always be performed according to the rules valid for ordinary functions. For example, when $A(t)$ has an arbitrary time-dependence, the derivative $\dfrac{d}{dt}\,e^{A(t)}$ is generally not equal to $\dfrac{dA}{dt}\,e^{A(t)}$. It can be seen by expanding $e^{A(t)}$ in a power series in $A(t)$ that $A(t)$ and $\dfrac{dA}{dt}$ must commute for this equality to hold.


d. AN APPLICATION: A USEFUL FORMULA 

Consider two operators $A$ and $B$ which, by hypothesis, both commute with their commutator. In this case, we shall derive the relation:

$$e^A\,e^B = e^{A+B}\,e^{\frac{1}{2}[A, B]} \tag{63}$$

(Glauber's formula).

Let us define the operator $F(t)$, a function of the real variable $t$, by:

$$F(t) = e^{At}\,e^{Bt} \tag{64}$$

We have:

$$\frac{dF}{dt} = A\,e^{At}e^{Bt} + e^{At}\,B\,e^{Bt} = \left(A + e^{At}\,B\,e^{-At}\right)F(t) \tag{65}$$

Since $A$ and $B$ commute with their commutator, formula (51) can be applied in order to calculate:

$$[e^{At}, B] = t\,[A, B]\,e^{At} \tag{66}$$

Therefore:

$$e^{At}\,B = B\,e^{At} + t\,[A, B]\,e^{At} \tag{67}$$

Multiply both sides of this equation on the right by $e^{-At}$. Substituting the relation so obtained into (65), we obtain:

$$\frac{dF}{dt} = \left(A + B + t\,[A, B]\right)F(t) \tag{68}$$

The operators $A + B$ and $[A, B]$ commute by hypothesis. We can therefore integrate the differential equation (68) as if $A + B$ and $[A, B]$ were numbers. This yields:

$$F(t) = F(0)\,e^{(A+B)t + \frac{1}{2}[A, B]t^2} \tag{69}$$

Setting $t = 0$, we see that $F(0) = \mathbb{1}$, and:

$$F(t) = e^{(A+B)t + \frac{1}{2}[A, B]t^2} \tag{70}$$

Let us then set $t = 1$; we obtain equation (63), which is thus proven.
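Glauber's formula can be verified exactly on matrices that satisfy its hypothesis. The sketch below assumes NumPy and uses two strictly upper-triangular $3 \times 3$ matrices (an arbitrary choice made for this example): their commutator has a single non-zero entry in the top-right corner and commutes with both of them, and all exponential series terminate after three terms.

```python
import numpy as np
from math import factorial

def expm_series(M, terms=10):
    """e^M from the power series; exact here because M is nilpotent."""
    result = np.zeros_like(M, dtype=float)
    power = np.eye(M.shape[0])
    for n in range(terms):
        result += power / factorial(n)
        power = power @ M
    return result

A = np.array([[0., 1., 2.], [0., 0., 3.], [0., 0., 0.]])
B = np.array([[0., 4., 5.], [0., 0., 6.], [0., 0., 0.]])
C = A @ B - B @ A                      # commutator [A, B]

# Hypothesis of the formula: A and B both commute with [A, B].
hypothesis_ok = np.allclose(A @ C, C @ A) and np.allclose(B @ C, C @ B)

lhs = expm_series(A) @ expm_series(B)              # e^A e^B
rhs = expm_series(A + B) @ expm_series(C / 2)      # e^{A+B} e^{[A,B]/2}
without_correction = expm_series(A + B)            # e^{A+B} alone
```

Dropping the factor $e^{\frac{1}{2}[A,B]}$ changes the top-right matrix element, so the correction is not negligible even in this small example.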


COMMENT:

When the operators $A$ and $B$ are arbitrary, equation (63) is not in general valid: it is necessary that both $A$ and $B$ commute with $[A, B]$. This condition may seem very restrictive. Actually, in quantum mechanics one often encounters operators whose commutator is a number: for example, $X$ and $P$, or the operators $a$ and $a^\dagger$ of the harmonic oscillator (cf. chap. V).


References : 

See the subsections “General texts” and “Linear algebra - Hilbert spaces” of 
section 10 of the bibliography. 

Complement C_II

UNITARY OPERATORS


1. General properties of unitary operators
   a. Definition; simple properties
   b. Unitary operators and change of bases
   c. Unitary matrices
   d. Eigenvalues and eigenvectors of a unitary operator
2. Unitary transformations of operators
3. The infinitesimal unitary operator

1. General properties of unitary operators 

a. DEFINITION; SIMPLE PROPERTIES 

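By definition, an operator $U$ is unitary if its inverse $U^{-1}$ is equal to its adjoint $U^\dagger$:

$$U^\dagger U = UU^\dagger = \mathbb{1} \tag{1}$$

Consider two arbitrary vectors $|\psi_1\rangle$ and $|\psi_2\rangle$ of $\mathscr{E}$, and their transforms $|\tilde\psi_1\rangle$ and $|\tilde\psi_2\rangle$ under the action of $U$:

$$|\tilde\psi_1\rangle = U\,|\psi_1\rangle$$
$$|\tilde\psi_2\rangle = U\,|\psi_2\rangle \tag{2}$$

Let us calculate the scalar product $\langle\tilde\psi_1|\tilde\psi_2\rangle$; we obtain:

$$\langle\tilde\psi_1|\tilde\psi_2\rangle = \langle\psi_1|U^\dagger U|\psi_2\rangle = \langle\psi_1|\psi_2\rangle \tag{3}$$

The unitary transformation associated with the operator $U$ therefore conserves the scalar product (and, consequently, the norm) in $\mathscr{E}$. When $\mathscr{E}$ is finite-dimensional, moreover, this property is characteristic of a unitary operator.

COMMENTS:

(i) If $A$ is a Hermitian operator, the operator $T = e^{iA}$ is unitary, since:

$$T^\dagger = e^{-iA^\dagger} = e^{-iA} \tag{4}$$

and therefore:

$$T^\dagger T = e^{-iA}\,e^{iA} = \mathbb{1}$$
$$T\,T^\dagger = e^{iA}\,e^{-iA} = \mathbb{1} \tag{5}$$

(obviously, $-iA$ commutes with $iA$).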

(ii) The product of two unitary operators is also unitary. If $U$ and $V$ are unitary, we have:

$$U^\dagger U = UU^\dagger = \mathbb{1}$$
$$V^\dagger V = VV^\dagger = \mathbb{1} \tag{6}$$

Let us now calculate:

$$(UV)^\dagger(UV) = V^\dagger U^\dagger U V = V^\dagger V = \mathbb{1}$$
$$(UV)(UV)^\dagger = U V V^\dagger U^\dagger = UU^\dagger = \mathbb{1} \tag{7}$$

These equations indeed show that the product operator $UV$ is unitary. This property, moreover, was foreseeable: when two transformations conserve the scalar product, so does the successive application of these two transformations.

(iii) In the ordinary three-dimensional space of real vectors, we are familiar with operators which conserve the norm and the scalar product: rotations, symmetry operations with respect to a point, to a plane, etc. In this case where the space is real, these operators are said to be orthogonal. Unitary operators constitute the generalization of orthogonal operators to complex spaces (with an arbitrary number of dimensions).
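Comments (i) and (ii) and the conservation of the scalar product can be illustrated numerically. This is a sketch assuming NumPy, with random Hermitian matrices standing in for $A$; the helper names are introduced for the example, and $e^{iA}$ is built from the eigenvalues and eigenvectors of $A$.

```python
import numpy as np

rng = np.random.default_rng(5)

def random_hermitian(n):
    m = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
    return (m + m.conj().T) / 2

def exp_i(A):
    """T = e^{iA} for Hermitian A, computed in the eigenbasis of A."""
    a, W = np.linalg.eigh(A)
    return W @ np.diag(np.exp(1j * a)) @ W.conj().T

def is_unitary(U):
    """Check both relations U^dagger U = 1 and U U^dagger = 1."""
    identity = np.eye(U.shape[0])
    return (np.allclose(U.conj().T @ U, identity)
            and np.allclose(U @ U.conj().T, identity))

U = exp_i(random_hermitian(4))
V = exp_i(random_hermitian(4))

psi1 = rng.normal(size=4) + 1j * rng.normal(size=4)
psi2 = rng.normal(size=4) + 1j * rng.normal(size=4)

# Scalar product before and after the transformation, as in (3).
before = np.vdot(psi1, psi2)
after = np.vdot(U @ psi1, U @ psi2)
```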

b. UNITARY OPERATORS AND CHANGE OF BASES 

α. Let $\{|v_i\rangle\}$ be an orthonormal basis of the state space $\mathscr{E}$, assumed to be discrete. Call $|\tilde v_i\rangle$ the transform of the vector $|v_i\rangle$ under the action of the operator $U$:

$$|\tilde v_i\rangle = U\,|v_i\rangle \tag{8}$$

Since the operator $U$ is unitary, we have:

$$\langle\tilde v_i|\tilde v_j\rangle = \langle v_i|v_j\rangle = \delta_{ij} \tag{9}$$

The $|\tilde v_i\rangle$ vectors are therefore orthonormal. Let us show that they constitute a basis of $\mathscr{E}$. To do so, consider an arbitrary vector $|\psi\rangle$ of $\mathscr{E}$. Since the set $\{|v_i\rangle\}$ constitutes a basis, the vector $U^\dagger|\psi\rangle$ can be expanded on the $|v_i\rangle$:

$$U^\dagger\,|\psi\rangle = \sum_i c_i\,|v_i\rangle \tag{10}$$

Applying the operator $U$ to this equation, we obtain:

$$UU^\dagger\,|\psi\rangle = \sum_i c_i\,U\,|v_i\rangle \tag{11}$$

and, therefore:

$$|\psi\rangle = \sum_i c_i\,|\tilde v_i\rangle \tag{12}$$

This equation expresses the fact that any vector $|\psi\rangle$ can be expanded on the vectors $|\tilde v_i\rangle$, which therefore constitute a basis. Thus we can state the following result: a necessary condition for an operator $U$ to be unitary is that the vectors of an orthonormal basis of $\mathscr{E}$, transformed by $U$, constitute another orthonormal basis.
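
β. Conversely, let us show that this condition is sufficient. By hypothesis, we then have:

$$|\tilde v_i\rangle = U\,|v_i\rangle$$
$$\langle\tilde v_i|\tilde v_j\rangle = \delta_{ij}$$
$$\sum_j |\tilde v_j\rangle\langle\tilde v_j| = \mathbb{1} \tag{13}$$

and therefore:

$$\langle v_j|\,U^\dagger = \langle\tilde v_j| \tag{14}$$

Let us calculate:

$$U^\dagger U\,|v_i\rangle = U^\dagger\,|\tilde v_i\rangle = \sum_j |v_j\rangle\langle v_j|U^\dagger|\tilde v_i\rangle = \sum_j |v_j\rangle\,\langle\tilde v_j|\tilde v_i\rangle = \sum_j |v_j\rangle\,\delta_{ij} = |v_i\rangle \tag{15}$$

Relation (15), which is valid for all $i$, expresses the fact that the operator $U^\dagger U$ is the identity operator. Let us show, in the same way, that $UU^\dagger = \mathbb{1}$. To do this, consider the action of $U^\dagger$ on a vector $|v_i\rangle$:

$$U^\dagger\,|v_i\rangle = \sum_j |v_j\rangle\langle v_j|U^\dagger|v_i\rangle = \sum_j |v_j\rangle\,\langle\tilde v_j|v_i\rangle \tag{16}$$

We then have:

$$UU^\dagger\,|v_i\rangle = \sum_j U\,|v_j\rangle\,\langle\tilde v_j|v_i\rangle = \sum_j |\tilde v_j\rangle\,\langle\tilde v_j|v_i\rangle = |v_i\rangle \tag{17}$$

We deduce from this that $UU^\dagger = \mathbb{1}$: the operator $U$ is therefore unitary.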

c. UNITARY MATRICES 

Let:

$$U_{ij} = \langle v_i|U|v_j\rangle \tag{18}$$

be the matrix elements of $U$. How can one see from the matrix representing $U$ if this operator is unitary?

Relation (1) gives us:

$$\langle v_i|U^\dagger U|v_j\rangle = \sum_k \langle v_i|U^\dagger|v_k\rangle\,\langle v_k|U|v_j\rangle \tag{19}$$

that is:

$$\sum_k U^*_{ki}\,U_{kj} = \delta_{ij} \tag{20}$$

When a matrix is unitary, the sum of the products of the elements of one column and the complex conjugates of the elements of another column is

- zero if the two columns are different,

- equal to 1 if they are not.

Let us cite some examples in which this rule can be easily verified. 


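EXAMPLES:

(i) The matrix which represents a rotation through an angle $\theta$ about $Oz$, in ordinary three-dimensional space:

$$R(\theta) = \begin{pmatrix} \cos\theta & -\sin\theta & 0 \\ \sin\theta & \cos\theta & 0 \\ 0 & 0 & 1 \end{pmatrix} \tag{21}$$

(ii) The rotation matrix in the state space of a spin 1/2 particle (cf. chap. IX):

$$R^{(1/2)}(\alpha, \beta, \gamma) = \begin{pmatrix} e^{-\frac{i}{2}(\alpha+\gamma)}\cos\dfrac{\beta}{2} & -e^{\frac{i}{2}(\gamma-\alpha)}\sin\dfrac{\beta}{2} \\[2mm] e^{\frac{i}{2}(\alpha-\gamma)}\sin\dfrac{\beta}{2} & e^{\frac{i}{2}(\alpha+\gamma)}\cos\dfrac{\beta}{2} \end{pmatrix} \tag{22}$$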
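The column rule (20) can be checked on both examples. The sketch below assumes NumPy; the spin 1/2 matrix is written in the standard Euler-angle form (an assumption of this example as far as its exact phase conventions are concerned), and the angles are arbitrary.

```python
import numpy as np

def rotation_z(theta):
    """Rotation through theta about Oz in ordinary three-dimensional space."""
    c, s = np.cos(theta), np.sin(theta)
    return np.array([[c, -s, 0.0],
                     [s,  c, 0.0],
                     [0.0, 0.0, 1.0]])

def rotation_spin_half(alpha, beta, gamma):
    """Spin 1/2 rotation matrix, standard Euler-angle form (assumed here)."""
    c, s = np.cos(beta / 2), np.sin(beta / 2)
    return np.array([
        [np.exp(-0.5j * (alpha + gamma)) * c, -np.exp(0.5j * (gamma - alpha)) * s],
        [np.exp(0.5j * (alpha - gamma)) * s,   np.exp(0.5j * (alpha + gamma)) * c],
    ])

def column_products(U):
    """Matrix whose (i, j) entry is sum_k U*_{ki} U_{kj}, as in (20)."""
    return U.conj().T @ U

gram_rotation = column_products(rotation_z(0.9))
gram_spin = column_products(rotation_spin_half(0.3, 1.1, -0.7))
```

Both products come out as the identity matrix: distinct columns give zero and a column paired with itself gives 1.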


d. EIGENVALUES AND EIGENVECTORS OF A UNITARY OPERATOR 

Let $|\psi_u\rangle$ be a normalized eigenvector of the unitary operator $U$ with eigenvalue $u$:

$$U\,|\psi_u\rangle = u\,|\psi_u\rangle \tag{23}$$

The square of the norm of the vector $U|\psi_u\rangle$ is:

$$\langle\psi_u|U^\dagger U|\psi_u\rangle = u^* u\,\langle\psi_u|\psi_u\rangle = u^* u \tag{24}$$

Since the unitary operator conserves the norm, we have, necessarily, $u^* u = 1$. The eigenvalues of a unitary operator must therefore be complex numbers of modulus 1:

$$u = e^{i\varphi_u} \quad \text{where } \varphi_u \text{ is real} \tag{25}$$

Consider two eigenvectors $|\psi_u\rangle$ and $|\psi_{u'}\rangle$ of $U$; we then have:

$$\langle\psi_u|\psi_{u'}\rangle = \langle\psi_u|U^\dagger U|\psi_{u'}\rangle = u^* u'\,\langle\psi_u|\psi_{u'}\rangle = e^{i(\varphi_{u'} - \varphi_u)}\,\langle\psi_u|\psi_{u'}\rangle \tag{26}$$

When the eigenvalues $u$ and $u'$ are different, we see from (26) that the scalar product $\langle\psi_u|\psi_{u'}\rangle$ is zero: two eigenvectors of a unitary operator corresponding to different eigenvalues are orthogonal.
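Both conclusions can be observed on a concrete unitary matrix. This sketch assumes NumPy and uses, as an arbitrary example, the rotation through 0.9 rad about $Oz$, whose three eigenvalues are distinct.

```python
import numpy as np

theta = 0.9
c, s = np.cos(theta), np.sin(theta)
# Rotation about Oz: a real orthogonal matrix, hence unitary.
U = np.array([[c, -s, 0.0],
              [s,  c, 0.0],
              [0.0, 0.0, 1.0]])

# Columns of `vectors` are normalized eigenvectors of U.
eigenvalues, vectors = np.linalg.eig(U)

moduli = np.abs(eigenvalues)
# Matrix of scalar products <psi_u|psi_u'> between the eigenvectors.
overlaps = vectors.conj().T @ vectors
```

The eigenvalues are $e^{\pm i\theta}$ and 1, all of modulus 1, and the matrix of scalar products between eigenvectors is the identity.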

2. Unitary transformations of operators

We saw in § 1-b that a unitary operator $U$ permits the construction, starting with one orthonormal basis $\{|v_i\rangle\}$ of $\mathscr{E}$, of another one, $\{|\tilde v_i\rangle\}$. In this section, we are going to define a transformation which acts, not on the vectors, but on the operators.

By definition, the transform $\tilde A$ of the operator $A$ will be the operator which, in the $\{|\tilde v_i\rangle\}$ basis, has the same matrix elements as the operator $A$ in the $\{|v_i\rangle\}$ basis:

$$\langle\tilde v_i|\tilde A|\tilde v_j\rangle = \langle v_i|A|v_j\rangle \tag{27}$$

Substituting (8) into this equation, we obtain:

$$\langle v_i|U^\dagger \tilde A\,U|v_j\rangle = \langle v_i|A|v_j\rangle \tag{28}$$

Since $i$ and $j$ are arbitrary, we have:

$$U^\dagger \tilde A\,U = A \tag{29}$$

or, multiplying this equation on the left by $U$ and on the right by $U^\dagger$:

$$\tilde A = U A\,U^\dagger \tag{30}$$

Equation (30) can be taken to be the definition of the transform $\tilde A$ of the operator $A$ by the unitary transformation $U$. In quantum mechanics, such transformations are often used: a first example is given in complement F_II of this chapter (§ 2-a).

How can the eigenvectors of $\tilde A$ be obtained from those of $A$? Let us consider an eigenvector $|\varphi_a\rangle$ of $A$, with an eigenvalue $a$:

$$A\,|\varphi_a\rangle = a\,|\varphi_a\rangle \tag{31}$$

Let $|\tilde\varphi_a\rangle$ be the transform of $|\varphi_a\rangle$ by the operator $U$: $|\tilde\varphi_a\rangle = U|\varphi_a\rangle$. We then have:

$$\tilde A\,|\tilde\varphi_a\rangle = (UAU^\dagger)\,U\,|\varphi_a\rangle = UA\,(U^\dagger U)\,|\varphi_a\rangle = UA\,|\varphi_a\rangle = a\,U\,|\varphi_a\rangle = a\,|\tilde\varphi_a\rangle \tag{32}$$

$|\tilde\varphi_a\rangle$ is therefore an eigenvector of $\tilde A$, with eigenvalue $a$. This can be generalized to the following rule: the eigenvectors of the transform $\tilde A$ of $A$ are the transforms $|\tilde\varphi_a\rangle$ of the eigenvectors $|\varphi_a\rangle$ of $A$; the eigenvalues are unchanged.

COMMENTS:

(i) The adjoint of the transform $\tilde A$ of $A$ by $U$ is the transform of $A^\dagger$ by $U$:

$$(\tilde A)^\dagger = (UAU^\dagger)^\dagger = UA^\dagger U^\dagger = \widetilde{A^\dagger} \tag{33}$$

In particular, it follows from this equation that, if $A$ is Hermitian, $\tilde A$ is also.

(ii) Analogously, we have:

$$(\tilde A)^2 = UAU^\dagger\,UAU^\dagger = UAAU^\dagger = \widetilde{A^2}$$

and, in general:

$$(\tilde A)^n = \widetilde{A^n} \tag{34}$$

Using definition (23) of complement B_II, we can easily show that:

$$\widetilde{F(A)} = F(\tilde A) \tag{35}$$

where $F(A)$ is a function of the operator $A$.
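The rules of this section can be checked on matrices. The sketch below assumes NumPy; the Hermitian matrix `A` and the unitary `U` (taken from a QR decomposition) are random stand-ins chosen for the example, and the function $F$ is taken to be the exponential.

```python
import numpy as np

rng = np.random.default_rng(6)
n = 4

m = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
A = (m + m.conj().T) / 2                       # Hermitian operator
U, _ = np.linalg.qr(rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n)))

def transform(op):
    """Transform of an operator by U, definition (30): U op U^dagger."""
    return U @ op @ U.conj().T

def exp_hermitian(H):
    h, W = np.linalg.eigh(H)
    return W @ np.diag(np.exp(h)) @ W.conj().T

A_t = transform(A)

# Eigenvalues are unchanged; eigenvectors are the transforms U|phi_a>.
a, phi = np.linalg.eigh(A)
a_t = np.linalg.eigvalsh(A_t)
k = 2
residual = np.linalg.norm(A_t @ (U @ phi[:, k]) - a[k] * (U @ phi[:, k]))

# (34) with n = 2 and (35) with F = exp.
square_rule = np.allclose(A_t @ A_t, transform(A @ A))
function_rule = np.allclose(exp_hermitian(A_t), transform(exp_hermitian(A)))
```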


3. The infinitesimal unitary operator 

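Let $U(\varepsilon)$ be a unitary operator which depends on an infinitely small real quantity $\varepsilon$; by hypothesis, $U(\varepsilon) \to \mathbb{1}$ when $\varepsilon \to 0$. Expand $U(\varepsilon)$ in a power series in $\varepsilon$:

$$U(\varepsilon) = \mathbb{1} + \varepsilon G + \dots \tag{36}$$

We then have:

$$U^\dagger(\varepsilon) = \mathbb{1} + \varepsilon G^\dagger + \dots \tag{37}$$

and:

$$U(\varepsilon)U^\dagger(\varepsilon) = U^\dagger(\varepsilon)U(\varepsilon) = \mathbb{1} + \varepsilon(G + G^\dagger) + \dots \tag{38}$$

Since $U(\varepsilon)$ is unitary, the first-order terms in $\varepsilon$ on the right-hand side of (38) are zero; we therefore have:

$$G + G^\dagger = 0 \tag{39}$$

This equation expresses the fact that the operator $G$ is anti-Hermitian. It is convenient to set:

$$F = iG \tag{40}$$

so as to obtain the equation:

$$F - F^\dagger = 0 \tag{41}$$

which states that $F$ is Hermitian. An infinitesimal unitary operator can therefore be written in the form:

$$U(\varepsilon) = \mathbb{1} - i\varepsilon F \tag{42}$$

where $F$ is a Hermitian operator.

Substituting (42) into (30), we obtain:

$$\tilde{A} = (\mathbb{1} - i\varepsilon F)A(\mathbb{1} + i\varepsilon F^\dagger) = (\mathbb{1} - i\varepsilon F)A(\mathbb{1} + i\varepsilon F) \tag{43}$$

and, therefore:

$$\tilde{A} - A = -i\varepsilon[F, A] \tag{44}$$

The variation of the operator $A$ under the transformation $U$ is, to first order in $\varepsilon$, proportional to the commutator $[F, A]$.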
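The first-order statement (44) can be tested numerically. In the sketch below, the exact unitary is taken to be $e^{-i\varepsilon F}$, which is one possible operator whose expansion starts as in (42); this choice, the 3 x 3 random Hermitian matrices and the helper names are assumptions made for illustration. The deviation from $-i\varepsilon[F, A]$ should then scale like $\varepsilon^2$.

```python
import numpy as np

rng = np.random.default_rng(2)

def random_hermitian(n):
    """Random n x n Hermitian matrix."""
    m = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
    return (m + m.conj().T) / 2

F = random_hermitian(3)   # Hermitian generator
A = random_hermitian(3)   # operator being transformed

def unitary(eps):
    """exp(-i eps F), built from the spectral decomposition of F."""
    w, v = np.linalg.eigh(F)
    return (v * np.exp(-1j * eps * w)) @ v.conj().T

def first_order_error(eps):
    """Largest entry of (A_tilde - A) - (-i eps [F, A])."""
    U = unitary(eps)
    exact = U @ A @ U.conj().T - A
    first_order = -1j * eps * (F @ A - A @ F)
    return np.abs(exact - first_order).max()

ratio = first_order_error(1e-2) / first_order_error(1e-3)
print(ratio)  # close to 100 if the neglected terms are of order eps**2
```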
Complement D_II

A MORE DETAILED STUDY OF THE { | r > }
AND { | p > } REPRESENTATIONS


1. The { | r > } representation

a. The R operator and functions of R

b. The P operator and functions of P

c. The Schrödinger equation in the { | r > } representation

2. The { | p > } representation

a. The P operator and functions of P

b. The R operator and functions of R

c. The Schrödinger equation in the { | p > } representation


1. The { | r > } representation 

a. THE R OPERATOR AND FUNCTIONS OF R 

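Let us calculate the matrix elements, in the $\{|\mathbf{r}\rangle\}$ representation, of the $X$, $Y$, $Z$ operators. Using formula (E-36) of chapter II and the orthogonality relations of the kets $|\mathbf{r}\rangle$, we immediately obtain:

$$\langle\mathbf{r}|X|\mathbf{r}'\rangle = x\,\delta(\mathbf{r} - \mathbf{r}')$$

$$\langle\mathbf{r}|Y|\mathbf{r}'\rangle = y\,\delta(\mathbf{r} - \mathbf{r}')$$

$$\langle\mathbf{r}|Z|\mathbf{r}'\rangle = z\,\delta(\mathbf{r} - \mathbf{r}') \tag{1}$$

These three equations can be condensed into one:

$$\langle\mathbf{r}|\mathbf{R}|\mathbf{r}'\rangle = \mathbf{r}\,\delta(\mathbf{r} - \mathbf{r}') \tag{2}$$

The matrix elements, in the $\{|\mathbf{r}\rangle\}$ representation, of a function $F(\mathbf{R})$ are also very simple [cf. equation (27) of complement B_II]:

$$\langle\mathbf{r}|F(\mathbf{R})|\mathbf{r}'\rangle = F(\mathbf{r})\,\delta(\mathbf{r} - \mathbf{r}') \tag{3}$$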

b. THE P OPERATOR AND FUNCTIONS OF P 

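Let us calculate the matrix element $\langle\mathbf{r}|P_x|\mathbf{r}'\rangle$:

$$\langle\mathbf{r}|P_x|\mathbf{r}'\rangle = \int d^3p\,\langle\mathbf{r}|P_x|\mathbf{p}\rangle\langle\mathbf{p}|\mathbf{r}'\rangle = \int d^3p\;p_x\,\langle\mathbf{r}|\mathbf{p}\rangle\langle\mathbf{p}|\mathbf{r}'\rangle = (2\pi\hbar)^{-3}\int d^3p\;p_x\,e^{i\mathbf{p}\cdot(\mathbf{r}-\mathbf{r}')/\hbar}$$

$$= \left[\frac{1}{2\pi\hbar}\int_{-\infty}^{+\infty} dp_x\,p_x\,e^{ip_x(x-x')/\hbar}\right]\left[\frac{1}{2\pi\hbar}\int_{-\infty}^{+\infty} dp_y\,e^{ip_y(y-y')/\hbar}\right]\left[\frac{1}{2\pi\hbar}\int_{-\infty}^{+\infty} dp_z\,e^{ip_z(z-z')/\hbar}\right] \tag{4}$$

From this, it follows that, using the integral form of the "delta function" and its derivative [cf. appendix II, equations (34) and (53)]:

$$\langle\mathbf{r}|P_x|\mathbf{r}'\rangle = \frac{\hbar}{i}\,\delta'(x - x')\,\delta(y - y')\,\delta(z - z') \tag{5}$$

The matrix elements of the other components of $\mathbf{P}$ could be obtained in an analogous fashion.

Let us verify that the action of $P_x$ in the $\{|\mathbf{r}\rangle\}$ representation can indeed be derived from formula (5). To do so, let us calculate:

$$\langle\mathbf{r}|P_x|\psi\rangle = \int d^3r'\,\langle\mathbf{r}|P_x|\mathbf{r}'\rangle\langle\mathbf{r}'|\psi\rangle \tag{6}$$

From (5):

$$\langle\mathbf{r}|P_x|\psi\rangle = \frac{\hbar}{i}\int\delta'(x - x')\,dx'\int\delta(y - y')\,dy'\int\delta(z - z')\,\psi(x', y', z')\,dz' \tag{7}$$

Using the relation

$$\int_{-\infty}^{+\infty}\delta'(-u)f(u)\,du = -\int_{-\infty}^{+\infty}\delta'(u)f(u)\,du = f'(0) \tag{8}$$

and taking $u = x' - x$, we obtain:

$$\langle\mathbf{r}|P_x|\psi\rangle = \frac{\hbar}{i}\frac{\partial}{\partial x}\psi(x, y, z) \tag{9}$$

which is indeed equation (E-26) of chapter II.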
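The passage through the momentum basis used above has a simple discrete analogue. The following Python sketch applies $P_x$ to a sampled one-dimensional wave function by multiplying its discrete Fourier transform by the grid momenta, and compares the result with $(\hbar/i)\,d\psi/dx$. The units ($\hbar = 1$), the periodic grid, the Gaussian test function and the helper name `apply_px` are assumptions made for this illustration.

```python
import numpy as np

HBAR = 1.0  # assumption: units in which hbar = 1

def apply_px(psi, dx):
    """Apply P_x to samples of psi(x) on a periodic grid, via the momentum basis."""
    p = HBAR * 2.0 * np.pi * np.fft.fftfreq(psi.size, d=dx)  # grid momenta
    return np.fft.ifft(p * np.fft.fft(psi))

n, length = 256, 20.0
dx = length / n
x = (np.arange(n) - n // 2) * dx
psi = np.exp(-x**2)                      # illustrative test function

numeric = apply_px(psi, dx)
analytic = (HBAR / 1j) * (-2.0 * x * np.exp(-x**2))  # (hbar/i) d(psi)/dx
max_error = np.abs(numeric - analytic).max()
print(max_error)
```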

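What is the value of the matrix element $\langle\mathbf{r}|G(\mathbf{P})|\mathbf{r}'\rangle$ of a function $G(\mathbf{P})$ of the $\mathbf{P}$ operator? An analogous calculation gives us:

$$\langle\mathbf{r}|G(\mathbf{P})|\mathbf{r}'\rangle = \int d^3p\,\langle\mathbf{r}|G(\mathbf{P})|\mathbf{p}\rangle\langle\mathbf{p}|\mathbf{r}'\rangle = (2\pi\hbar)^{-3}\int d^3p\;G(\mathbf{p})\,e^{i\mathbf{p}\cdot(\mathbf{r}-\mathbf{r}')/\hbar} = (2\pi\hbar)^{-3/2}\,\tilde{G}(\mathbf{r} - \mathbf{r}') \tag{10}$$

where $\tilde{G}(\mathbf{r})$ is the inverse Fourier transform of the function $G(\mathbf{p})$:

$$\tilde{G}(\mathbf{r}) = (2\pi\hbar)^{-3/2}\int d^3p\;e^{i\mathbf{p}\cdot\mathbf{r}/\hbar}\,G(\mathbf{p}) \tag{11}$$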
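c. THE SCHRÖDINGER EQUATION IN THE { | r > } REPRESENTATION

In chapter III, we shall introduce the Schrödinger equation, which is of fundamental importance in quantum mechanics:

$$i\hbar\frac{d}{dt}|\psi(t)\rangle = H|\psi(t)\rangle \tag{12}$$

where $H$ is the Hamiltonian operator, which we shall define at that time. For a (spinless) particle in a scalar potential $V(\mathbf{r})$ [cf. equation (B-42) of chapter III]:

$$H = \frac{1}{2m}\mathbf{P}^2 + V(\mathbf{R}) \tag{13}$$

We are going to see how to write this equation in the $\{|\mathbf{r}\rangle\}$ representation, that is, using the wave function $\psi(\mathbf{r}, t)$, defined by:

$$\psi(\mathbf{r}, t) = \langle\mathbf{r}|\psi(t)\rangle \tag{14}$$

Projecting (12) onto $|\mathbf{r}\rangle$, in the case where $H$ is given by formula (13), we obtain:

$$i\hbar\frac{\partial}{\partial t}\langle\mathbf{r}|\psi(t)\rangle = \frac{1}{2m}\langle\mathbf{r}|\mathbf{P}^2|\psi(t)\rangle + \langle\mathbf{r}|V(\mathbf{R})|\psi(t)\rangle \tag{15}$$

The quantities involved in this equation can be expressed in terms of $\psi(\mathbf{r}, t)$, since:

$$\frac{\partial}{\partial t}\langle\mathbf{r}|\psi(t)\rangle = \frac{\partial}{\partial t}\psi(\mathbf{r}, t) \tag{16}$$

$$\langle\mathbf{r}|V(\mathbf{R})|\psi(t)\rangle = V(\mathbf{r})\,\psi(\mathbf{r}, t) \tag{17}$$

The matrix element $\langle\mathbf{r}|\mathbf{P}^2|\psi(t)\rangle$ can be calculated by using the fact that in the $\{|\mathbf{r}\rangle\}$ representation, $\mathbf{P}$ acts like $\frac{\hbar}{i}\nabla$:

$$\langle\mathbf{r}|\mathbf{P}^2|\psi(t)\rangle = \langle\mathbf{r}|(P_x^2 + P_y^2 + P_z^2)|\psi(t)\rangle = -\hbar^2\left(\frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2}\right)\psi(\mathbf{r}, t) = -\hbar^2\,\Delta\psi(\mathbf{r}, t) \tag{18}$$

The Schrödinger equation then becomes:

$$i\hbar\frac{\partial}{\partial t}\psi(\mathbf{r}, t) = \left[-\frac{\hbar^2}{2m}\Delta + V(\mathbf{r})\right]\psi(\mathbf{r}, t) \tag{19}$$

This is indeed the wave equation introduced in chapter I (§ B-2).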
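2. The { | p > } representation

a. THE P OPERATOR AND FUNCTIONS OF P

We obtain without difficulty formulas analogous to (2) and (3):

$$\langle\mathbf{p}|\mathbf{P}|\mathbf{p}'\rangle = \mathbf{p}\,\delta(\mathbf{p} - \mathbf{p}') \tag{20}$$

$$\langle\mathbf{p}|G(\mathbf{P})|\mathbf{p}'\rangle = G(\mathbf{p})\,\delta(\mathbf{p} - \mathbf{p}') \tag{21}$$

b. THE R OPERATOR AND FUNCTIONS OF R

Arguments analogous to those of § 1 give us the formulas which correspond to (5) and (10):

$$\langle\mathbf{p}|X|\mathbf{p}'\rangle = i\hbar\,\delta'(p_x - p'_x)\,\delta(p_y - p'_y)\,\delta(p_z - p'_z) \tag{22}$$

and

$$\langle\mathbf{p}|F(\mathbf{R})|\mathbf{p}'\rangle = (2\pi\hbar)^{-3/2}\,\bar{F}(\mathbf{p} - \mathbf{p}') \tag{23}$$

with

$$\bar{F}(\mathbf{p}) = (2\pi\hbar)^{-3/2}\int d^3r\;e^{-i\mathbf{p}\cdot\mathbf{r}/\hbar}\,F(\mathbf{r}) \tag{24}$$

c. THE SCHRÖDINGER EQUATION IN THE { | p > } REPRESENTATION

Let us introduce the "wave function in the $\{|\mathbf{p}\rangle\}$ representation" by:

$$\bar{\psi}(\mathbf{p}, t) = \langle\mathbf{p}|\psi(t)\rangle \tag{25}$$

Using (12), we shall look for the equation which gives the time evolution of $\bar{\psi}(\mathbf{p}, t)$. Projecting (12) onto the ket $|\mathbf{p}\rangle$, we obtain:

$$i\hbar\frac{\partial}{\partial t}\langle\mathbf{p}|\psi(t)\rangle = \frac{1}{2m}\langle\mathbf{p}|\mathbf{P}^2|\psi(t)\rangle + \langle\mathbf{p}|V(\mathbf{R})|\psi(t)\rangle \tag{26}$$

Now we have:

$$\frac{\partial}{\partial t}\langle\mathbf{p}|\psi(t)\rangle = \frac{\partial}{\partial t}\bar{\psi}(\mathbf{p}, t) \tag{27}$$

$$\langle\mathbf{p}|\mathbf{P}^2|\psi(t)\rangle = \mathbf{p}^2\,\bar{\psi}(\mathbf{p}, t) \tag{28}$$

The quantity which remains to be calculated is:

$$\langle\mathbf{p}|V(\mathbf{R})|\psi(t)\rangle = \int d^3p'\,\langle\mathbf{p}|V(\mathbf{R})|\mathbf{p}'\rangle\langle\mathbf{p}'|\psi(t)\rangle \tag{29}$$

Using (23), we find:

$$\langle\mathbf{p}|V(\mathbf{R})|\psi(t)\rangle = (2\pi\hbar)^{-3/2}\int d^3p'\;\bar{V}(\mathbf{p} - \mathbf{p}')\,\bar{\psi}(\mathbf{p}', t) \tag{30}$$

where $\bar{V}(\mathbf{p})$ is the Fourier transform of $V(\mathbf{r})$:

$$\bar{V}(\mathbf{p}) = (2\pi\hbar)^{-3/2}\int d^3r\;e^{-i\mathbf{p}\cdot\mathbf{r}/\hbar}\,V(\mathbf{r}) \tag{31}$$

The Schrödinger equation in the $\{|\mathbf{p}\rangle\}$ representation is therefore written:

$$i\hbar\frac{\partial}{\partial t}\bar{\psi}(\mathbf{p}, t) = \frac{\mathbf{p}^2}{2m}\bar{\psi}(\mathbf{p}, t) + (2\pi\hbar)^{-3/2}\int d^3p'\;\bar{V}(\mathbf{p} - \mathbf{p}')\,\bar{\psi}(\mathbf{p}', t) \tag{32}$$

COMMENT:

Since $\bar{\psi}(\mathbf{p}, t)$ is the Fourier transform of $\psi(\mathbf{r}, t)$ [cf. formula (E-18) of chapter II], it would have been possible to find equation (32) by taking the Fourier transforms of both sides of equation (19).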
Complement E_II

SOME GENERAL PROPERTIES OF TWO OBSERVABLES, Q AND P,
WHOSE COMMUTATOR IS EQUAL TO iħ


1. The operator S(λ): definition, properties

2. Eigenvalues and eigenvectors of Q

a. Spectrum of Q

b. Degree of degeneracy

c. Eigenvectors

3. The { | q > } representation

a. The action of Q in the { | q > } representation

b. The action of S(λ) in the { | q > } representation; the translation operator

c. The action of P in the { | q > } representation

4. The { | p > } representation. The symmetrical nature of the P and Q observables


In quantum mechanics, one often encounters operators whose commutator is equal to $i\hbar$. This is the case, for example, when these two operators correspond to the two classical conjugate quantities $q_i$ and $p_i$ ($q_i$, the coordinate in a system of orthonormal axes, and the conjugate momentum $p_i = \partial\mathcal{L}/\partial\dot{q}_i$). In quantum mechanics, one associates with $q_i$ and $p_i$ the operators $Q_i$ and $P_i$ which satisfy the relation:

$$[Q_i, P_i] = i\hbar \tag{1}$$

In §E of chapter II, we encountered such operators: $X$ and $P_x$. In this complement, we shall take a more general point of view and show that it is possible to establish a whole series of important properties relative to two observables $P$ and $Q$ whose commutator is equal to $i\hbar$. All these properties are consequences of commutation relation (1).


1. The operator S(λ): definition, properties

We shall consider two observables $P$ and $Q$, satisfying the relation:

$$[Q, P] = i\hbar \tag{2}$$

and we shall define the operator $S(\lambda)$, which depends on the real parameter $\lambda$, by:

$$S(\lambda) = e^{-i\lambda P/\hbar} \tag{3}$$

This operator is unitary; it is easy to verify the relations:

$$S^\dagger(\lambda) = S^{-1}(\lambda) = S(-\lambda) \tag{4}$$

Let us calculate the commutator $[Q, S(\lambda)]$. We can apply formula (51) of complement B_II, since $[Q, P] = i\hbar$ commutes with $Q$ and $P$:

$$[Q, S(\lambda)] = i\hbar\left(-\frac{i\lambda}{\hbar}\right)e^{-i\lambda P/\hbar} = \lambda S(\lambda) \tag{5}$$

This relation can also be written:

$$QS(\lambda) = S(\lambda)[Q + \lambda] \tag{6}$$

Finally, note that:

$$S(\lambda)S(\mu) = S(\lambda + \mu) \tag{7}$$

2. Eigenvalues and eigenvectors of Q 

a. SPECTRUM OF Q 

Assume that $Q$ has a non-zero eigenvector $|q\rangle$, with eigenvalue $q$:

$$Q|q\rangle = q|q\rangle \tag{8}$$

Apply equation (6) to the vector $|q\rangle$. This yields:

$$QS(\lambda)|q\rangle = S(\lambda)(Q + \lambda)|q\rangle = S(\lambda)(q + \lambda)|q\rangle = (q + \lambda)S(\lambda)|q\rangle \tag{9}$$

This equation expresses the fact that $S(\lambda)|q\rangle$ is another non-zero eigenvector of $Q$, with an eigenvalue of $(q + \lambda)$ ($S(\lambda)|q\rangle$ is non-zero because $S(\lambda)$ is unitary). Thus, starting with an eigenvector of $Q$, one can, by applying $S(\lambda)$, construct another eigenvector of $Q$, with any real eigenvalue ($\lambda$ can indeed take on any real value). The spectrum of $Q$ is therefore a continuous spectrum, composed of all possible values on the real axis*.

b. DEGREE OF DEGENERACY 

From now on, we shall assume, for simplicity, that the eigenvalue $q$ of $Q$ is non-degenerate (the results which we shall derive can be generalized to the case where $q$ is degenerate). Let us show that if $q$ is non-degenerate, all the other eigenvalues of $Q$ are also non-degenerate. Let us assume, for example, that the eigenvalue $q + \lambda$ is two-fold degenerate, and we shall show that we arrive at a contradiction. There would then exist two orthogonal eigenvectors, $|q + \lambda, \alpha\rangle$ and $|q + \lambda, \beta\rangle$, corresponding to the eigenvalue $q + \lambda$:

$$\langle q + \lambda, \beta|q + \lambda, \alpha\rangle = 0 \tag{10}$$


* This shows that in a space $\mathcal{E}$ of finite dimension $N$, there are no observables $Q$ and $P$ whose commutator is equal to $i\hbar$. The number of eigenvalues of $Q$ could not be simultaneously less than or equal to $N$ and infinite.

This result can be derived directly, moreover, by taking the trace of relation (2): $\mathrm{Tr}\,QP - \mathrm{Tr}\,PQ = \mathrm{Tr}\,i\hbar$. When $N$ is finite, the traces on the left-hand side of this equation exist: they are finite and equal numbers [cf. complement B_II, formula (7-a)]. The equation becomes $0 = \mathrm{Tr}\,i\hbar = Ni\hbar$, which is impossible.
Consider the two vectors $S(-\lambda)|q + \lambda, \alpha\rangle$ and $S(-\lambda)|q + \lambda, \beta\rangle$. They are, according to (9), two eigenvectors of $Q$, with an eigenvalue of $q + \lambda - \lambda = q$. They are not collinear, since they are orthogonal; their scalar product can be written, using the fact that $S(\lambda)$ is unitary:

$$\langle q + \lambda, \beta|S^\dagger(-\lambda)S(-\lambda)|q + \lambda, \alpha\rangle = \langle q + \lambda, \beta|q + \lambda, \alpha\rangle = 0 \tag{11}$$

We reach the conclusion that $q$ is at least two-fold degenerate, which is contrary to the initial hypothesis. Consequently, all the eigenvalues of $Q$ must have the same degree of degeneracy.


c. EIGENVECTORS 

We shall fix the relative phases of the different eigenvectors of $Q$ with respect to the eigenvector $|0\rangle$, of eigenvalue 0, by setting:

$$|q\rangle = S(q)|0\rangle \tag{12}$$

Applying $S(\lambda)$ to both sides of (12) and using (7), we obtain:

$$S(\lambda)|q\rangle = S(\lambda)S(q)|0\rangle = S(\lambda + q)|0\rangle = |q + \lambda\rangle \tag{13}$$

The adjoint expression of (13) is written:

$$\langle q|S^\dagger(\lambda) = \langle q + \lambda| \tag{14}$$

or, using (4) and replacing $\lambda$ by $-\lambda$:

$$\langle q|S(\lambda) = \langle q - \lambda| \tag{15}$$

3. The { | q > } representation 


Since $Q$ is an observable, the set of its eigenvectors $\{|q\rangle\}$ constitutes a basis of $\mathcal{E}$. It is possible to characterize each ket by its "wave function in the $\{|q\rangle\}$ representation":

$$\psi(q) = \langle q|\psi\rangle \tag{16}$$


a. THE ACTION OF Q IN THE { | q > } REPRESENTATION

Let us calculate in the $\{|q\rangle\}$ representation the wave function associated with the ket $Q|\psi\rangle$. It is written:

$$\langle q|Q|\psi\rangle = q\langle q|\psi\rangle = q\,\psi(q) \tag{17}$$

[using (8) and the fact that $Q$ is Hermitian]. The action of $Q$ in the $\{|q\rangle\}$ representation is therefore simply a multiplication by $q$.
b. THE ACTION OF S(λ) IN THE { | q > } REPRESENTATION:
THE TRANSLATION OPERATOR

The wave function in the $\{|q\rangle\}$ representation associated with the ket $S(\lambda)|\psi\rangle$ is written [formula (15)]:

$$\langle q|S(\lambda)|\psi\rangle = \langle q - \lambda|\psi\rangle = \psi(q - \lambda) \tag{18}$$

The action of the operator $S(\lambda)$ in the $\{|q\rangle\}$ representation is therefore a translation of the wave function over a distance $\lambda$ parallel to the $q$-axis*. For this reason, $S(\lambda)$ is called the translation operator.
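The translation property (18) can be illustrated on a grid. In the sketch below, $S(\lambda) = e^{-i\lambda P/\hbar}$ is applied in the discrete momentum basis (a fast Fourier transform), where it is a simple phase factor. The periodic grid, the units $\hbar = 1$, the Gaussian wave function and the helper name `translate` are assumptions of this example; a shift by a whole number of grid steps is chosen so that the comparison with a shifted array is exact up to rounding.

```python
import numpy as np

HBAR = 1.0  # assumption: units in which hbar = 1

def translate(psi, lam, dq):
    """Apply S(lam) = exp(-i lam P / hbar) to samples of psi(q) on a periodic grid."""
    p = HBAR * 2.0 * np.pi * np.fft.fftfreq(psi.size, d=dq)  # grid momenta
    return np.fft.ifft(np.exp(-1j * lam * p / HBAR) * np.fft.fft(psi))

n, dq = 256, 0.1
q = (np.arange(n) - n // 2) * dq
psi = np.exp(-(q + 3.0) ** 2)          # Gaussian centred at q = -3

shift_points = 20                      # lambda = 20 grid steps = 2.0
shifted = translate(psi, shift_points * dq, dq)

# psi(q - lambda): the Gaussian should now be centred at q = -1
peak_position = q[np.argmax(np.abs(shifted))]
print(peak_position)
```

The wave function moves towards positive $q$ by $\lambda$, in agreement with the footnote on the meaning of $f(x - a)$.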


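c. THE ACTION OF P IN THE { | q > } REPRESENTATION

When $\varepsilon$ is an infinitely small quantity, we have:

$$S(-\varepsilon) = e^{i\varepsilon P/\hbar} = \mathbb{1} + i\frac{\varepsilon}{\hbar}P + O(\varepsilon^2) \tag{19}$$

Consequently:

$$\langle q|S(-\varepsilon)|\psi\rangle = \psi(q) + i\frac{\varepsilon}{\hbar}\langle q|P|\psi\rangle + O(\varepsilon^2) \tag{20}$$

On the other hand, equation (18) yields:

$$\langle q|S(-\varepsilon)|\psi\rangle = \psi(q + \varepsilon) \tag{21}$$

Comparison of (20) and (21) shows that:

$$\psi(q + \varepsilon) = \psi(q) + i\frac{\varepsilon}{\hbar}\langle q|P|\psi\rangle + O(\varepsilon^2) \tag{22}$$

It follows that:

$$\langle q|P|\psi\rangle = \frac{\hbar}{i}\lim_{\varepsilon \to 0}\frac{\psi(q + \varepsilon) - \psi(q)}{\varepsilon} = \frac{\hbar}{i}\frac{d}{dq}\psi(q) \tag{23}$$

The action of $P$ in the $\{|q\rangle\}$ representation is therefore that of $\frac{\hbar}{i}\frac{d}{dq}$. Equation (E-26) of chapter II is thus generalized.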


4. The { | p > } representation. 

The symmetrical nature of the P and Q observables 

Relation (23) enables us to obtain easily the wave function $v_p(q)$ associated, in the $\{|q\rangle\}$ representation, with the eigenvector $|p\rangle$ of $P$ with an eigenvalue of $p$:

$$v_p(q) = \langle q|p\rangle = (2\pi\hbar)^{-1/2}\,e^{ipq/\hbar} \tag{24}$$


* The function $f(x - a)$ is the function which, at the point $x = x_0 + a$, takes on the value $f(x_0)$. It is therefore the function obtained from $f(x)$ by a translation of $+a$.
We can therefore write:

$$|p\rangle = (2\pi\hbar)^{-1/2}\int_{-\infty}^{+\infty} dq\;e^{ipq/\hbar}\,|q\rangle \tag{25}$$

A ket $|\psi\rangle$ can be defined by its "wave function in the $\{|p\rangle\}$ representation":

$$\bar{\psi}(p) = \langle p|\psi\rangle \tag{26}$$

Using the adjoint relation of (25), we obtain:

$$\bar{\psi}(p) = (2\pi\hbar)^{-1/2}\int_{-\infty}^{+\infty} dq\;e^{-ipq/\hbar}\,\psi(q) \tag{27}$$

$\bar{\psi}(p)$ is therefore the Fourier transform of $\psi(q)$.

The action of the $P$ operator in the $\{|p\rangle\}$ representation corresponds to a multiplication by $p$; that of the $Q$ operator corresponds, as can easily be shown using (27), to the operation $i\hbar\frac{d}{dp}$.

Thus we obtain symmetrical results in the $\{|q\rangle\}$ and $\{|p\rangle\}$ representations. This is not surprising: in our hypotheses, it is possible to exchange the $P$ and $Q$ operators, simply changing the sign of the commutator in relation (2). Instead of introducing the operator $S(\lambda)$, we could therefore have considered $T(\lambda')$ defined by:

$$T(\lambda') = e^{i\lambda' Q/\hbar} \tag{28}$$

and we could have developed the same arguments, replacing $P$ by $Q$ and $i$ by $-i$ everywhere.

References: 

Messiah (1.17), Vol. I, §VIII-6; Dirac (1.13), §25; Merzbacher (1.16), chap. 14, §7. 
Complement F_II

THE PARITY OPERATOR


1. The parity operator

a. Definition

b. Simple properties of Π

c. Eigensubspaces of Π

2. Even and odd operators

a. Definitions

b. Selection rules

c. Examples

d. Functions of operators

3. Eigenstates of an even observable B_+

4. Application to an important special case

1. The parity operator 

a. DEFINITION 

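Consider a physical system whose state space is $\mathcal{E}_{\mathbf{r}}$. The parity operator $\Pi$ is defined by its action on the basis vectors $|\mathbf{r}\rangle$ of $\mathcal{E}_{\mathbf{r}}$*:

$$\Pi|\mathbf{r}\rangle = |-\mathbf{r}\rangle \tag{1}$$

The matrix elements of $\Pi$ are therefore, in the $\{|\mathbf{r}\rangle\}$ representation:

$$\langle\mathbf{r}|\Pi|\mathbf{r}'\rangle = \langle\mathbf{r}|-\mathbf{r}'\rangle = \delta(\mathbf{r} + \mathbf{r}') \tag{2}$$

Consider an arbitrary vector $|\psi\rangle$ of $\mathcal{E}_{\mathbf{r}}$:

$$|\psi\rangle = \int d^3r\;\psi(\mathbf{r})\,|\mathbf{r}\rangle \tag{3}$$

If the variable change $\mathbf{r}' = -\mathbf{r}$ is performed, $|\psi\rangle$ can be written:

$$|\psi\rangle = \int d^3r'\;\psi(-\mathbf{r}')\,|-\mathbf{r}'\rangle \tag{4}$$

Now calculate $\Pi|\psi\rangle$; we obtain:

$$\Pi|\psi\rangle = \int d^3r'\;\psi(-\mathbf{r}')\,|\mathbf{r}'\rangle \tag{5}$$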

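* Care must be taken not to confuse $|-\mathbf{r}_0\rangle$ and $-|\mathbf{r}_0\rangle$. The former is an eigenvector of $\mathbf{R}$, with eigenvalue $-\mathbf{r}_0$ and wave function $\xi_{-\mathbf{r}_0}(\mathbf{r}) = \delta(\mathbf{r} + \mathbf{r}_0)$. The latter is an eigenvector of $\mathbf{R}$ with eigenvalue $\mathbf{r}_0$ and wave function $-\xi_{\mathbf{r}_0}(\mathbf{r}) = -\delta(\mathbf{r} - \mathbf{r}_0)$.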
Comparison of (3) and (5) shows that the action of $\Pi$ in the $\{|\mathbf{r}\rangle\}$ representation is to change $\mathbf{r}$ to $-\mathbf{r}$:

$$\langle\mathbf{r}|\Pi|\psi\rangle = \psi(-\mathbf{r}) \tag{6}$$

Now let us consider a physical system $\mathcal{S}$ whose state vector is $|\psi\rangle$; $\Pi|\psi\rangle$ describes the physical system obtained from $\mathcal{S}$ by reflection through the origin of the axes.

b. SIMPLE PROPERTIES OF Π

The operator $\Pi^2$ is the identity operator. From (1) we have:

$$\Pi^2|\mathbf{r}\rangle = \Pi(\Pi|\mathbf{r}\rangle) = \Pi|-\mathbf{r}\rangle = |\mathbf{r}\rangle \tag{7}$$

that is, since the kets $|\mathbf{r}\rangle$ form a basis of $\mathcal{E}_{\mathbf{r}}$:

$$\Pi^2 = \mathbb{1} \tag{8-a}$$

or:

$$\Pi = \Pi^{-1} \tag{8-b}$$

It is easy to show by recurrence that the operator $\Pi^n$ is

- equal to $\mathbb{1}$ when $n$ is even

- equal to $\Pi$ when $n$ is odd

We can rewrite (6) in the form:

$$\langle\mathbf{r}|\Pi|\psi\rangle = \langle-\mathbf{r}|\psi\rangle \tag{9}$$

Since this equation is valid for all $|\psi\rangle$, it can be deduced that:

$$\langle\mathbf{r}|\Pi = \langle-\mathbf{r}| \tag{10}$$

Moreover, the Hermitian conjugate expression of (1) is written:

$$\langle\mathbf{r}|\Pi^\dagger = \langle-\mathbf{r}| \tag{11}$$

Since the kets $|\mathbf{r}\rangle$ form a basis, it follows from (10) and (11) that $\Pi$ is Hermitian:

$$\Pi^\dagger = \Pi \tag{12}$$

Combining this equation with (8-b), we obtain:

$$\Pi^{-1} = \Pi^\dagger \tag{13}$$

$\Pi$ is therefore unitary as well.
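These properties have a direct finite-dimensional analogue. As a hedged illustration, the Python sketch below represents $\Pi$ on a one-dimensional grid that is symmetric about the origin, where it simply reverses the order of the samples. The grid size 6 and the helper name `parity_matrix` are assumptions of this example.

```python
import numpy as np

def parity_matrix(n):
    """Parity on a grid symmetric about the origin: (Pi psi)[j] = psi[n - 1 - j]."""
    return np.eye(n)[::-1]

Pi = parity_matrix(6)

is_involution = np.allclose(Pi @ Pi, np.eye(6))        # Pi^2 = 1, relation (8-a)
is_hermitian = np.allclose(Pi, Pi.conj().T)            # relation (12)
is_unitary = np.allclose(Pi.conj().T @ Pi, np.eye(6))  # relation (13)
eigenvalues = np.linalg.eigvalsh(Pi)                   # only +1 and -1 appear
print(is_involution, is_hermitian, is_unitary, eigenvalues)
```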


c. EIGENSUBSPACES OF Π

Let $|\varphi_\pi\rangle$ be an eigenvector of $\Pi$, with an eigenvalue of $p_\pi$. Applying (8-a), we obtain:

$$|\varphi_\pi\rangle = \Pi^2|\varphi_\pi\rangle = p_\pi^2|\varphi_\pi\rangle \tag{14}$$

We therefore have $p_\pi^2 = 1$: the eigenvalues of $\Pi$ are limited to $1$ and $-1$. Since the space $\mathcal{E}_{\mathbf{r}}$ is infinite-dimensional, we immediately see that these eigenvalues are degenerate. An eigenvector of $\Pi$ with the eigenvalue $+1$ will be said to be even; an eigenvector with the eigenvalue $-1$, odd.

Consider the two operators $P_+$ and $P_-$ defined by:

$$P_+ = \frac{1}{2}(\mathbb{1} + \Pi)$$

$$P_- = \frac{1}{2}(\mathbb{1} - \Pi) \tag{15}$$

These operators are Hermitian; using (8-a), it is easy to show that:

$$P_+^2 = P_+$$

$$P_-^2 = P_- \tag{16}$$

$P_+$ and $P_-$ are thus the projectors onto two subspaces of $\mathcal{E}_{\mathbf{r}}$ which we shall call $\mathcal{E}_+$ and $\mathcal{E}_-$. Let us calculate the products $P_+P_-$ and $P_-P_+$; we obtain:

$$P_+P_- = \frac{1}{4}(\mathbb{1} + \Pi - \Pi - \Pi^2) = 0$$

$$P_-P_+ = \frac{1}{4}(\mathbb{1} - \Pi + \Pi - \Pi^2) = 0 \tag{17}$$

The two subspaces $\mathcal{E}_+$ and $\mathcal{E}_-$ are therefore orthogonal. Let us show that they are also supplementary. We see immediately from definition (15) that:

$$P_+ + P_- = \mathbb{1} \tag{18}$$

For all $|\psi\rangle \in \mathcal{E}_{\mathbf{r}}$, we have, therefore:

$$|\psi\rangle = (P_+ + P_-)|\psi\rangle = |\psi_+\rangle + |\psi_-\rangle \tag{19}$$

with:

$$|\psi_+\rangle = P_+|\psi\rangle$$

$$|\psi_-\rangle = P_-|\psi\rangle \tag{20}$$

Let us calculate the products $\Pi P_+$ and $\Pi P_-$; we obtain:

$$\Pi P_+ = \frac{1}{2}\Pi(\mathbb{1} + \Pi) = \frac{1}{2}(\Pi + \mathbb{1}) = P_+$$

$$\Pi P_- = \frac{1}{2}\Pi(\mathbb{1} - \Pi) = \frac{1}{2}(\Pi - \mathbb{1}) = -P_- \tag{21}$$

These equations enable us to show that the vectors $|\psi_+\rangle$ and $|\psi_-\rangle$ introduced in (20) are even and odd, respectively:

$$\Pi|\psi_+\rangle = \Pi P_+|\psi\rangle = P_+|\psi\rangle = |\psi_+\rangle$$

$$\Pi|\psi_-\rangle = \Pi P_-|\psi\rangle = -P_-|\psi\rangle = -|\psi_-\rangle \tag{22}$$

The spaces $\mathcal{E}_+$ and $\mathcal{E}_-$ are therefore the eigensubspaces of $\Pi$, with the eigenvalues $+1$ and $-1$. In the $\{|\mathbf{r}\rangle\}$ representation, equations (22) can be written:

$$\langle\mathbf{r}|\psi_+\rangle = \psi_+(\mathbf{r}) = \langle\mathbf{r}|\Pi|\psi_+\rangle = \psi_+(-\mathbf{r})$$

$$\langle\mathbf{r}|\psi_-\rangle = \psi_-(\mathbf{r}) = -\langle\mathbf{r}|\Pi|\psi_-\rangle = -\psi_-(-\mathbf{r}) \tag{23}$$

The wave functions $\psi_+(\mathbf{r})$ and $\psi_-(\mathbf{r})$ are even and odd, respectively.

Relation (19) expresses the fact, therefore, that any ket $|\psi\rangle$ of $\mathcal{E}_{\mathbf{r}}$ can be decomposed into a sum of two eigenvectors of $\Pi$, $|\psi_+\rangle$ and $|\psi_-\rangle$, belonging respectively to the even subspace $\mathcal{E}_+$ and the odd subspace $\mathcal{E}_-$. Therefore, $\Pi$ is an observable.
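The decomposition (19) into even and odd parts can be sketched on a one-dimensional symmetric grid. In the Python example below, the grid, the off-centre Gaussian chosen as $\psi$ and the variable names are assumptions made for illustration only.

```python
import numpy as np

n = 201
x = np.linspace(-5.0, 5.0, n)        # grid symmetric about the origin
Pi = np.eye(n)[::-1]                 # parity: reverses the order of the samples

P_plus = (np.eye(n) + Pi) / 2        # projector onto even vectors, definition (15)
P_minus = (np.eye(n) - Pi) / 2       # projector onto odd vectors

psi = np.exp(-(x - 1.0) ** 2)        # a function with no definite parity
psi_plus = P_plus @ psi
psi_minus = P_minus @ psi

overlap = psi_plus @ psi_minus       # even and odd parts are orthogonal
print(overlap)
```

On the grid, `psi_plus` is the familiar $[\psi(x) + \psi(-x)]/2$ and `psi_minus` is $[\psi(x) - \psi(-x)]/2$.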


2. Even and odd operators 
a. DEFINITIONS 

In §2 of complement C_II, we defined the concept of a unitary transformation of operators. In the case of $\Pi$ [which is indeed unitary; see (13)], the transformed operator of an arbitrary operator $B$ is written:

$$\tilde{B} = \Pi B \Pi \tag{24}$$

and satisfies the relation [cf. equation (27) of complement C_II]:

$$\langle\mathbf{r}|\tilde{B}|\mathbf{r}'\rangle = \langle-\mathbf{r}|B|-\mathbf{r}'\rangle \tag{25}$$

The operator $\tilde{B}$ is said to be the parity transform of $B$.

In particular, if $\tilde{B} = +B$, the operator $B$ is said to be even;

if $\tilde{B} = -B$, the operator $B$ is said to be odd.

An even operator $B_+$ therefore satisfies:

$$B_+ = \Pi B_+ \Pi \tag{26}$$

or, multiplying this equation on the left by $\Pi$ and using (8-a):

$$\Pi B_+ = B_+ \Pi \tag{27}$$

$$[\Pi, B_+] = 0 \tag{28}$$

An even operator is therefore an operator which commutes with $\Pi$. It can be seen, similarly, that an odd operator $B_-$ is an operator which anticommutes with $\Pi$:

$$\Pi B_- + B_- \Pi = 0 \tag{29}$$


b. SELECTION RULES 

Let $B_+$ be an even operator. Let us calculate the matrix element $\langle\varphi|B_+|\psi\rangle$; by hypothesis, we have:

$$\langle\varphi|B_+|\psi\rangle = \langle\varphi|\Pi B_+ \Pi|\psi\rangle = \langle\varphi'|B_+|\psi'\rangle \tag{30}$$

with:

$$|\varphi'\rangle = \Pi|\varphi\rangle$$

$$|\psi'\rangle = \Pi|\psi\rangle \tag{31}$$

If one of the two kets, $|\varphi\rangle$ and $|\psi\rangle$, is even and the other odd ($|\varphi'\rangle = \pm|\varphi\rangle$, $|\psi'\rangle = \mp|\psi\rangle$), relation (30) yields:

$$\langle\varphi|B_+|\psi\rangle = -\langle\varphi|B_+|\psi\rangle = 0 \tag{32}$$

Hence the rule: the matrix elements of an even operator are zero between vectors of opposite parity.

If, now, $B_-$ is odd, relation (30) becomes:

$$\langle\varphi|B_-|\psi\rangle = -\langle\varphi'|B_-|\psi'\rangle \tag{33}$$

which is zero when $|\varphi\rangle$ and $|\psi\rangle$ are both either even or odd. Hence the rule: the matrix elements of an odd operator are zero between vectors of the same parity. In particular, the diagonal matrix element $\langle\psi|B_-|\psi\rangle$ (the mean value of $B_-$ in the state $|\psi\rangle$; cf. chapter III, §C-4) is zero if $|\psi\rangle$ has a definite parity.
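The two selection rules can be checked on a grid. In the hedged Python sketch below, $X$ (diagonal on the grid) serves as the odd operator and $X^2$ as the even one; the grid, the two test functions and the helper name `element` are assumptions chosen for illustration.

```python
import numpy as np

n = 101
x = np.linspace(-4.0, 4.0, n)        # grid symmetric about the origin
Pi = np.eye(n)[::-1]                 # parity on the grid
X = np.diag(x)                       # position operator (odd)

even = np.exp(-x**2)                 # even vector
odd = x * np.exp(-x**2)              # odd vector

def element(bra, op, ket):
    """Matrix element <bra| op |ket> on the grid (no dx factor)."""
    return np.vdot(bra, op @ ket)

x_even_even = element(even, X, even)      # odd operator, same parity: zero
x_even_odd = element(even, X, odd)        # odd operator, opposite parity: allowed
x2_even_odd = element(even, X @ X, odd)   # even operator, opposite parity: zero
print(x_even_even, x_even_odd, x2_even_odd)
```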


c. EXAMPLES 


α. The X, Y, Z operators

In this case, we have:

$$\Pi X|\mathbf{r}\rangle = \Pi X|x, y, z\rangle = x\,\Pi|x, y, z\rangle = x\,|-x, -y, -z\rangle = x\,|-\mathbf{r}\rangle \tag{34}$$

and:

$$X\Pi|\mathbf{r}\rangle = X|-x, -y, -z\rangle = -x\,|-x, -y, -z\rangle = -x\,|-\mathbf{r}\rangle \tag{35}$$

Adding these two equations together, we obtain:

$$(\Pi X + X\Pi)|\mathbf{r}\rangle = 0 \tag{36}$$

or, since the vectors $|\mathbf{r}\rangle$ form a basis:

$$\Pi X + X\Pi = 0 \tag{37}$$

$X$ is therefore odd.

The proofs are the same for $Y$ and $Z$; $\mathbf{R}$ is therefore an odd operator.


β. The P_x, P_y, P_z operators

Let us calculate the ket $\Pi|\mathbf{p}\rangle$; we obtain:

$$\Pi|\mathbf{p}\rangle = (2\pi\hbar)^{-3/2}\int d^3r\;e^{i\mathbf{p}\cdot\mathbf{r}/\hbar}\,\Pi|\mathbf{r}\rangle = (2\pi\hbar)^{-3/2}\int d^3r\;e^{i\mathbf{p}\cdot\mathbf{r}/\hbar}\,|-\mathbf{r}\rangle = (2\pi\hbar)^{-3/2}\int d^3r'\;e^{-i\mathbf{p}\cdot\mathbf{r}'/\hbar}\,|\mathbf{r}'\rangle = |-\mathbf{p}\rangle \tag{38}$$

We then have, using an argument analogous to the one developed in α:

$$\Pi P_x|\mathbf{p}\rangle = p_x\,|-\mathbf{p}\rangle$$

$$P_x\Pi|\mathbf{p}\rangle = -p_x\,|-\mathbf{p}\rangle \tag{39}$$

and

$$\Pi P_x + P_x\Pi = 0 \tag{40}$$

The $\mathbf{P}$ operator is odd.


γ. The parity operator

$\Pi$ obviously commutes with itself; it is an even operator.


d. FUNCTIONS OF OPERATORS 

Let $B_+$ be an even operator. Using relation (8-a), we obtain:

$$\Pi B_+^n \Pi = \underbrace{(\Pi B_+ \Pi)(\Pi B_+ \Pi)\dots(\Pi B_+ \Pi)}_{n\ \text{factors}} = B_+^n \tag{41}$$

An even operator raised to the $n$th power is even. Hence, any operator $F(B_+)$ is even. Let $B_-$ be an odd operator; let us calculate the operator $\Pi B_-^n \Pi$:

$$\Pi B_-^n \Pi = \underbrace{(\Pi B_- \Pi)(\Pi B_- \Pi)\dots(\Pi B_- \Pi)}_{n\ \text{factors}} = (-1)^n (B_-)^n \tag{42}$$

An odd operator raised to the $n$th power is even if $n$ is even, odd if $n$ is odd. Consider an operator $F(B_-)$; this operator is even if the corresponding function $F(z)$ is even, odd if it is odd. In general, $F(B_-)$ has no definite parity.


3. Eigenstates of an even observable B_+

Let us consider an arbitrary even observable $B_+$ and an eigenvector $|\varphi_b\rangle$ of $B_+$ with an eigenvalue of $b$. Since $B_+$ is even, it commutes with $\Pi$. Applying the theorems of §D-3-a of chapter II, we obtain the following results:

α. If $b$ is a non-degenerate eigenvalue, $|\varphi_b\rangle$ is necessarily an eigenvector of $\Pi$; it is therefore either an even or an odd vector. The mean value $\langle\varphi_b|B_-|\varphi_b\rangle$ of any odd observable $B_-$, such as $\mathbf{R}$, $\mathbf{P}$, etc., is then zero.

β. If $b$ is a degenerate eigenvalue corresponding to the eigensubspace $\mathcal{E}_b$, the vectors of $\mathcal{E}_b$ do not all necessarily have a definite parity. $\Pi|\varphi_b\rangle$ may be a vector which is non-collinear with $|\varphi_b\rangle$; it is nevertheless a vector which has the same eigenvalue $b$. Moreover, it is possible to find a basis of eigenvectors common to $\Pi$ and $B_+$ in every subspace $\mathcal{E}_b$.
4. Application to an important special case

We shall often need to find the eigenstates of a Hamiltonian operator $H$, acting in $\mathcal{E}_{\mathbf{r}}$, of the form:

$$H = \frac{\mathbf{P}^2}{2m} + V(\mathbf{R}) \tag{43}$$

Since the $\mathbf{P}$ operator is odd, the $\mathbf{P}^2$ operator is even. When, in addition, the function $V(\mathbf{r})$ is even ($V(\mathbf{r}) = V(-\mathbf{r})$), the operator $H$ is even. According to what we have just seen, it is then possible to look for the eigenstates of $H$ among the even or odd states. This often simplifies the calculation considerably.
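A numerical sketch can make this concrete. The Python example below discretizes a one-dimensional Hamiltonian of the form (43) by finite differences, checks that it commutes with the grid parity operator, and reads off the parity of its lowest eigenvectors. Everything specific here is an assumption made for illustration: the units ($\hbar = m = 1$), the even potential $V(x) = x^2/2$, the grid, and the three-point approximation of the second derivative.

```python
import numpy as np

n = 201
x = np.linspace(-6.0, 6.0, n)               # grid symmetric about the origin
dx = x[1] - x[0]

# Kinetic energy -(1/2) d^2/dx^2 with a three-point finite difference (hbar = m = 1)
kinetic = (2 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)) / (2 * dx**2)
H = kinetic + np.diag(0.5 * x**2)           # assumed even potential V(x) = x^2 / 2

Pi = np.eye(n)[::-1]                        # parity on the grid
commutator_norm = np.abs(Pi @ H - H @ Pi).max()

energies, states = np.linalg.eigh(H)        # columns of states are eigenvectors

# <phi_k| Pi |phi_k> is +1 for an even eigenvector and -1 for an odd one
parities = [float(np.rint(states[:, k] @ Pi @ states[:, k])) for k in range(4)]
print(commutator_norm, energies[:4], parities)
```

In this one-dimensional example the low-lying levels are non-degenerate, so each eigenvector comes out with a definite parity, alternating between even and odd.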

We have already encountered a certain number of cases where the Hamiltonian $H$ is even: the square well, the infinite well (cf. complement H_I). We shall study others: the harmonic oscillator, the hydrogen atom, etc. It is easy to verify in all these special cases the properties which we have derived.

COMMENT:

If $H$ is even, and if one of its eigenstates $|\varphi_h\rangle$ which has no definite parity ($\Pi|\varphi_h\rangle$ non-collinear to $|\varphi_h\rangle$) has been found, it can be asserted that the corresponding eigenvalue is degenerate: since $\Pi$ commutes with $H$, $\Pi|\varphi_h\rangle$ is an eigenvector of $H$ with the same eigenvalue as $|\varphi_h\rangle$.


References and suggestions for further reading : 

Schiff (1.18), §29; Roman (2.3), §5-3 d; Feynman I (6.3), chap. 52; Sakurai (2.7), 
chap. 3; articles by Morrison (2.28), Feinberg and Goldhaber (2.29), Wigner (2.30). 
Complement G_II

AN APPLICATION OF THE PROPERTIES
OF THE TENSOR PRODUCT:
THE TWO-DIMENSIONAL INFINITE WELL


1. Definition; eigenstates

2. Study of the energy levels

a. Ground state

b. First excited states

c. Systematic and accidental degeneracies


In complement H_I (§ 2-c) we have already studied, in a one-dimensional problem, the stationary states of a particle placed in an infinite potential well. By using the concept of a tensor product (cf. chap. II, § F), we shall be able to generalize this discussion to the case of a two-dimensional infinite well (the introduction of a third dimension would not involve any additional theoretical difficulty).


1. Definition; eigenstates 

We shall consider a particle of mass $m$, restricted to a plane $xOy$, inside a "square box" of edge $a$: its potential energy $V(x, y)$ becomes infinite when one of its coordinates $x$ or $y$ leaves the interval $[0, a]$:

$$V(x, y) = V_\infty(x) + V_\infty(y) \tag{1}$$

with:

$$V_\infty(u) = 0 \quad \text{if } 0 \le u \le a$$

$$V_\infty(u) = +\infty \quad \text{if } u < 0 \text{ or } u > a \tag{2}$$

The Hamiltonian of the quantum particle is then (chap. III, § B-5):

$$H = \frac{1}{2m}(P_x^2 + P_y^2) + V_\infty(X) + V_\infty(Y) \tag{3}$$

which can be written:

$$H = H_x + H_y \tag{4}$$

with:

$$H_x = \frac{1}{2m}P_x^2 + V_\infty(X)$$

$$H_y = \frac{1}{2m}P_y^2 + V_\infty(Y) \tag{5}$$

We thus find ourselves in the important special case mentioned in chapter II (§F-4-a-β), and we can consider the eigenstates of $H$ in the form:

$$|\Phi\rangle = |\varphi_x\rangle\,|\varphi_y\rangle \tag{6}$$

with:

$$H_x|\varphi_x\rangle = E_x|\varphi_x\rangle\,; \quad |\varphi_x\rangle \in \mathcal{E}_x$$

$$H_y|\varphi_y\rangle = E_y|\varphi_y\rangle\,; \quad |\varphi_y\rangle \in \mathcal{E}_y \tag{7}$$

We then have:

$$H|\Phi\rangle = E|\Phi\rangle$$

with:

$$E = E_x + E_y \tag{8}$$

We have therefore reduced a two-dimensional problem to a one-dimensional problem, which, moreover, has already been solved (cf. complement H_I). Applying the results of this complement, and formulas (7) and (8), we therefore see that:

- the eigenvalues of $H$ are of the form:

$$E_{n,p} = \frac{(n^2 + p^2)\,\pi^2\hbar^2}{2ma^2} \tag{9}$$

where $n$ and $p$ are positive integers.

- to these energies correspond eigenstates $|\Phi_{n,p}\rangle$ which can be written in the form of tensor products:

$$|\Phi_{n,p}\rangle = |\varphi_n\rangle_x\,|\varphi_p\rangle_y \tag{10}$$

whose normalized wave function is:

$$\Phi_{n,p}(x, y) = \varphi_n(x)\,\varphi_p(y) = \frac{2}{a}\sin\frac{n\pi x}{a}\,\sin\frac{p\pi y}{a} \tag{11}$$

It is easy to verify that these wave functions vanish at the edges of the "square box" ($x$ or $y$ = 0 or $a$), where the potential energy becomes infinite.


2. Study of the energy levels 

a. GROUND STATE 

$n$ and $p$ are strictly positive integers*. The ground state is therefore obtained when $n = 1$, $p = 1$. Its energy is:

$$E_{1,1} = \frac{\pi^2\hbar^2}{ma^2} \tag{12}$$

This value is attained only for $n = p = 1$. The ground state is therefore not degenerate.


* The values $n = 0$ or $p = 0$, which give null wave functions (therefore impossible to normalize) are excluded.
b. FIRST EXCITED STATES

The first excited state is obtained either for $n = 1$ and $p = 2$ or for $n = 2$ and $p = 1$. Its energy is:

$$E_{1,2} = E_{2,1} = \frac{5}{2}\frac{\pi^2\hbar^2}{ma^2} \tag{13}$$

This state is two-fold degenerate, since $|\Phi_{1,2}\rangle$ and $|\Phi_{2,1}\rangle$ are independent.

The second excited state corresponds to $n = p = 2$; it is not degenerate, and its energy is:

$$E_{2,2} = 4\frac{\pi^2\hbar^2}{ma^2} \tag{14}$$

The third excited state corresponds to $n = 1$, $p = 3$ and $n = 3$, $p = 1$, etc.
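The ordering of the first levels can be reproduced by a short enumeration. The Python sketch below lists the values of $n^2 + p^2$, that is, the energies (9) in units of $\pi^2\hbar^2/2ma^2$, together with the pairs $(n, p)$ that produce them. The function name `levels` and the cut-off `n_max` are assumptions of this illustration.

```python
from collections import defaultdict

def levels(n_max):
    """Map n^2 + p^2 (energy in units of pi^2 hbar^2 / 2 m a^2) to the pairs (n, p)."""
    table = defaultdict(list)
    for n in range(1, n_max + 1):
        for p in range(1, n_max + 1):
            table[n * n + p * p].append((n, p))
    return dict(sorted(table.items()))

lowest = list(levels(5).items())[:4]
for energy, pairs in lowest:
    print(energy, pairs)
```

In these units the ground state has the value 2, the first excited level 5 (two states), the second 8 (one state) and the third 10 (two states), in agreement with (13) and (14).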

c. SYSTEMATIC AND ACCIDENTAL DEGENERACIES

The general observation can be made that all levels for which $n \neq p$ are degenerate, since:

$$E_{n,p} = E_{p,n} \tag{15}$$

This degeneracy is related to a symmetry of the problem. The square well under consideration is symmetrical with respect to the first bisectrix of the $xOy$ plane. This is expressed by the fact that the Hamiltonian $H$ is invariant under the exchange:

$$x \leftrightarrow y\,, \quad p_x \leftrightarrow p_y \tag{16}$$

(In the state space, an operator could be defined to correspond to a reflection about the first bisectrix. It could then be shown that, in the present case, this operator commutes with $H$.) If an eigenstate of $H$ is known whose wave function is $\Phi(x, y)$, the state which corresponds to $\Phi'(x, y) = \Phi(y, x)$ is also an eigenstate of $H$ with the same eigenvalue. Consequently, if the function $\Phi(x, y)$ is not symmetric with respect to $x$ and $y$, the eigenvalue associated with it is necessarily degenerate. This is the origin of the degeneracy (15): for $n \neq p$, $\Phi_{n,p}(x, y)$ is not symmetrical with respect to $x$ and $y$ [formula (11)]. This interpretation is corroborated by the fact that if the symmetry is destroyed by choosing a well whose widths along $Ox$ and along $Oy$ are different (being equal to $a$ and $b$ respectively), the corresponding degeneracy disappears, and formula (9) becomes:

$$E_{n,p} = \frac{\pi^2\hbar^2}{2m}\left(\frac{n^2}{a^2} + \frac{p^2}{b^2}\right) \tag{17}$$

which implies:

$$E_{p,n} \neq E_{n,p} \tag{18}$$

Such degeneracies, whose origin lies in a symmetry of the problem, are called systematic degeneracies.


COMMENT:

The other symmetries of the two-dimensional square well do not create systematic degeneracies because the eigenstates of $H$ are all invariant with respect to them. For example, for arbitrary $n$ and $p$, $\Phi_{n,p}(x, y)$ is simply multiplied by a phase factor if $x$ is replaced by $(a - x)$ and $y$ by $(a - y)$ (symmetry with respect to the center of the well).

Degeneracies may also arise which are not directly related to the symmetry of the problem. They are called accidental degeneracies. For example, in the case which we have discussed, it so happens that $E_{5,5} = E_{7,1}$ and $E_{7,4} = E_{8,1}$.
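Accidental degeneracies of this kind can be searched for by enumeration. The Python sketch below keeps one representative per exchange pair ($n \le p$), so that the systematic degeneracy $E_{n,p} = E_{p,n}$ is factored out, and reports the values of $n^2 + p^2$ reached by more than one representative. The function name `accidental_degeneracies` and the cut-off $n, p \le 8$ are assumptions of this illustration.

```python
from collections import defaultdict

def accidental_degeneracies(n_max):
    """Values of n^2 + p^2 reached by several pairs with n <= p <= n_max."""
    pairs = defaultdict(list)
    for n in range(1, n_max + 1):
        for p in range(n, n_max + 1):   # n <= p: one representative per exchange pair
            pairs[n * n + p * p].append((n, p))
    return {e: v for e, v in sorted(pairs.items()) if len(v) > 1}

found = accidental_degeneracies(8)
print(found)
```

With this cut-off the search returns the two coincidences $1^2 + 7^2 = 5^2 + 5^2 = 50$ and $1^2 + 8^2 = 4^2 + 7^2 = 65$.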
Complement H_II

EXERCISES


Dirac notation. Commutators. Eigenvectors and eigenvalues


1. $|\varphi_n\rangle$ are the eigenstates of a Hermitian operator $H$ ($H$ is, for example, the Hamiltonian of an arbitrary physical system). Assume that the states $|\varphi_n\rangle$ form a discrete orthonormal basis. The operator $U(m, n)$ is defined by:

$$U(m, n) = |\varphi_m\rangle\langle\varphi_n|$$

a. Calculate the adjoint $U^\dagger(m, n)$ of $U(m, n)$.

b. Calculate the commutator $[H, U(m, n)]$.

c. Prove the relation:

$$U(m, n)\,U^\dagger(p, q) = \delta_{nq}\,U(m, p)$$

d. Calculate $\mathrm{Tr}\{U(m, n)\}$, the trace of the operator $U(m, n)$.

e. Let $A$ be an operator, with matrix elements $A_{mn} = \langle\varphi_m|A|\varphi_n\rangle$. Prove the relation:

$$A = \sum_{m,n} A_{mn}\,U(m, n)$$

f. Show that $A_{pq} = \mathrm{Tr}\{A\,U^\dagger(p, q)\}$.


2. In a two-dimensional vector space, consider the operator whose matrix, in an orthonormal basis $\{|1\rangle, |2\rangle\}$, is written:

$$\sigma_y = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}$$

a. Is $\sigma_y$ Hermitian? Calculate its eigenvalues and eigenvectors (giving their normalized expansion in terms of the $\{|1\rangle, |2\rangle\}$ basis).

b. Calculate the matrices which represent the projectors onto these eigenvectors. Then verify that they satisfy the orthogonality and closure relations.

c. Same questions for the matrices:

$$M = \begin{pmatrix} 2 & i\sqrt{2} \\ -i\sqrt{2} & 3 \end{pmatrix}$$

and, in a three-dimensional space:

$$L_y = \frac{\hbar}{2i}\begin{pmatrix} 0 & \sqrt{2} & 0 \\ -\sqrt{2} & 0 & \sqrt{2} \\ 0 & -\sqrt{2} & 0 \end{pmatrix}$$

3. The state space of a certain physical system is three-dimensional. Let $\{|u_1\rangle, |u_2\rangle, |u_3\rangle\}$ be an orthonormal basis of this space. The kets $|\psi_0\rangle$ and $|\psi_1\rangle$ are defined by:

$$|\psi_0\rangle = \frac{1}{\sqrt{2}}|u_1\rangle + \frac{i}{2}|u_2\rangle + \frac{1}{2}|u_3\rangle$$

$$|\psi_1\rangle = \frac{1}{\sqrt{3}}|u_1\rangle + \frac{i}{\sqrt{3}}|u_3\rangle$$

a. Are these kets normalized?

b. Calculate the matrices $\rho_0$ and $\rho_1$ representing, in the $\{|u_1\rangle, |u_2\rangle, |u_3\rangle\}$ basis, the projection operators onto the state $|\psi_0\rangle$ and onto the state $|\psi_1\rangle$. Verify that these matrices are Hermitian.

4. Let $K$ be the operator defined by $K = |\varphi\rangle\langle\psi|$, where $|\varphi\rangle$ and $|\psi\rangle$ are two vectors of the state space.

a. Under what condition is $K$ Hermitian?

b. Calculate $K^2$. Under what condition is $K$ a projector?

c. Show that $K$ can always be written in the form $K = \lambda P_1 P_2$ where $\lambda$ is a constant to be calculated and $P_1$ and $P_2$ are projectors.

5. Let $P_1$ be the orthogonal projector onto the subspace $\mathcal{E}_1$, $P_2$ the orthogonal projector onto the subspace $\mathcal{E}_2$. Show that, for the product $P_1 P_2$ to be an orthogonal projector as well, it is necessary and sufficient that $P_1$ and $P_2$ commute. In this case, what is the subspace onto which $P_1 P_2$ projects?

6. The $\sigma_x$ matrix is defined by:

$$\sigma_x = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}$$

Prove the relation:

$$e^{i\alpha\sigma_x} = I\cos\alpha + i\,\sigma_x\sin\alpha$$

where $I$ is the $2 \times 2$ unit matrix.


7. Establish, for the $\sigma_y$ matrix given in exercise 2, a relation analogous to the
one proved for $\sigma_x$ in the preceding exercise. Generalize for all matrices of the
form:

$$\sigma_u = \lambda\sigma_x + \mu\sigma_y$$




with:

$$\lambda^2 + \mu^2 = 1$$

Calculate the matrices representing $e^{2i\sigma_x}$, $(e^{i\sigma_x})^2$ and $e^{i(\sigma_x + \sigma_y)}$. Is $e^{2i\sigma_x}$ equal
to $(e^{i\sigma_x})^2$? Is $e^{i(\sigma_x + \sigma_y)}$ equal to $e^{i\sigma_x}e^{i\sigma_y}$?

8. Consider the Hamiltonian $H$ of a particle in a one-dimensional problem
defined by:

$$H = \frac{1}{2m}P^2 + V(X)$$

where $X$ and $P$ are the operators defined in § E of chapter II and which satisfy the
relation $[X, P] = i\hbar$. The eigenvectors of $H$ are denoted by $|\varphi_n\rangle$: $H|\varphi_n\rangle = E_n|\varphi_n\rangle$,
where $n$ is a discrete index.

a. Show that:

$$\langle\varphi_n|P|\varphi_{n'}\rangle = \alpha\,\langle\varphi_n|X|\varphi_{n'}\rangle$$

where $\alpha$ is a coefficient which depends on the difference between $E_n$ and $E_{n'}$.
Calculate $\alpha$ (hint: consider the commutator $[X, H]$).

b. From this, deduce, using the closure relation, the equation:

$$\sum_{n'} (E_n - E_{n'})^2\,|\langle\varphi_n|X|\varphi_{n'}\rangle|^2 = \frac{\hbar^2}{m^2}\,\langle\varphi_n|P^2|\varphi_n\rangle$$

9. Let $H$ be the Hamiltonian operator of a physical system. Denote by $|\varphi_n\rangle$ the
eigenvectors of $H$, with eigenvalues $E_n$:

$$H|\varphi_n\rangle = E_n|\varphi_n\rangle$$

a. For an arbitrary operator $A$, prove the relation:

$$\langle\varphi_n|[A, H]|\varphi_n\rangle = 0$$

b. Consider a one-dimensional problem, where the physical system is a
particle of mass $m$ and of potential energy $V(X)$. In this case, $H$ is written:

$$H = \frac{1}{2m}P^2 + V(X)$$

α. In terms of $P$, $X$ and $V(X)$, find the commutators $[H, P]$, $[H, X]$
and $[H, XP]$.

β. Show that the matrix element $\langle\varphi_n|P|\varphi_n\rangle$ (which we shall interpret
in chapter III as the mean value of the momentum in the state $|\varphi_n\rangle$) is zero.

γ. Establish a relation between $E_k = \langle\varphi_n|\frac{P^2}{2m}|\varphi_n\rangle$ (the mean value of the
kinetic energy in the state $|\varphi_n\rangle$) and $\langle\varphi_n|X\frac{dV}{dX}|\varphi_n\rangle$. Since the mean value of
the potential energy in the state $|\varphi_n\rangle$ is $\langle\varphi_n|V(X)|\varphi_n\rangle$, how is it related to the
mean value of the kinetic energy when:

$$V(X) = V_0 X^\lambda \qquad (\lambda = 2, 4, 6, \ldots;\ V_0 > 0)\,?$$


10. Using the relation $\langle x|p\rangle = (2\pi\hbar)^{-1/2}\,e^{ipx/\hbar}$, find the expressions $\langle x|XP|\psi\rangle$
and $\langle x|PX|\psi\rangle$ in terms of $\psi(x)$. Can these results be found directly by using
the fact that in the $\{|x\rangle\}$ representation, $P$ acts like $\frac{\hbar}{i}\frac{d}{dx}$?

Sets of commuting observables and C.S.C.O.'s

11. Consider a physical system whose three-dimensional state space is spanned
by the orthonormal basis formed by the three kets $|u_1\rangle$, $|u_2\rangle$, $|u_3\rangle$. In the basis
of these three vectors, taken in this order, the two operators $H$ and $B$ are
defined by:

$$H = \hbar\omega_0\begin{pmatrix} 1 & 0 & 0 \\ 0 & -1 & 0 \\ 0 & 0 & -1 \end{pmatrix} \qquad B = b\begin{pmatrix} 1 & 0 & 0 \\ 0 & 0 & 1 \\ 0 & 1 & 0 \end{pmatrix}$$

where $\omega_0$ and $b$ are real constants.

a. Are $H$ and $B$ Hermitian?

b. Show that $H$ and $B$ commute. Give a basis of eigenvectors common to $H$
and $B$.

c. Of the sets of operators $\{H\}$, $\{B\}$, $\{H, B\}$, $\{H^2, B\}$, which form
a C.S.C.O.?

12. In the same state space as that of the preceding exercise, consider two
operators $L_z$ and $S$ defined by:

$$L_z|u_1\rangle = |u_1\rangle \qquad L_z|u_2\rangle = 0 \qquad L_z|u_3\rangle = -|u_3\rangle$$

$$S|u_1\rangle = |u_3\rangle \qquad S|u_2\rangle = |u_2\rangle \qquad S|u_3\rangle = |u_1\rangle$$

a. Write the matrices which represent, in the $\{|u_1\rangle, |u_2\rangle, |u_3\rangle\}$ basis,
the operators $L_z$, $L_z^2$, $S$, $S^2$. Are these operators observables?

b. Give the form of the most general matrix which represents an operator
which commutes with $L_z$. Same question for $L_z^2$, then for $S^2$.

c. Do $L_z^2$ and $S$ form a C.S.C.O.? Give a basis of common eigenvectors.
Solution of exercise 11


a. $H$ and $B$ are Hermitian because the matrices which correspond to them
are symmetric and real.

b. $|u_1\rangle$ is an eigenvector common to $H$ and $B$. We therefore have, obviously,
$HB\,|u_1\rangle = BH\,|u_1\rangle$. We see, then, that for $H$ and $B$ to commute, it is sufficient
that the restrictions of these operators to the subspace $\mathcal{E}_2$, spanned by $|u_2\rangle$
and $|u_3\rangle$, commute. Now, in this subspace, the matrix representing $H$ is equal
to $-\hbar\omega_0 I$ (where $I$ is the $2 \times 2$ unit matrix), which commutes with all $2 \times 2$ matrices.
$H$ and $B$ therefore commute (this result could, of course, be obtained by calculating
directly the matrices $HB$ and $BH$). The restriction of $B$ to $\mathcal{E}_2$ is written:

$$P_2 B P_2 = b\begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}$$

The normalized eigenvectors of this $2 \times 2$ matrix are easy to obtain; they are:

$$|p_2\rangle = \frac{1}{\sqrt{2}}\big[\,|u_2\rangle + |u_3\rangle\,\big] \qquad (\text{eigenvalue } +b)$$

$$|p_3\rangle = \frac{1}{\sqrt{2}}\big[\,|u_2\rangle - |u_3\rangle\,\big] \qquad (\text{eigenvalue } -b)$$

These vectors are automatically eigenvectors of $H$ since $\mathcal{E}_2$ is the eigensubspace
of $H$ corresponding to the eigenvalue $-\hbar\omega_0$. To summarize, the eigenvectors
common to $H$ and $B$ are given by:

$$\begin{array}{lcc}
\text{eigenvector} & \text{eigenvalue of } H & \text{eigenvalue of } B \\
|p_1\rangle = |u_1\rangle & \hbar\omega_0 & b \\
|p_2\rangle = \frac{1}{\sqrt{2}}\big[\,|u_2\rangle + |u_3\rangle\,\big] & -\hbar\omega_0 & b \\
|p_3\rangle = \frac{1}{\sqrt{2}}\big[\,|u_2\rangle - |u_3\rangle\,\big] & -\hbar\omega_0 & -b
\end{array}$$

These vectors are the only (to within, of course, a phase factor) normalized eigenvectors
common to $H$ and $B$.

c. It can be seen from the table that $H$ has a two-fold degenerate
eigenvalue; it is therefore not a C.S.C.O. Similarly, $B$ also has a two-fold degenerate
eigenvalue and is therefore not a C.S.C.O.: an eigenvector of $B$ with the
eigenvalue $b$ can be $|p_1\rangle$, or $|p_2\rangle$, or $\frac{1}{\sqrt{3}}|u_1\rangle + \frac{1}{\sqrt{3}}|u_2\rangle + \frac{1}{\sqrt{3}}|u_3\rangle$, for
example. On the other hand, the set of the two operators $H$ and $B$ does constitute
a C.S.C.O. We see from the above table that no two vectors $|p_j\rangle$ have the same
eigenvalues for both $H$ and $B$. This is why, as has already been pointed out, the
system of normalized eigenvectors common to $H$ and $B$ is unique (to within phase
factors). Note that within the eigensubspace $\mathcal{E}_2$ of $H$ associated with the eigenvalue
$-\hbar\omega_0$, the eigenvalues of $B$ are distinct ($b$ and $-b$). Similarly, in the
eigensubspace of $B$ spanned by $|p_1\rangle$ and $|p_2\rangle$, the eigenvalues of $H$ are distinct
($\hbar\omega_0$ and $-\hbar\omega_0$).

$H^2$ has for eigenvectors, with the eigenvalue $\hbar^2\omega_0^2$, $|p_1\rangle$, $|p_2\rangle$ and $|p_3\rangle$.
It is easy to see that $H^2$ and $B$ do not constitute a C.S.C.O., since two linearly
independent eigenvectors $|p_1\rangle$ and $|p_2\rangle$ correspond to the pair of eigenvalues
$\{\hbar^2\omega_0^2, b\}$.
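
The conclusions of this solution can be checked numerically. The following is only an illustrative sketch, written in units where $\hbar\omega_0 = 1$ and $b = 1$ (a choice made here for convenience, not part of the exercise); the helper names `matmul`, `apply` and `is_eigenvector` are hypothetical.

```python
# Check of exercise 11 in units where hbar*omega_0 = 1 and b = 1 (assumed units).

def matmul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def apply(a, v):
    """Action of the matrix a on the column vector v."""
    return [sum(a[i][k] * v[k] for k in range(len(v))) for i in range(len(a))]

def is_eigenvector(a, v, lam, tol=1e-12):
    """True if a v = lam v to within the tolerance tol."""
    return all(abs(x - lam * y) < tol for x, y in zip(apply(a, v), v))

H = [[1, 0, 0], [0, -1, 0], [0, 0, -1]]
B = [[1, 0, 0], [0, 0, 1], [0, 1, 0]]

s = 2 ** -0.5
# (vector, eigenvalue of H, eigenvalue of B), following the table of the solution
common = [
    ([1.0, 0.0, 0.0], 1, 1),    # |p_1>
    ([0.0, s, s], -1, 1),       # |p_2>
    ([0.0, s, -s], -1, -1),     # |p_3>
]

commute = matmul(H, B) == matmul(B, H)
all_common = all(is_eigenvector(H, v, h) and is_eigenvector(B, v, b)
                 for v, h, b in common)

# {H, B} is a C.S.C.O. if no two rows share the same pair of eigenvalues.
pairs_H_B = [(h, b) for _, h, b in common]
pairs_H2_B = [(h * h, b) for _, h, b in common]
hb_is_csco = len(set(pairs_H_B)) == 3
h2b_is_csco = len(set(pairs_H2_B)) == 3
```

With these matrices, `hb_is_csco` comes out true while `h2b_is_csco` comes out false, which mirrors the argument of part c: squaring $H$ erases the sign that distinguishes $|p_1\rangle$ from $|p_2\rangle$.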


Solution of exercise 12 

a. Let us use the rule for constructing the matrix of an operator: "in the nth
column of the matrix, write the components of the transform of the nth basis
vector under the operator". We obtain easily:

$$L_z = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & -1 \end{pmatrix} \qquad S = \begin{pmatrix} 0 & 0 & 1 \\ 0 & 1 & 0 \\ 1 & 0 & 0 \end{pmatrix}$$

$$L_z^2 = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & 1 \end{pmatrix} \qquad S^2 = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}$$

These matrices are symmetric and real, and therefore Hermitian. Since the space
is finite-dimensional, they can be diagonalized and therefore represent observables.

b. Let $M$ be an operator which commutes with $L_z$. $M$ cannot (cf. chap. II,
§D-3-a) have any matrix elements between $|u_1\rangle$ and $|u_2\rangle$, or between $|u_2\rangle$
and $|u_3\rangle$, or between $|u_1\rangle$ and $|u_3\rangle$ (eigenvectors of $L_z$ with different eigenvalues).
The matrix which represents $M$ is therefore necessarily diagonal, that is, of the
form:

$$[M, L_z] = 0 \implies M = \begin{pmatrix} m_{11} & 0 & 0 \\ 0 & m_{22} & 0 \\ 0 & 0 & m_{33} \end{pmatrix}$$

Let $N$ be an operator which commutes with $L_z^2$. The matrix representing $N$ can have
elements between $|u_1\rangle$ and $|u_3\rangle$ (eigenvectors of $L_z^2$ with the same eigenvalue), but
none between $|u_2\rangle$ and $|u_1\rangle$ or $|u_3\rangle$. $N$ is therefore written:

$$[N, L_z^2] = 0 \implies N = \begin{pmatrix} n_{11} & 0 & n_{13} \\ 0 & n_{22} & 0 \\ n_{31} & 0 & n_{33} \end{pmatrix}$$

It is therefore less restrictive to impose the condition that an operator commute
with $L_z^2$ than with $L_z$: $N$ is not necessarily a diagonal matrix. It can only be said
that $N$ does not mix the vectors of the subspace spanned by $|u_1\rangle$ and $|u_3\rangle$
with those of the one-dimensional subspace spanned by $|u_2\rangle$. This property,
No two lines are alike in the table of eigenvalues of $L_z^2$ and $S$: these two operators
therefore form a C.S.C.O. (this is not, however, the case for either one of them
taken alone).
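
As a hedged numerical illustration of this conclusion, the sketch below checks that $L_z^2$ and $S$ commute and that three candidate common eigenvectors carry pairwise distinct pairs of eigenvalues. The candidate vectors $|u_2\rangle$ and $(|u_1\rangle \pm |u_3\rangle)/\sqrt{2}$ are worked out here for the example, and the helper names are hypothetical.

```python
# Check that L_z^2 and S (exercise 12) commute and together label a unique basis.

def matmul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def apply(a, v):
    """Action of the matrix a on the column vector v."""
    return [sum(a[i][k] * v[k] for k in range(len(v))) for i in range(len(a))]

def is_eigenvector(a, v, lam, tol=1e-12):
    """True if a v = lam v to within the tolerance tol."""
    return all(abs(x - lam * y) < tol for x, y in zip(apply(a, v), v))

LZ2 = [[1, 0, 0], [0, 0, 0], [0, 0, 1]]
S = [[0, 0, 1], [0, 1, 0], [1, 0, 0]]

s = 2 ** -0.5
# (vector, eigenvalue of L_z^2, eigenvalue of S); candidates assumed for the check
candidates = [
    ([0.0, 1.0, 0.0], 0, 1),    # |u_2>
    ([s, 0.0, s], 1, 1),        # (|u_1> + |u_3>) / sqrt(2)
    ([s, 0.0, -s], 1, -1),      # (|u_1> - |u_3>) / sqrt(2)
]

commute = matmul(LZ2, S) == matmul(S, LZ2)
all_common = all(is_eigenvector(LZ2, v, l) and is_eigenvector(S, v, e)
                 for v, l, e in candidates)

pairs = [(l, e) for _, l, e in candidates]
pair_is_csco = len(set(pairs)) == 3                       # no two rows alike
lz2_alone = len({l for _, l, _ in candidates}) == 3       # False: eigenvalue 1 repeats
s_alone = len({e for _, _, e in candidates}) == 3         # False: eigenvalue 1 repeats
```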
In classical mechanics, the motion of any physical system is determined if
the position $\mathbf{r}(x, y, z)$ and velocity $\mathbf{v}(\dot{x}, \dot{y}, \dot{z})$ of each of its points are known as a
function of time. In general (appendix III), to describe such a system, one introduces
generalized coordinates $q_i(t)$ ($i = 1, 2, \ldots, N$), whose derivatives with respect to
time, $\dot{q}_i(t)$, are the generalized velocities. Specifying the $q_i(t)$ and $\dot{q}_i(t)$ enables us
to calculate, at any given instant, the position and velocity of any point of the
system. Using the Lagrangian $\mathcal{L}(q_i, \dot{q}_i, t)$, one defines the conjugate momentum $p_i$
of each of the generalized coordinates $q_i$:

$$p_i = \frac{\partial\mathcal{L}}{\partial\dot{q}_i} \tag{A-1}$$

The $q_i(t)$ and $p_i(t)$ ($i = 1, 2, \ldots, N$) are called the fundamental dynamical
variables. All the physical quantities associated with the system (energy, angular
momentum, etc.) can be expressed in terms of the fundamental dynamical
variables. For example, the total energy of the system is given by the classical
Hamiltonian $\mathcal{H}(q_i, p_i, t)$. The motion of the system can be studied by using
either Lagrange's equations or the Hamilton-Jacobi canonical equations, which
are written:

$$\frac{dq_i}{dt} = \frac{\partial\mathcal{H}}{\partial p_i} \tag{A-2-a}$$

$$\frac{dp_i}{dt} = -\frac{\partial\mathcal{H}}{\partial q_i} \tag{A-2-b}$$

In the special case of a system consisting of a single physical point of mass $m$,
the $q_i$ are simply the three coordinates of this point, and the $\dot{q}_i$ are the components
of its velocity $\mathbf{v}$. If the forces acting on this particle can be derived from a
scalar potential $V(\mathbf{r}, t)$, the three conjugate momenta of its position $\mathbf{r}$ (that is,
the components of its linear momentum $\mathbf{p}$) are equal to the components of
its mechanical momentum $m\mathbf{v}$. The total energy is then written:

$$E = \frac{\mathbf{p}^2}{2m} + V(\mathbf{r}, t) \tag{A-3}$$

and the angular momentum with respect to the origin:

$$\boldsymbol{\mathcal{L}} = \mathbf{r} \times \mathbf{p} \tag{A-4}$$

Since $\mathcal{H}(\mathbf{r}, \mathbf{p}, t) = \frac{\mathbf{p}^2}{2m} + V(\mathbf{r}, t)$, the Hamilton-Jacobi equations (A-2) here take on
the well-known form:

$$\frac{d\mathbf{r}}{dt} = \frac{\mathbf{p}}{m} \tag{A-5-a}$$

$$\frac{d\mathbf{p}}{dt} = -\nabla V \tag{A-5-b}$$

The classical description of a physical system can therefore be summarized 
as follows: 

(i) The state of the system at a fixed time $t_0$ is defined by specifying $N$
generalized coordinates $q_i(t_0)$ and their $N$ conjugate momenta $p_i(t_0)$.

(ii) The value, at a given time, of the various physical quantities is
completely determined when the state of the system at this time is known:
knowing the state of the system, one can predict with certainty the result of any
measurement performed at time $t_0$.

(iii) The time evolution of the state of the system is given by the Hamilton-Jacobi
equations. Since these are first-order differential equations, their solution
$\{q_i(t), p_i(t)\}$ is unique if the value of these functions at a given time $t_0$ is fixed,
$\{q_i(t_0), p_i(t_0)\}$. The state of the system is known for all time if its initial state
is known.
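
Point (iii) can be illustrated with a minimal numerical sketch. It assumes a one-dimensional harmonic potential $V(q) = kq^2/2$ with the hypothetical values $m = k = 1$, and integrates the canonical equations with a leapfrog scheme; neither the potential nor the integrator comes from the text, and the function name `evolve` is made up for the example.

```python
import math

def evolve(q, p, t_total, steps, m=1.0, k=1.0):
    """Integrate dq/dt = p/m and dp/dt = -dV/dq for V(q) = k q^2 / 2.

    A leapfrog (half kick, drift, half kick) scheme is used; the initial
    state (q, p) alone fixes the state at every later time.
    """
    dt = t_total / steps
    for _ in range(steps):
        p -= 0.5 * dt * k * q      # half step in momentum
        q += dt * p / m            # full step in position
        p -= 0.5 * dt * k * q      # second half step in momentum
    return q, p

# Initial state q(0) = 1, p(0) = 0; the exact solution is q = cos t, p = -sin t.
q_end, p_end = evolve(1.0, 0.0, math.pi, 10000)
energy_start = 0.5 * 0.0 ** 2 + 0.5 * 1.0 ** 2
energy_end = 0.5 * p_end ** 2 + 0.5 * q_end ** 2
same_result = evolve(1.0, 0.0, math.pi, 10000) == (q_end, p_end)
```

Running the integration twice from the same initial state gives the same final state, and the result stays close to the exact solution $q(\pi) = -1$, $p(\pi) = 0$.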

In this chapter, we shall study the postulates on which the quantum 
description of physical systems is based. We have already introduced them, 
in a qualitative and partial way, in chapter I. Here we shall discuss them 
explicitly, within the framework of the formalism developed in chapter II. 
These postulates will provide us with an answer to the following questions 
(which correspond to the three points enumerated above for the classical 
description): 

(i) How is the state of a quantum system at a given time described 
mathematically ? 

(ii) Given this state, how can we predict the results of the measurement of 
various physical quantities ? 

(iii) How can the state of the system at an arbitrary time t be found 
when the state at time t 0 is known ? 

We shall begin by stating the postulates of quantum mechanics (§B). 
Then we shall analyze their physical content and discuss their consequences 
(§§C, D, E). 


B. STATEMENT OF THE POSTULATES 

1. Description of the state of a system 

In chapter I, we introduced the concept of the quantum state of a particle.
We first characterized this state at a given time by a square-integrable wave
function. Then, in chapter II, we associated a ket of the state space $\mathcal{E}_r$ with
each wave function: choosing $|\psi\rangle$ belonging to $\mathcal{E}_r$ is equivalent to choosing
the corresponding function $\psi(\mathbf{r}) = \langle\mathbf{r}|\psi\rangle$. Therefore, the quantum state of
a particle at a fixed time is characterized by a ket of the space $\mathcal{E}_r$. In this
form, the concept of a state can be generalized to any physical system.


First Postulate: At a fixed time $t_0$, the state of a physical system is defined
by specifying a ket $|\psi(t_0)\rangle$ belonging to the state space $\mathcal{E}$.


It is important to note that, since $\mathcal{E}$ is a vector space, this first postulate
implies a superposition principle: a linear combination of state vectors is a
state vector. We shall discuss this fundamental point and its relations to
the other postulates in §E.

2. Description of physical quantities 

We have already used, in §D-1 of chapter I, a differential operator H 
related to the total energy of a particle in a scalar potential. This is simply a 
special case of the second postulate. 


Second Postulate: Every measurable physical quantity $\mathcal{A}$ is described by an
operator $A$ acting in $\mathcal{E}$; this operator is an observable.


COMMENTS:

(i) The fact that $A$ is an observable (cf. chap. II, §D-2) will be seen below (§3)
to be essential.

(ii) Unlike classical mechanics (cf. §A), quantum mechanics describes in a fundamentally
different manner the state of a system and the associated physical
quantities: a state is represented by a vector, a physical quantity by an
operator.

3. The measurement of physical quantities 

a. POSSIBLE RESULTS 

The connection between the operator H and the total energy of the particle 
appeared in §D-1 of chapter I in the following form : the only energies possible 
are the eigenvalues of the operator H. Here as well, this relation can be 
extended to all physical quantities. 


Third Postulate: The only possible result of the measurement of a physical
quantity $\mathcal{A}$ is one of the eigenvalues of the corresponding observable $A$.


COMMENTS:


(i) A measurement of $\mathcal{A}$ always gives a real value, since $A$ is by definition
Hermitian.

(ii) If the spectrum of $A$ is discrete, the results that can be obtained by
measuring $\mathcal{A}$ are quantized (§ C-2).

b. PRINCIPLE OF SPECTRAL DECOMPOSITION 

We are going to generalize and discuss in more detail the conclusions 
of § A-3 of chapter I, where we analyzed a simple experiment performed on 
polarized photons. 

Consider a system whose state is characterized, at a given time, by the
ket $|\psi\rangle$, assumed to be normalized to 1:

$$\langle\psi|\psi\rangle = 1 \tag{B-1}$$

We want to predict the result of the measurement, at this time, of a physical
quantity $\mathcal{A}$ associated with the observable $A$. This prediction, as we already
know, is of a probabilistic sort. We are now going to give the rules which
allow us to calculate the probability of obtaining any given eigenvalue of $A$.

α. Case of a discrete spectrum

First, let us assume that the spectrum of $A$ is entirely discrete. If all
the eigenvalues $a_n$ of $A$ are non-degenerate, there is associated with each of them
a unique (to within a constant factor) eigenvector $|u_n\rangle$:

$$A\,|u_n\rangle = a_n\,|u_n\rangle \tag{B-2}$$

Since $A$ is an observable, the set of the $|u_n\rangle$, which we shall take to be
normalized, constitutes a basis in $\mathcal{E}$, and the state vector $|\psi\rangle$ can be written:

$$|\psi\rangle = \sum_n c_n\,|u_n\rangle \tag{B-3}$$

We postulate that the probability $\mathcal{P}(a_n)$ of finding $a_n$ when $\mathcal{A}$ is measured is:

$$\mathcal{P}(a_n) = |c_n|^2 = |\langle u_n|\psi\rangle|^2 \tag{B-4}$$


Fourth Postulate (case of a discrete non-degenerate spectrum): When the
physical quantity $\mathcal{A}$ is measured on a system in the normalized state $|\psi\rangle$,
the probability $\mathcal{P}(a_n)$ of obtaining the non-degenerate eigenvalue $a_n$ of the
corresponding observable $A$ is:

$$\mathcal{P}(a_n) = |\langle u_n|\psi\rangle|^2$$

where $|u_n\rangle$ is the normalized eigenvector of $A$ associated with the eigenvalue
$a_n$.

If, now, some of the eigenvalues $a_n$ are degenerate, several orthonormalized
eigenvectors $|u_n^i\rangle$ correspond to them:

$$A\,|u_n^i\rangle = a_n\,|u_n^i\rangle\,; \qquad i = 1, 2, \ldots, g_n \tag{B-5}$$

$|\psi\rangle$ can still be expanded in the orthonormal basis $\{|u_n^i\rangle\}$:

$$|\psi\rangle = \sum_n \sum_{i=1}^{g_n} c_n^i\,|u_n^i\rangle \tag{B-6}$$

In this case, the probability $\mathcal{P}(a_n)$ becomes:

$$\mathcal{P}(a_n) = \sum_{i=1}^{g_n} |c_n^i|^2 = \sum_{i=1}^{g_n} |\langle u_n^i|\psi\rangle|^2 \tag{B-7}$$

(B-4) is then seen to be a special case of (B-7), which can therefore be
considered to be the general formula.


Fourth Postulate (case of a discrete spectrum): When the physical quantity $\mathcal{A}$ is
measured on a system in the normalized state $|\psi\rangle$, the probability $\mathcal{P}(a_n)$ of
obtaining the eigenvalue $a_n$ of the corresponding observable $A$ is:

$$\mathcal{P}(a_n) = \sum_{i=1}^{g_n} |\langle u_n^i|\psi\rangle|^2$$

where $g_n$ is the degree of degeneracy of $a_n$ and $\{|u_n^i\rangle\}$ ($i = 1, 2, \ldots, g_n$) is an
orthonormal set of vectors which forms a basis in the eigensubspace $\mathcal{E}_n$
associated with the eigenvalue $a_n$ of $A$.


For this postulate to make sense, it is obviously necessary that, if the eigenvalue
$a_n$ is degenerate, the probability $\mathcal{P}(a_n)$ be independent of the choice of
the $\{|u_n^i\rangle\}$ basis in $\mathcal{E}_n$. To verify this, consider the vector:

$$|\psi_n\rangle = \sum_{i=1}^{g_n} c_n^i\,|u_n^i\rangle \tag{B-8}$$

where the coefficients $c_n^i$ are the same as those appearing in the expansion (B-6)
of $|\psi\rangle$:

$$c_n^i = \langle u_n^i|\psi\rangle \tag{B-9}$$

$|\psi_n\rangle$ is the part of $|\psi\rangle$ which belongs to $\mathcal{E}_n$, that is, the projection of $|\psi\rangle$ onto $\mathcal{E}_n$.
This is, moreover, what we find when we substitute (B-9) into (B-8):

$$|\psi_n\rangle = \sum_{i=1}^{g_n} |u_n^i\rangle\langle u_n^i|\psi\rangle = P_n\,|\psi\rangle \tag{B-10}$$

where:

$$P_n = \sum_{i=1}^{g_n} |u_n^i\rangle\langle u_n^i| \tag{B-11}$$

is the projector onto $\mathcal{E}_n$ (§ B-3-b of chapter II). Let us now calculate the square of
the norm of $|\psi_n\rangle$. From (B-8):

$$\langle\psi_n|\psi_n\rangle = \sum_{i=1}^{g_n} |c_n^i|^2 \tag{B-12}$$

Therefore, $\mathcal{P}(a_n)$ is the square of the norm of $|\psi_n\rangle = P_n|\psi\rangle$, the projection of $|\psi\rangle$
onto $\mathcal{E}_n$. From this expression, it is clear that a change in the basis in $\mathcal{E}_n$ does not
affect $\mathcal{P}(a_n)$. This probability is written:

$$\mathcal{P}(a_n) = \langle\psi|P_n^\dagger P_n|\psi\rangle \tag{B-13}$$

or, using the fact that $P_n$ is Hermitian ($P_n^\dagger = P_n$) and that it is a projector ($P_n^2 = P_n$):

$$\mathcal{P}(a_n) = \langle\psi|P_n|\psi\rangle \tag{B-14}$$
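
The independence of the probability from the basis chosen in a degenerate eigensubspace can be checked on a small example. The sketch below is illustrative only: it assumes a three-dimensional state space in which the first two basis vectors span a two-fold degenerate eigensubspace, and the state and the two bases are chosen arbitrarily for the demonstration.

```python
def inner(u, v):
    """Scalar product <u|v>, antilinear in the first argument."""
    return sum(x.conjugate() * y for x, y in zip(u, v))

def probability(basis, psi):
    """Sum over i of |<u_i|psi>|^2 for an orthonormal basis of the eigensubspace."""
    return sum(abs(inner(u, psi)) ** 2 for u in basis)

s = 2 ** -0.5
psi = [0.5 + 0j, 0.5j, s + 0j]            # normalized: 1/4 + 1/4 + 1/2 = 1

# Two orthonormal bases of the same two-dimensional eigensubspace.
basis_a = [[1 + 0j, 0j, 0j], [0j, 1 + 0j, 0j]]
basis_b = [[s + 0j, s * 1j, 0j], [s + 0j, -s * 1j, 0j]]

p_a = probability(basis_a, psi)
p_b = probability(basis_b, psi)
```

Both bases give the same probability, equal to the squared norm of the projection of the state onto the eigensubspace.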

β. Case of a continuous spectrum

Now let us assume that the spectrum of $A$ is continuous and, for the sake
of simplicity, non-degenerate. The system, orthonormal in the extended sense, of
eigenvectors $|v_\alpha\rangle$ of $A$:

$$A\,|v_\alpha\rangle = \alpha\,|v_\alpha\rangle \tag{B-15}$$

forms a continuous basis in $\mathcal{E}$, in terms of which $|\psi\rangle$ can be expanded:

$$|\psi\rangle = \int d\alpha\; c(\alpha)\,|v_\alpha\rangle \tag{B-16}$$

Since the possible results of a measurement of $\mathcal{A}$ form a continuous set, we must
define a probability density, just as we did for the interpretation of the wave
function of a particle (§ B-2 of chapter I). The probability $d\mathcal{P}(\alpha)$ of obtaining
a value included between $\alpha$ and $\alpha + d\alpha$ is given by:

$$d\mathcal{P}(\alpha) = \rho(\alpha)\,d\alpha$$

with:

$$\rho(\alpha) = |c(\alpha)|^2 = |\langle v_\alpha|\psi\rangle|^2 \tag{B-17}$$


Fourth Postulate (case of a continuous non-degenerate spectrum): When the
physical quantity $\mathcal{A}$ is measured on a system in the normalized state $|\psi\rangle$,
the probability $d\mathcal{P}(\alpha)$ of obtaining a result included between $\alpha$ and $\alpha + d\alpha$
is equal to:

$$d\mathcal{P}(\alpha) = |\langle v_\alpha|\psi\rangle|^2\,d\alpha$$

where $|v_\alpha\rangle$ is the eigenvector corresponding to the eigenvalue $\alpha$ of the observable
$A$ associated with $\mathcal{A}$.
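
As an illustration of a probability density, consider the position of a particle in one dimension, for which $\rho(x) = |\psi(x)|^2$. The sketch below assumes a normalized Gaussian wave function $\psi(x) = \pi^{-1/4} e^{-x^2/2}$ (an arbitrary choice made for the example) and integrates the density with the trapezoidal rule; the helper names are hypothetical.

```python
import math

def rho(x):
    """Probability density |psi(x)|^2 for psi(x) = pi**(-1/4) * exp(-x^2 / 2)."""
    psi = math.pi ** -0.25 * math.exp(-x * x / 2)
    return abs(psi) ** 2

def integrate(f, a, b, n=20000):
    """Trapezoidal approximation of the integral of f over [a, b] with n panels."""
    h = (b - a) / n
    return h * (0.5 * f(a) + sum(f(a + i * h) for i in range(1, n)) + 0.5 * f(b))

# Total probability (the tails beyond |x| = 10 are negligible for this Gaussian).
total = integrate(rho, -10.0, 10.0)

# Probability of finding the particle between x = 0 and x = 1.
p_interval = integrate(rho, 0.0, 1.0)
exact_interval = math.erf(1.0) / 2
```

The total probability is 1 to within the quadrature error, and the probability of a finite interval is the integral of the density over that interval.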


COMMENTS:

(i) It can be verified explicitly, in each of the cases considered above, that the
total probability is equal to 1. For example, starting with formula (B-7),
we find:

$$\sum_n \mathcal{P}(a_n) = \sum_n \sum_{i=1}^{g_n} |c_n^i|^2 = \langle\psi|\psi\rangle = 1 \tag{B-18}$$

since $|\psi\rangle$ is normalized. This last condition is therefore indispensable if the
statements we have made are to be coherent. Nevertheless, it is not essential:
if it is not fulfilled, it suffices to replace (B-7) and (B-17), respectively, by:

$$\mathcal{P}(a_n) = \frac{1}{\langle\psi|\psi\rangle}\sum_{i=1}^{g_n} |c_n^i|^2 \tag{B-19}$$

and:

$$\rho(\alpha) = \frac{1}{\langle\psi|\psi\rangle}\,|c(\alpha)|^2 \tag{B-20}$$

(ii) For the fourth postulate to be coherent, it is necessary for the operator $A$
associated with any physical quantity to be an observable: it must be possible
to expand any state on the eigenvectors of $A$.

(iii) We have not given the fourth postulate in its most general form. Starting
with the discussion of the cases we have envisaged, it is simple to extend
the principle of spectral decomposition to any situation (continuous degenerate
spectrum, partially continuous and partially discrete spectrum, etc.).
In §E, and later in chapter IV, we shall apply this fourth postulate to a
certain number of examples, pointing out certain implications of the superposition
principle mentioned in § B-1.

γ. An important consequence

Consider two kets $|\psi\rangle$ and $|\psi'\rangle$ such that:

$$|\psi'\rangle = e^{i\theta}\,|\psi\rangle \tag{B-21}$$

where $\theta$ is a real number. If $|\psi\rangle$ is normalized, so is $|\psi'\rangle$:

$$\langle\psi'|\psi'\rangle = \langle\psi|\,e^{-i\theta}e^{i\theta}\,|\psi\rangle = \langle\psi|\psi\rangle \tag{B-22}$$

The probabilities predicted for an arbitrary measurement are the same for $|\psi\rangle$
and $|\psi'\rangle$ since, for any $|u_n^i\rangle$:

$$|\langle u_n^i|\psi'\rangle|^2 = |e^{i\theta}\langle u_n^i|\psi\rangle|^2 = |\langle u_n^i|\psi\rangle|^2 \tag{B-23}$$

Similarly, we can replace $|\psi\rangle$ by:

$$|\psi''\rangle = \alpha\,e^{i\theta}\,|\psi\rangle \tag{B-24}$$

without changing any of the physical results: there appear, in both the numerator
and denominator of (B-19) and (B-20), factors of $|\alpha|^2$ which cancel. Therefore,
two proportional state vectors represent the same physical state.

Care must be taken to interpret this result correctly. For example, let us
assume that:

$$|\psi\rangle = \lambda_1|\psi_1\rangle + \lambda_2|\psi_2\rangle \tag{B-25}$$

where $\lambda_1$ and $\lambda_2$ are complex numbers. It is true that $e^{i\theta_1}|\psi_1\rangle$ represents, for all
real $\theta_1$, the same physical state as $|\psi_1\rangle$, and $e^{i\theta_2}|\psi_2\rangle$ represents the same state
as $|\psi_2\rangle$. But, in general:

$$|\varphi\rangle = \lambda_1 e^{i\theta_1}|\psi_1\rangle + \lambda_2 e^{i\theta_2}|\psi_2\rangle \tag{B-26}$$

does not describe the same state as $|\psi\rangle$ (we shall see in § E-1 that the relative
phases of the expansion coefficients of the state vector play an important
role). The exception is the special case where $\theta_1 = \theta_2 + 2n\pi$, that is, where:

$$|\varphi\rangle = e^{i\theta_1}\big[\lambda_1|\psi_1\rangle + \lambda_2|\psi_2\rangle\big] = e^{i\theta_1}|\psi\rangle \tag{B-27}$$

In other words: a global phase factor does not affect the physical predictions, but
the relative phases of the coefficients of an expansion are significant.


c. REDUCTION OF THE WAVE PACKET 

We have already introduced this concept in speaking of the measurement 
of the polarization of photons in the experiment described in §A-3 of chapter I. 
We are now going to generalize it, confining ourselves, nevertheless, to the case 
of a discrete spectrum (we shall take up the case of a continuous spectrum in §E). 

Assume that we want to measure, at a given time, the physical quantity $\mathcal{A}$.
If the ket $|\psi\rangle$, which represents the state of the system immediately before the
measurement, is known, the fourth postulate allows us to predict the probabilities
of obtaining the various possible results. But when the measurement is actually performed,
it is obvious that only one of these possible results is obtained. Immediately
after this measurement, we cannot speak of the "probability of having obtained"
this or that value: we know which one was actually obtained. We therefore
possess additional information, and it is understandable that the state of the
system after the measurement, which must incorporate this information, should be
different from $|\psi\rangle$.

Let us first consider the case where the measurement of $\mathcal{A}$ yields a simple
eigenvalue $a_n$ of the observable $A$. We then postulate that the state of the system
immediately after this measurement is the eigenvector $|u_n\rangle$ associated with $a_n$:

$$|\psi\rangle \;\overset{(a_n)}{\Longrightarrow}\; |u_n\rangle \tag{B-28}$$


COMMENTS:

(i) We have been speaking about states "immediately before" the measurement
($|\psi\rangle$) and "immediately after" ($|u_n\rangle$). The precise meaning of
these expressions is the following: assume that the measurement takes place
at the time $t_0 > 0$, and that we know the state $|\psi(0)\rangle$ of the system at the
time $t = 0$. The sixth postulate (see § 4) describes how the system evolves
over time, that is, enables us to calculate from $|\psi(0)\rangle$ the state $|\psi(t_0)\rangle$
"immediately before" the measurement. If the measurement has yielded the
non-degenerate eigenvalue $a_n$, the state $|\psi'(t_1)\rangle$ at a time $t_1 > t_0$ must be
calculated from $|\psi'(t_0)\rangle = |u_n\rangle$, the state "immediately after" the measurement,
using the sixth postulate to determine the evolution of the state vector
between the times $t_0$ and $t_1$ (fig. 1).

[FIGURE 1: time axis marked $0$, $t_0$, $t_1$; the state evolves from $|\psi(0)\rangle$ to $|\psi(t_0)\rangle$, jumps to $|u_n\rangle$ at the "measurement giving the result $a_n$", then evolves to $|\psi'(t_1)\rangle$.]

FIGURE 1

When a measurement at time $t_0$ of the observable $A$ gives the result $a_n$, the state vector of
the system undergoes an abrupt modification and becomes $|u_n\rangle$. This new initial state then
evolves.

(ii) If we perform a second measurement of $\mathcal{A}$ immediately after the first one
(that is, before the system has had time to evolve), we shall always find the
same result $a_n$, since the state of the system immediately before the second
measurement is $|u_n\rangle$, and no longer $|\psi\rangle$.

When the eigenvalue $a_n$ given by the measurement is degenerate, postulate
(B-28) can be generalized as follows. If the expansion of the state $|\psi\rangle$ immediately
before the measurement is written, with the same notation as in section b:

$$|\psi\rangle = \sum_n \sum_{i=1}^{g_n} c_n^i\,|u_n^i\rangle \tag{B-29}$$

the modification of the state vector due to the measurement is written:

$$|\psi\rangle \;\overset{(a_n)}{\Longrightarrow}\; \frac{1}{\sqrt{\sum_{i=1}^{g_n} |c_n^i|^2}}\;\sum_{i=1}^{g_n} c_n^i\,|u_n^i\rangle \tag{B-30}$$

$\sum_{i=1}^{g_n} c_n^i\,|u_n^i\rangle$ is the vector $|\psi_n\rangle$ defined above [formula (B-8)], that is, the projection
of $|\psi\rangle$ onto the eigensubspace associated with $a_n$. In (B-30), we normalized
this vector since it is always more convenient to use state vectors of norm 1
[comment (i) of §b above]. With the notation of (B-10) and (B-11), we can therefore
write (B-30) in the form:

$$|\psi\rangle \;\overset{(a_n)}{\Longrightarrow}\; \frac{P_n\,|\psi\rangle}{\sqrt{\langle\psi|P_n|\psi\rangle}} \tag{B-31}$$

Fifth Postulate: If the measurement of the physical quantity $\mathcal{A}$ on the system
in the state $|\psi\rangle$ gives the result $a_n$, the state of the system immediately after the
measurement is the normalized projection, $\dfrac{P_n\,|\psi\rangle}{\sqrt{\langle\psi|P_n|\psi\rangle}}$, of $|\psi\rangle$ onto the
eigensubspace associated with $a_n$.

The state of the system immediately after the measurement is therefore
always an eigenvector of $A$ with the eigenvalue $a_n$. We stress the fact, however, that
it is not an arbitrary ket of the subspace $\mathcal{E}_n$, but the part of $|\psi\rangle$ which belongs to $\mathcal{E}_n$
(suitably normalized, for convenience). In the light of § 3-b-γ above, equation (B-28)
can be seen to be a special case of (B-30). When $g_n = 1$, the summation over $i$
disappears from (B-30), which becomes:

$$\frac{1}{|c_n|}\,c_n\,|u_n\rangle = e^{i\,\mathrm{Arg}\,c_n}\,|u_n\rangle \tag{B-32}$$

This ket indeed describes the same physical state as $|u_n\rangle$.
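
The fifth postulate can be sketched numerically as "project, then normalize". The example below is only an illustration under stated assumptions: a three-dimensional state space, a two-fold degenerate eigensubspace spanned by the first two basis vectors, and an arbitrarily chosen state; the function names `project` and `collapse` are hypothetical.

```python
def inner(u, v):
    """Scalar product <u|v>, antilinear in the first argument."""
    return sum(x.conjugate() * y for x, y in zip(u, v))

def project(basis, psi):
    """P_n|psi> = sum_i |u_i><u_i|psi> for an orthonormal basis of the eigensubspace."""
    out = [0j] * len(psi)
    for u in basis:
        c = inner(u, psi)
        out = [o + c * x for o, x in zip(out, u)]
    return out

def collapse(basis, psi):
    """Return (probability of the result, normalized state after the measurement)."""
    projected = project(basis, psi)
    prob = inner(psi, projected).real          # <psi|P_n|psi>
    return prob, [x / prob ** 0.5 for x in projected]

s = 2 ** -0.5
psi = [0.5 + 0j, 0.5j, s + 0j]                 # normalized state before measurement
subspace = [[1 + 0j, 0j, 0j], [0j, 1 + 0j, 0j]]

prob_first, after = collapse(subspace, psi)

# An immediate second measurement finds the same result with certainty.
prob_second, after_second = collapse(subspace, after)
```

The state after the measurement keeps the relative weights and phases that $|\psi\rangle$ had inside the eigensubspace; it is not an arbitrary vector of that subspace.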


4. Time evolution of systems 

We have already presented, in § B-2 of chapter I, the Schrödinger equation
for one particle. Here we shall write it in the general case.


Sixth Postulate: The time evolution of the state vector $|\psi(t)\rangle$ is governed
by the Schrödinger equation:

$$i\hbar\,\frac{d}{dt}\,|\psi(t)\rangle = H(t)\,|\psi(t)\rangle$$

where $H(t)$ is the observable associated with the total energy of the system.


$H$ is called the Hamiltonian operator of the system, as it is obtained from
the classical Hamiltonian (appendix III and §5 below).
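
To see the Schrödinger equation at work, the sketch below treats a hypothetical two-level system with the time-independent Hamiltonian $H = \hbar\omega\,\sigma_x$, in units where $\hbar = 1$ and with $\omega = 1$ (all of these are assumptions made for the example). It uses the closed form $e^{-i\omega t\sigma_x} = I\cos\omega t - i\,\sigma_x\sin\omega t$ and checks, by a centred finite difference, that the resulting state satisfies $i\,d|\psi\rangle/dt = H|\psi\rangle$.

```python
import math

OMEGA = 1.0   # hypothetical angular frequency; hbar is set to 1

def hamiltonian_apply(psi):
    """H|psi> for H = OMEGA * sigma_x, which swaps the two components."""
    return [OMEGA * psi[1], OMEGA * psi[0]]

def evolve(psi0, t):
    """|psi(t)> = (I cos(OMEGA t) - i sigma_x sin(OMEGA t)) |psi(0)>."""
    c, s = math.cos(OMEGA * t), math.sin(OMEGA * t)
    return [c * psi0[0] - 1j * s * psi0[1], c * psi0[1] - 1j * s * psi0[0]]

psi0 = [1 + 0j, 0j]
t, dt = 0.3, 1e-6

psi_t = evolve(psi0, t)
norm = sum(abs(x) ** 2 for x in psi_t)

# Left-hand side i d|psi>/dt, estimated by a centred finite difference.
lhs = [1j * (a - b) / (2 * dt)
       for a, b in zip(evolve(psi0, t + dt), evolve(psi0, t - dt))]
rhs = hamiltonian_apply(psi_t)
residual = max(abs(a - b) for a, b in zip(lhs, rhs))
```

The norm of the state vector stays equal to 1 during the evolution, and the residual of the equation is at the level of the finite-difference error.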


5. Quantization rules 

We are finally going to discuss how to construct, for a physical quantity $\mathcal{A}$
already defined in classical mechanics, the operator $A$ which describes it in quantum
mechanics.

a. STATEMENT

Let us first consider a system composed of a single particle, without spin,
subject to a scalar potential. In this case:


With the position $\mathbf{r}(x, y, z)$ of the particle is associated the observable $\mathbf{R}(X, Y, Z)$.
With the momentum $\mathbf{p}(p_x, p_y, p_z)$ of the particle is associated the observable
$\mathbf{P}(P_x, P_y, P_z)$.


Recall that the components of $\mathbf{R}$ and $\mathbf{P}$ satisfy the canonical commutation relations
[chap. II, equations (E-30)]:

$$[R_i, R_j] = [P_i, P_j] = 0$$

$$[R_i, P_j] = i\hbar\,\delta_{ij} \tag{B-33}$$

Any physical quantity $\mathcal{A}$ related to this particle is expressed in terms of the
fundamental dynamical variables $\mathbf{r}$ and $\mathbf{p}$: $\mathcal{A}(\mathbf{r}, \mathbf{p}, t)$. To obtain the corresponding
observable $A$, one could simply replace, in the expression for $\mathcal{A}(\mathbf{r}, \mathbf{p}, t)$, the
variables $\mathbf{r}$ and $\mathbf{p}$ by the observables $\mathbf{R}$ and $\mathbf{P}$*:

$$A(t) = \mathcal{A}(\mathbf{R}, \mathbf{P}, t) \tag{B-34}$$


* See, in complement $B_{II}$, the definition of a function of an operator.
However, this mode of action would be, in general, ambiguous. Assume, for
example, that in $\mathcal{A}(\mathbf{r}, \mathbf{p}, t)$ there appears a term of the form:

$$\mathbf{r}\cdot\mathbf{p} = x p_x + y p_y + z p_z \tag{B-35}$$

In classical mechanics, the scalar product $\mathbf{r}\cdot\mathbf{p}$ is commutative, and one can just
as well write:

$$\mathbf{p}\cdot\mathbf{r} = p_x x + p_y y + p_z z \tag{B-36}$$

But when $\mathbf{r}$ and $\mathbf{p}$ are replaced by the corresponding observables $\mathbf{R}$ and $\mathbf{P}$, the
operators obtained from (B-35) and (B-36) are not identical [see relations (B-33)]:

$$\mathbf{R}\cdot\mathbf{P} \neq \mathbf{P}\cdot\mathbf{R} \tag{B-37}$$

Moreover, neither $\mathbf{R}\cdot\mathbf{P}$ nor $\mathbf{P}\cdot\mathbf{R}$ is Hermitian:

$$(\mathbf{R}\cdot\mathbf{P})^\dagger = (X P_x + Y P_y + Z P_z)^\dagger = \mathbf{P}\cdot\mathbf{R} \tag{B-38}$$

To the preceding postulates, therefore, must be added a symmetrization rule.
For example, the observable associated with $\mathbf{r}\cdot\mathbf{p}$ will be:

$$\frac{1}{2}\,(\mathbf{R}\cdot\mathbf{P} + \mathbf{P}\cdot\mathbf{R}) \tag{B-39}$$

which is indeed Hermitian. For an observable which is more complicated than $\mathbf{R}\cdot\mathbf{P}$,
an analogous symmetrization is to be performed.


The observable $A$ which describes a classically defined physical quantity $\mathcal{A}$
is obtained by replacing, in the suitably symmetrized expression for $\mathcal{A}$, $\mathbf{r}$ and $\mathbf{p}$
by the observables $\mathbf{R}$ and $\mathbf{P}$ respectively.
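
The size of the ambiguity in this example can be made explicit with a short computation that uses only the canonical commutation relations (B-33). It is offered as a worked check, not as an additional rule:

```latex
\begin{aligned}
\mathbf{R}\cdot\mathbf{P} - \mathbf{P}\cdot\mathbf{R}
  &= [X, P_x] + [Y, P_y] + [Z, P_z] = 3i\hbar \\[4pt]
\tfrac{1}{2}\,(\mathbf{R}\cdot\mathbf{P} + \mathbf{P}\cdot\mathbf{R})
  &= \mathbf{R}\cdot\mathbf{P} - \tfrac{3i\hbar}{2}
   = \mathbf{P}\cdot\mathbf{R} + \tfrac{3i\hbar}{2}
\end{aligned}
```

The two orderings therefore differ by the constant $3i\hbar$, and the symmetrized operator lies halfway between them. Taking the adjoint of the symmetrized operator exchanges its two terms, which is why it is Hermitian while neither ordering alone is.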


We shall see, however, that there exist quantum physical quantities which
have no classical equivalent and which are therefore defined directly by the corresponding
observables (this is the case, for example, for particle spin).

COMMENT:

The preceding rules, and commutation rules (B-33) in particular, are
valid only in cartesian coordinates. It would be possible to generalize them
to other coordinate systems; however, they would no longer have the same
simple form as they do above.

b. IMPORTANT EXAMPLES 

a. The Hamiltonian of a particle in a scalar potential 

Consider a (spinless) particle of charge q and mass m, placed in an electric 
field derived from a scalar potential £/(r). The potential energy of the particle is 
therefore V(r) = qU{ r), and the corresponding classical Hamiltonian is writ¬ 
ten [appendix III, formula (29)]: 

^(r, p) = + V(r) (B-40) 
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with: 

p = m-^ = mv (B-41) 

where v is the particle’s velocity. 

No difficulties are presented by the construction of the quantum operator H 
which corresponds to . No symmetrization is necessary, since neither 
P 2 = P\ + P 2 + P} nor F(R) involves products of noncommuting operators. 
We therefore have: 

H = -jr- + F(R) (B-42) 


T(R) is the operator obtained by replacing r by R in V(r) (cf. complement B n , § 4). 

In this particular case, the Schrödinger equation, given in the sixth postulate,
becomes:

$$i\hbar\frac{d}{dt}|\psi(t)\rangle = \left[\frac{\mathbf{P}^2}{2m} + V(\mathbf{R})\right]|\psi(t)\rangle$$    (B-43)
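
To see the operator (B-42) at work, the following sketch applies it to a wave function sampled on a grid. It rests on assumptions that are not part of the passage above: one dimension, the position representation in which P acts as (hbar/i) d/dx, units hbar = m = 1, and a harmonic potential V(x) = x^2/2 chosen purely for illustration. The function name is hypothetical.

```python
import math

def apply_hamiltonian(psi, xs, potential, hbar=1.0, mass=1.0):
    """Apply H = P^2/2m + V(X) on a uniform grid (interior points only).

    P^2 is taken as -hbar^2 d^2/dx^2 and approximated by a three-point
    second difference.
    """
    dx = xs[1] - xs[0]
    out = []
    for i in range(1, len(xs) - 1):
        second = (psi[i + 1] - 2 * psi[i] + psi[i - 1]) / dx**2
        out.append(-hbar**2 / (2 * mass) * second + potential(xs[i]) * psi[i])
    return out

xs = [-6 + 0.01 * i for i in range(1201)]
psi0 = [math.exp(-x * x / 2) for x in xs]      # Gaussian trial function (unnormalized)
h_psi = apply_hamiltonian(psi0, xs, lambda x: 0.5 * x * x)

# If psi0 is an eigenfunction with eigenvalue 1/2, H psi0 - psi0 / 2 vanishes.
residual = max(abs(h - 0.5 * p) for h, p in zip(h_psi, psi0[1:-1]))
print("largest deviation from H psi = psi / 2:", residual)
```

For this potential the Gaussian is the known ground state with energy hbar*omega/2, so the residual is limited only by the finite-difference error.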


β. The Hamiltonian of a particle in a vector potential

If the particle is now placed in an arbitrary electromagnetic field, the classical
Hamiltonian becomes [appendix III, relation (66)]:

$$\mathcal{H}(\mathbf{r}, \mathbf{p}) = \frac{1}{2m}[\mathbf{p} - q\mathbf{A}(\mathbf{r}, t)]^2 + qU(\mathbf{r}, t)$$    (B-44)

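where $U(\mathbf{r}, t)$ and $\mathbf{A}(\mathbf{r}, t)$ are the scalar and vector potentials which describe the
electromagnetic field, and where $\mathbf{p}$ is given by:

$$\mathbf{p} = m\frac{d\mathbf{r}}{dt} + q\mathbf{A}(\mathbf{r}, t) = m\mathbf{v} + q\mathbf{A}(\mathbf{r}, t)$$    (B-45)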

Once again, since $\mathbf{A}(\mathbf{r}, t)$ depends only on $\mathbf{r}$ and the parameter $t$ (and not on $\mathbf{p}$),
construction of the corresponding quantum operator $\mathbf{A}(\mathbf{R}, t)$ presents no problem.
The Hamiltonian operator $H$ is then given by:

$$H(t) = \frac{1}{2m}[\mathbf{P} - q\mathbf{A}(\mathbf{R}, t)]^2 + V(\mathbf{R}, t)$$    (B-46)


with 


$$V(\mathbf{R}, t) = qU(\mathbf{R}, t)$$    (B-47)

and the Schrodinger equation is written: 

$$i\hbar\frac{d}{dt}|\psi(t)\rangle = \left\{\frac{1}{2m}[\mathbf{P} - q\mathbf{A}(\mathbf{R}, t)]^2 + V(\mathbf{R}, t)\right\}|\psi(t)\rangle$$    (B-48)

COMMENT:

Care must be taken not to confuse $\mathbf{p}$ (the momentum of the particle,
also called the conjugate momentum of $\mathbf{r}$) with $m\mathbf{v}$ (the mechanical
momentum of the particle): the difference between these two quantities appears
clearly in (B-45). In quantum mechanics, there of course exists an operator
associated with the velocity of the particle, which is written here:

$$\mathbf{V} = \frac{1}{m}(\mathbf{P} - q\mathbf{A})$$    (B-49)

$H$ is then given by:

$$H(t) = \frac{1}{2}m\mathbf{V}^2 + V(\mathbf{R}, t)$$    (B-50)

It is the sum of two terms, one corresponding to the kinetic energy and the 
other to the potential energy of the particle. 

However, it is the conjugate momentum $\mathbf{p}$ and not the mechanical
momentum $m\mathbf{v}$ which becomes in quantum mechanics the operator $\mathbf{P}$ which
satisfies the canonical commutation relations (B-33).


C. THE PHYSICAL INTERPRETATION OF THE POSTULATES CONCERNING OBSERVABLES AND THEIR MEASUREMENT


1. The quantization rules are consistent with the probabilistic interpretation of the wave function


It is natural to associate the observables $\mathbf{R}$ and $\mathbf{P}$, whose action was defined
in §E of chapter II, with the position and momentum of a particle. First of all,
each of the observables $X$, $Y$, $Z$ and $P_x$, $P_y$, $P_z$ possesses a continuous spectrum,
and experiments indeed show that all real values are possible for the six position
and momentum variables. Moreover, we shall see that applying the fourth postulate
to the case of these observables enables us to re-derive the probabilistic
interpretation of the wave function as well as that of its Fourier transform (see §§B-2
and C-3 of chapter I).

Let us consider, for simplicity, the one-dimensional problem. If the particle
is in the normalized state $|\psi\rangle$, the probability that a measurement of its position
will yield a result included between $x$ and $x + dx$ is equal to [formula (B-17)]:

$$d\mathcal{P}(x) = |\langle x|\psi\rangle|^2\,dx$$    (C-1)

where $|x\rangle$ is the eigenket of $X$ with the eigenvalue $x$. We again find that the square
of the modulus of the wave function $\psi(x) = \langle x|\psi\rangle$ is the particle's position
probability density. Now, to the eigenvector $|p\rangle$ of the observable $P$ corresponds
the plane wave:

$$\langle x|p\rangle = \frac{1}{\sqrt{2\pi\hbar}}\,e^{ipx/\hbar}$$    (C-2)

and we have seen (§ C-3 of chapter I) that the de Broglie relations associate with
this wave a well-defined momentum which is precisely $p$. In addition, the probability
of finding, for a particle in the state $|\psi\rangle$, a momentum between $p$ and $p + dp$ is:

$$d\mathcal{P}(p) = |\langle p|\psi\rangle|^2\,dp = |\bar{\psi}(p)|^2\,dp$$    (C-3)

This is indeed what we found in § C-3 of chapter I. 


2. Quantization of certain physical quantities 

As we have already pointed out, the third postulate enables us to explain 
the quantization observed for certain quantities, such as the energy of atoms. But 
it does not imply that all quantities are quantized, since observables exist whose 
spectrum is continuous. The physical predictions based on the third postulate are 
therefore not at all obvious a priori. For example, when we study the hydrogen 
atom (chap. VII), we shall start from the total energy of the electron in the coulomb 
potential of the proton, from which we shall deduce the Hamiltonian operator. 
Solving its eigenvalue equation, we shall find that the bound states of the system 
can only correspond to certain discrete energies which we shall calculate. Thus we 
shall not only explain the quantization of the levels of the hydrogen atom, but also 
predict the possible energy values, which can be measured experimentally. 
We stress the fact that these results will be obtained using the same fundamental 
interaction law which is used in classical mechanics in the macroscopic domain. 


3. The measurement process 

The fourth and fifth postulates pose a certain number of fundamental 
problems which we shall not consider here. There is, in particular, the question 
of the “fundamental” perturbation involved in the observation of a quantum 
system (cf. chap. I, §§A-2 and A-3). The origin of these problems lies in the fact 
that the system under study is treated independently from the measurement device, 
although their interaction is essential to the observation process. One should 
actually consider the system and the measurement device together as a whole. This 
raises delicate questions concerning the details of the measurement process. 

We shall content ourselves with pointing out that the nondeterministic 
formulation of the fourth and fifth postulates is related to the problems that we 
have just mentioned. For example, the abrupt change from one state vector to 
another due to the measurement corresponds to the fundamental perturbation 
of which we have spoken. But it is impossible to predict what this perturbation 
will be, since it depends on the measurement result, which is not known with
certainty in advance*. 

We shall consider here only ideal measurements. To understand this concept, 
let us return, for example, to the experiment of §A-3 of chapter I on polarized 
photons. It is clear that when we grant that all photons polarized in a certain 
direction traverse the analyzer, we assume that the analyzer is perfect. In practice, 
obviously, it also absorbs some of the photons that it should let through. We shall 
therefore make the hypothesis, in the general case, that the measurement devices 
used are perfect : this amounts to assuming that the perturbation they provoke is 
due only to the quantum mechanical aspect of the measurement. Of course, the 
devices which can actually be constructed always present imperfections which 
affect the measurement and the system; but one can, in principle, constantly
improve them and thus approach the ideal limit defined by the postulates which
we have stated.


4. Mean value of an observable in a given state 


The predictions deduced from the fourth postulate are expressed in terms of 
probabilities. To verify them, it would be necessary to perform a large number 
of measurements under identical conditions. That is, one would have to measure 
the same quantity in a large number of systems which are all in the same quantum 
state. If these predictions are correct, the proportion of $N$ identical experiments
resulting in a given event will approach, as $N \to \infty$, the theoretically predicted
probability $\mathcal{P}$ of this event. Such a verification can only be carried out in the
limit where $N \to \infty$; in practice, $N$ is of course finite, and statistical techniques
must be used to interpret the results.
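
The convergence of observed proportions towards the predicted probabilities can be simulated. In the sketch below the three outcomes and their probabilities are invented for illustration, and a pseudo-random generator stands in for the ensemble of identically prepared systems; none of this comes from the text.

```python
import random

def observed_frequencies(probabilities, n_trials, seed=0):
    """Simulate n_trials independent measurements and return the proportion
    of trials that gave each outcome."""
    rng = random.Random(seed)
    outcomes = list(probabilities)
    weights = [probabilities[o] for o in outcomes]
    counts = {o: 0 for o in outcomes}
    for o in rng.choices(outcomes, weights=weights, k=n_trials):
        counts[o] += 1
    return {o: counts[o] / n_trials for o in outcomes}

# Hypothetical predicted probabilities for three possible results.
predicted = {"a1": 0.5, "a2": 0.3, "a3": 0.2}

for n in (100, 10_000, 200_000):
    print(n, observed_frequencies(predicted, n))
```

The proportions fluctuate noticeably for small N and settle near the predicted values as N grows, with a residual scatter of order 1/sqrt(N) that has to be handled statistically.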

The mean value of the observable** $A$ in the state $|\psi\rangle$, which we shall
denote by $\langle A\rangle_\psi$, or, more simply, by $\langle A\rangle$, is defined as the average of the results
obtained when a large number $N$ of measurements of this observable are performed
on systems which are all in the state $|\psi\rangle$. When $|\psi\rangle$ is given, the probabilities of
finding all the possible results are known. The mean value $\langle A\rangle_\psi$ can therefore be
predicted. We shall show that if $|\psi\rangle$ is normalized, $\langle A\rangle_\psi$ is given by the formula:

$$\langle A\rangle_\psi = \langle\psi|A|\psi\rangle$$    (C-4)


First consider the case where the entire spectrum of $A$ is discrete. Out of $N$
measurements of $A$ (the system being in the state $|\psi\rangle$ each time), the eigenvalue $a_n$
will be obtained $\mathcal{N}(a_n)$ times, with:

$$\frac{\mathcal{N}(a_n)}{N} \xrightarrow[N\to\infty]{} \mathcal{P}(a_n)$$    (C-5)

and:

$$\sum_n \mathcal{N}(a_n) = N$$    (C-6)

The mean value of the results of these $N$ experiments is the sum of the values
found divided by $N$ (when $\mathcal{N}$ experiments have yielded the same result, this
result will clearly appear $\mathcal{N}$ times in this sum). It is therefore equal to:

$$\frac{1}{N}\sum_n a_n\,\mathcal{N}(a_n)$$    (C-7)

Using (C-5), we see that when $N \to \infty$, this mean value approaches:

$$\langle A\rangle_\psi = \sum_n a_n\,\mathcal{P}(a_n)$$    (C-8)

Now substitute into this formula expression (B-7) for $\mathcal{P}(a_n)$:

$$\langle A\rangle_\psi = \sum_n a_n \sum_{i=1}^{g_n} \langle\psi|u_n^i\rangle\langle u_n^i|\psi\rangle$$    (C-9)

Since:

$$A|u_n^i\rangle = a_n|u_n^i\rangle$$    (C-10)

(C-9) can be written in the form:

$$\langle A\rangle_\psi = \sum_n \sum_{i=1}^{g_n} \langle\psi|A|u_n^i\rangle\langle u_n^i|\psi\rangle = \langle\psi|A\left[\sum_n \sum_{i=1}^{g_n} |u_n^i\rangle\langle u_n^i|\right]|\psi\rangle$$    (C-11)

Since the $\{|u_n^i\rangle\}$ form an orthonormal basis of $\mathcal{E}$, the expression in brackets
is equal to the identity operator (closure relation), and we obtain formula (C-4).

* Except, obviously, in the case where one is sure of the result that will be found (probability
equal to 1: the measurement does not modify the state of the system).

** We shall henceforth use the word "observable" to designate a physical quantity as well as the
associated operator.
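
The equality between the probability-weighted sum (C-8) and the matrix element (C-4) can be verified on a two-dimensional example. The observable below (a 2 x 2 matrix with eigenvalues +1 and -1) and the state parameters are assumptions chosen for illustration; the helper names are hypothetical.

```python
import cmath
import math

def inner(u, v):
    """Scalar product <u|v>, antilinear in the first argument."""
    return sum(a.conjugate() * b for a, b in zip(u, v))

def apply(matrix, v):
    """Action of a matrix on a column of components."""
    return [sum(m * c for m, c in zip(row, v)) for row in matrix]

# Assumed observable: A = [[0, 1], [1, 0]], non-degenerate eigenvalues +1 and -1.
A = [[0, 1], [1, 0]]
s = 1 / math.sqrt(2)
eigen = [(+1.0, [s, s]), (-1.0, [s, -s])]      # (eigenvalue, normalized eigenvector)

# Arbitrary normalized state, parametrized by two angles.
theta, phi = 0.7, 1.1
psi = [complex(math.cos(theta)), cmath.exp(1j * phi) * math.sin(theta)]

probs = [abs(inner(u, psi)) ** 2 for _, u in eigen]              # fourth postulate
mean_from_probs = sum(a * p for (a, _), p in zip(eigen, probs))  # formula (C-8)
mean_from_ket = inner(psi, apply(A, psi)).real                   # formula (C-4)

print(mean_from_probs, mean_from_ket)
```

Both numbers coincide, and for this particular choice they equal sin(2 theta) cos(phi), which can be checked by hand.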

The argument is completely analogous for the case where the spectrum of $A$
is continuous (for simplicity, we shall continue to assume it to be non-degenerate).
Consider $N$ identical experiments, and call $d\mathcal{N}(\alpha)$ the number of experiments
which have yielded a result included between $\alpha$ and $\alpha + d\alpha$. We have, similarly:

$$\frac{d\mathcal{N}(\alpha)}{N} \xrightarrow[N\to\infty]{} d\mathcal{P}(\alpha)$$    (C-12)

The mean value of the results obtained is $\frac{1}{N}\int \alpha\,d\mathcal{N}(\alpha)$, which, when $N \to \infty$,
approaches:

$$\langle A\rangle_\psi = \int \alpha\,d\mathcal{P}(\alpha)$$    (C-13)

Substitute into (C-13) the expression for $d\mathcal{P}(\alpha)$ given by (B-17):

$$\langle A\rangle_\psi = \int \alpha\,\langle\psi|v_\alpha\rangle\langle v_\alpha|\psi\rangle\,d\alpha$$    (C-14)

We can use the equation:

$$A|v_\alpha\rangle = \alpha|v_\alpha\rangle$$    (C-15)

to transform (C-14) into:

$$\langle A\rangle_\psi = \int \langle\psi|A|v_\alpha\rangle\langle v_\alpha|\psi\rangle\,d\alpha = \langle\psi|A\left[\int d\alpha\,|v_\alpha\rangle\langle v_\alpha|\right]|\psi\rangle$$    (C-16)

Using the closure relation satisfied by the states $|v_\alpha\rangle$, we again find
formula (C-4).


COMMENTS : 


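(i) $\langle A\rangle$, the average over a set of identical measurements, must not be confused
with the time averages sometimes taken when dealing with time-dependent
phenomena.

(ii) If the ket $|\psi\rangle$ representing the state of the system is not normalized,
formula (C-4) becomes [cf. comment (i) of §B-3-b]:

$$\langle A\rangle_\psi = \frac{\langle\psi|A|\psi\rangle}{\langle\psi|\psi\rangle}$$    (C-17)

(iii) In practice, to calculate $\langle A\rangle_\psi$ explicitly, one often places oneself in a particular
representation. For example:

$$\langle X\rangle_\psi = \langle\psi|X|\psi\rangle = \int d^3r\,\langle\psi|\mathbf{r}\rangle\langle\mathbf{r}|X|\psi\rangle = \int d^3r\,\psi^*(\mathbf{r})\,x\,\psi(\mathbf{r})$$    (C-18)

using the definition of the $X$ operator [cf. chap. II, relations (E-22)]. Similarly:

$$\langle P_x\rangle_\psi = \langle\psi|P_x|\psi\rangle = \int d^3p\,\bar{\psi}^*(\mathbf{p})\,p_x\,\bar{\psi}(\mathbf{p})$$    (C-19)

or, using the $\{|\mathbf{r}\rangle\}$ representation:

$$\langle P_x\rangle_\psi = \int d^3r\,\langle\psi|\mathbf{r}\rangle\langle\mathbf{r}|P_x|\psi\rangle = \int d^3r\,\psi^*(\mathbf{r})\left[\frac{\hbar}{i}\frac{\partial}{\partial x}\psi(\mathbf{r})\right]$$    (C-20)

since $\mathbf{P}$ is then represented by $\frac{\hbar}{i}\nabla$ [formula (E-26) of chapter II].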
5. The root-mean-square deviation

$\langle A\rangle$ indicates the order of magnitude of the values of the observable $A$
when the system is in the state $|\psi\rangle$. However, this mean value does not give any
idea of the dispersion of the results we expect when measuring $A$. Assume, for
example, that the spectrum of $A$ is continuous and that, for a given state $|\psi\rangle$, the
curve representing the variation with respect to $\alpha$ of the probability
density $\rho(\alpha) = |\langle v_\alpha|\psi\rangle|^2$ has the shape shown in figure 2. For a system in the
state $|\psi\rangle$, nearly all the values that can be found when $A$ is measured are included
in an interval of width $\delta A$ containing $\langle A\rangle$, where the quantity $\delta A$ characterizes the
width of the curve: the smaller $\delta A$, the more the measurement results are
concentrated about $\langle A\rangle$.



FIGURE 2 

Variation with respect to $\alpha$ of the probability
density $\rho(\alpha)$. The mean value $\langle A\rangle$ is the
abscissa of the center of gravity of the area
under the curve (it does not necessarily coincide
with the abscissa $\alpha_m$ of the maximum of the
function).


How can we define, in a general way, a quantity which characterizes the
dispersion of the measurement results about $\langle A\rangle$? We might envisage the following
method: for each measurement, take the difference between the value obtained
and $\langle A\rangle$; then calculate the average of these deviations, dividing their sum by
the number $N$ of experiments. It is easy to see, however, that the result obtained
would be zero; we have, obviously:

$$\langle A - \langle A\rangle\rangle = \langle A\rangle - \langle A\rangle = 0$$    (C-21)


By the very definition of $\langle A\rangle$, the average of the negative deviations balances
exactly the average of the positive ones.

To avoid this compensation, it suffices to define $\Delta A$ such that $(\Delta A)^2$ is the
mean of the squares of the deviations:

$$(\Delta A)^2 = \langle (A - \langle A\rangle)^2\rangle$$    (C-22)

By definition, we therefore introduce the root-mean-square deviation $\Delta A$ by setting:

$$\Delta A = \sqrt{\langle (A - \langle A\rangle)^2\rangle}$$    (C-23)

Using the expression for the mean value given in (C-4), we then have:

$$\Delta A = \sqrt{\langle\psi|(A - \langle A\rangle)^2|\psi\rangle}$$    (C-24)

This relation can also be written in a slightly different way:

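$$\langle (A - \langle A\rangle)^2\rangle = \langle A^2 - 2\langle A\rangle A + \langle A\rangle^2\rangle = \langle A^2\rangle - 2\langle A\rangle^2 + \langle A\rangle^2 = \langle A^2\rangle - \langle A\rangle^2$$    (C-25)

The root-mean-square deviation $\Delta A$ is therefore also given by:

$$\Delta A = \sqrt{\langle A^2\rangle - \langle A\rangle^2}$$    (C-26)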

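For example, in the case of the continuous spectrum of the observable $A$
considered above, $\Delta A$ is given by:

$$(\Delta A)^2 = \int_{-\infty}^{+\infty} [\alpha - \langle A\rangle]^2\,\rho(\alpha)\,d\alpha = \int_{-\infty}^{+\infty} \alpha^2\rho(\alpha)\,d\alpha - \left[\int_{-\infty}^{+\infty} \alpha\,\rho(\alpha)\,d\alpha\right]^2$$    (C-27)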


If definition (C-23) is applied to the observables $\mathbf{R}$ and $\mathbf{P}$, it can be shown
(complement $C_{III}$), using their commutation relations, that for any state $|\psi\rangle$, one has:

$$\Delta X\cdot\Delta P_x \geq \hbar/2$$
$$\Delta Y\cdot\Delta P_y \geq \hbar/2$$
$$\Delta Z\cdot\Delta P_z \geq \hbar/2$$    (C-28)


In other words, we find the Heisenberg uncertainty relations again, but with 
a precise lower limit, which arises from the precise definition of the uncertainties. 
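
As a numerical illustration of (C-26) and (C-28), the sketch below evaluates the root-mean-square deviations of X and P for two real wave functions sampled on a grid. Several assumptions are made that go beyond the text: one dimension, hbar = 1, wave functions that are real and vanish at the edges of the grid (so that the mean of P is zero and the mean of P^2 equals hbar^2 times the integral of (d psi/dx)^2), and a central-difference derivative. The two trial functions are chosen only as examples.

```python
import math

def deviations(psi, xs, hbar=1.0):
    """Return (Delta X, Delta P) for a real wave function on a uniform grid.

    The function need not be normalized: every mean value is divided by the
    squared norm, as in formula (C-17).
    """
    dx = xs[1] - xs[0]
    norm = sum(f * f for f in psi) * dx
    mean_x = sum(x * f * f for x, f in zip(xs, psi)) * dx / norm
    mean_x2 = sum(x * x * f * f for x, f in zip(xs, psi)) * dx / norm
    # Central difference for d psi/dx at interior points.
    dpsi = [(psi[i + 1] - psi[i - 1]) / (2 * dx) for i in range(1, len(psi) - 1)]
    mean_p2 = hbar**2 * sum(d * d for d in dpsi) * dx / norm
    delta_x = math.sqrt(mean_x2 - mean_x**2)   # formula (C-26)
    delta_p = math.sqrt(mean_p2)               # formula (C-26) with <P> = 0
    return delta_x, delta_p

xs = [-10 + 0.01 * i for i in range(2001)]
gaussian = [math.exp(-x * x / 4) for x in xs]
odd_state = [x * math.exp(-x * x / 2) for x in xs]

product_gaussian = math.prod(deviations(gaussian, xs))
product_odd = math.prod(deviations(odd_state, xs))
print("Gaussian:", product_gaussian)
print("x * Gaussian:", product_odd)
```

Up to discretization error, the Gaussian gives a product equal to hbar/2, the lower limit in (C-28), while the second function gives a strictly larger value.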


6. Compatibility of observables 

a. COMPATIBILITY AND COMMUTATION RULES 

Consider two observables A and B which commute: 

[A, B] = 0 (C-29) 

We shall assume for simplicity that both of their spectra are discrete. According
to the theorem proved in § D-3-a of chapter II, there exists a basis of the state
space composed of eigenkets common to $A$ and $B$, which we shall denote
by $|a_n, b_p, i\rangle$:

$$A|a_n, b_p, i\rangle = a_n|a_n, b_p, i\rangle$$
$$B|a_n, b_p, i\rangle = b_p|a_n, b_p, i\rangle$$    (C-30)

(the index $i$ allows us to distinguish, if necessary, between the different vectors
corresponding to one pair of eigenvalues). Therefore, for any $a_n$ and $b_p$ (chosen,
respectively, in the spectra of $A$ and $B$), there exists at least one state $|a_n, b_p, i\rangle$ for
which a measurement of $A$ will always give $a_n$ and a measurement of $B$ will always
give $b_p$. Two such observables $A$ and $B$ which can be simultaneously determined are
said to be compatible.

On the other hand, if $A$ and $B$ do not commute, a state cannot in general* be
a simultaneous eigenvector of these two observables. They are said to be incompatible.


Let us examine more closely the measurement of two compatible observables on a system
which is initially in an arbitrary (normalized) state $|\psi\rangle$. This state can always be written:

$$|\psi\rangle = \sum_{n,p,i} c_{n,p,i}\,|a_n, b_p, i\rangle$$    (C-31)

First assume that we measure $A$ and then, immediately afterwards, $B$ (before the system
has had the time to evolve). Let us calculate the probability $\mathcal{P}(a_n, b_p)$ of obtaining $a_n$ in the
first measurement and $b_p$ in the second one. We begin by measuring $A$ in the state $|\psi\rangle$; the
probability of finding $a_n$ is therefore:

$$\mathcal{P}(a_n) = \sum_{p,i} |c_{n,p,i}|^2$$    (C-32)

When we then measure $B$, the system is no longer in the state $|\psi\rangle$ but, if we have found $a_n$,
in the state $|\psi'_n\rangle$:

$$|\psi'_n\rangle = \frac{1}{\sqrt{\sum_{p,i} |c_{n,p,i}|^2}} \sum_{p,i} c_{n,p,i}\,|a_n, b_p, i\rangle$$    (C-33)

The probability of obtaining $b_p$ when it is known that the first measurement has yielded $a_n$
is therefore equal to:

$$\mathcal{P}_{a_n}(b_p) = \frac{1}{\sum_{p',i} |c_{n,p',i}|^2} \sum_i |c_{n,p,i}|^2$$    (C-34)

The probability $\mathcal{P}(a_n, b_p)$ sought corresponds to a "composite event": to be in a favorable case,
we must first find $a_n$ and then, having satisfied this first condition, find $b_p$. Therefore:

$$\mathcal{P}(a_n, b_p) = \mathcal{P}(a_n) \times \mathcal{P}_{a_n}(b_p)$$    (C-35)

Substituting into this formula expressions (C-32) and (C-34), we obtain:

$$\mathcal{P}(a_n, b_p) = \sum_i |c_{n,p,i}|^2$$    (C-36)

Moreover, the state of the system becomes, immediately after the second measurement:

$$|\psi''_{n,p}\rangle = \frac{1}{\sqrt{\sum_i |c_{n,p,i}|^2}} \sum_i c_{n,p,i}\,|a_n, b_p, i\rangle$$    (C-37)

Therefore, if we decide to measure either $A$ or $B$ again, we are sure of the result ($a_n$ or $b_p$): $|\psi''_{n,p}\rangle$
is an eigenvector common to $A$ and $B$ with the eigenvalues $a_n$ and $b_p$ respectively.

Let us now return to the system in the state $|\psi\rangle$, and let us measure the two observables
in the opposite order ($B$, then $A$). What is the probability $\mathcal{P}(b_p, a_n)$ of obtaining the same
results as before? The reasoning is the same. We have here:

$$\mathcal{P}(b_p, a_n) = \mathcal{P}(b_p) \times \mathcal{P}_{b_p}(a_n)$$    (C-38)


* Some kets may be simultaneous eigenvectors of A and B . But there would not be a sufficient 
number of them to form a basis, as would be the case if A and B commuted. 




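From (C-31), we see that:

$$\mathcal{P}(b_p) = \sum_{n,i} |c_{n,p,i}|^2$$    (C-39)

and that, after a measurement of $B$ which yields $b_p$, the state of the system becomes:

$$|\varphi'_p\rangle = \frac{1}{\sqrt{\sum_{n,i} |c_{n,p,i}|^2}} \sum_{n,i} c_{n,p,i}\,|a_n, b_p, i\rangle$$    (C-40)

Therefore:

$$\mathcal{P}_{b_p}(a_n) = \frac{1}{\sum_{n',i} |c_{n',p,i}|^2} \sum_i |c_{n,p,i}|^2$$    (C-41)

and finally:

$$\mathcal{P}(b_p, a_n) = \sum_i |c_{n,p,i}|^2$$    (C-42)

If we have indeed found $b_p$ and then $a_n$, the system has gone into the state:

$$|\varphi''_{p,n}\rangle = \frac{1}{\sqrt{\sum_i |c_{n,p,i}|^2}} \sum_i c_{n,p,i}\,|a_n, b_p, i\rangle$$    (C-43)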


When two observables are compatible, the physical predictions are the same,
whatever the order of performing the two measurements (provided that the time
interval which separates them is sufficiently small). The probabilities of obtaining
either $a_n$ then $b_p$ or $b_p$ then $a_n$ are identical:

$$\mathcal{P}(a_n, b_p) = \mathcal{P}(b_p, a_n) = \sum_i |c_{n,p,i}|^2 = \sum_i |\langle a_n, b_p, i|\psi\rangle|^2$$    (C-44)

Moreover, the state of the system immediately after the two measurements is
in both cases (if the results are $a_n$ and $b_p$ for $A$ and $B$ respectively):

$$|\psi''_{n,p}\rangle = |\varphi''_{p,n}\rangle = \frac{1}{\sqrt{\sum_i |c_{n,p,i}|^2}} \sum_i c_{n,p,i}\,|a_n, b_p, i\rangle$$    (C-45)


New measurements of A or B will yield the same values again without fail. 

The preceding discussion thus leads to the following result: when two
observables $A$ and $B$ are compatible, the measurement of $B$ does not cause any
loss of information previously obtained from a measurement of $A$ (and vice versa)
but, on the contrary, adds to it. Moreover, the order of measuring the two
observables $A$ and $B$ is of no importance. This last point, furthermore, enables us
to envisage the simultaneous measurement of $A$ and $B$. The fourth and fifth postulates
can be generalized to the case of such a simultaneous measurement, as can be
seen from formulas (C-44) and (C-45). To the result $\{a_n, b_p\}$ correspond the
orthonormal eigenvectors $|a_n, b_p, i\rangle$. From this, (C-44) and (C-45) can be seen to
be applications of postulates (B-7) and (B-30).

On the other hand, if $A$ and $B$ do not commute, the preceding arguments
are no longer valid. To understand this in a simple way, imagine that the state
space $\mathcal{E}$ is replaced by the two-dimensional space of real vectors. The vectors $|u_1\rangle$
and $|u_2\rangle$ in figure 3 are eigenvectors of $A$ with eigenvalues $a_1$ and $a_2$ respectively;
$|v_1\rangle$ and $|v_2\rangle$ are eigenvectors of $B$ with eigenvalues $b_1$ and $b_2$ respectively.
Each of the two sets $\{|u_1\rangle, |u_2\rangle\}$ and $\{|v_1\rangle, |v_2\rangle\}$ forms an orthonormal
basis in $\mathcal{E}$. We shall therefore represent them in figure 3 by two pairs of perpendicular
unit vectors.


FIGURE 3

Diagram associated with the successive measurement of two
non-compatible observables $A$ and $B$. The state vector of the
system is $|\psi\rangle$. The eigenvectors of $A$ are $|u_1\rangle$ and $|u_2\rangle$
(eigenvalues $a_1$ and $a_2$), which are different from those of $B$,
$|v_1\rangle$ and $|v_2\rangle$ (eigenvalues $b_1$ and $b_2$).


The fact that $A$ and $B$ do not commute implies that these
two pairs do not coincide. The physical system under study is initially in the
normalized state $|\psi\rangle$, which is represented in the figure by an arbitrary unit
vector. We measure $A$ and find, for example, $a_1$; the system goes into the
state $|u_1\rangle$. We then measure $B$ and find, for example, $b_2$; the state of the system
becomes $|v_2\rangle$:

$$|\psi\rangle \xrightarrow{(a_1)} |u_1\rangle \xrightarrow{(b_2)} |v_2\rangle$$    (C-46)

If, on the other hand, we perform the measurements in the opposite order,
obtaining the same results:

$$|\psi\rangle \xrightarrow{(b_2)} |v_2\rangle \xrightarrow{(a_1)} |u_1\rangle$$    (C-47)

The final state of the system is not the same in both cases. We also see from
figure 3 that:

$$\mathcal{P}(a_1, b_2) = |OH_1|^2 \times |OK_2|^2$$
$$\mathcal{P}(b_2, a_1) = |OH_2|^2 \times |OK_1|^2$$    (C-48)

Although $|OK_1| = |OK_2|$, in general $|OH_1| \neq |OH_2|$ and:

$$\mathcal{P}(b_2, a_1) \neq \mathcal{P}(a_1, b_2)$$    (C-49)

Therefore: two incompatible observables cannot be measured simultaneously. It can
be seen from (C-46) and (C-47) that the second measurement causes the information
supplied by the first one to be lost. If, for example, after the sequence represented
in (C-46), we measure $A$ again, we can no longer be sure of the result since $|v_2\rangle$ is
not an eigenvector of $A$. All that was gained by the first measurement of $A$ is thus
lost.
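
The dependence on the order of two incompatible measurements can be reproduced with a few lines of arithmetic in the two-dimensional real space used above. The angles below are arbitrary choices made for illustration (they are not taken from figure 3), and the function names are hypothetical.

```python
import math

def unit(angle_deg):
    """Unit vector of the real plane making the given angle with the first axis."""
    a = math.radians(angle_deg)
    return (math.cos(a), math.sin(a))

def dot(u, v):
    return u[0] * v[0] + u[1] * v[1]

def sequence_probability(psi, first, second):
    """Probability of finding the eigenvalue attached to `first`, then the one
    attached to `second`: the state is projected onto `first` in between."""
    return dot(first, psi) ** 2 * dot(second, first) ** 2

u1, u2 = unit(0), unit(90)        # eigenvectors of A (eigenvalues a1, a2)
v1, v2 = unit(30), unit(120)      # eigenvectors of B (eigenvalues b1, b2)
psi = unit(20)                    # normalized initial state

p_a1_then_b2 = sequence_probability(psi, u1, v2)
p_b2_then_a1 = sequence_probability(psi, v2, u1)
print("a1 then b2:", p_a1_then_b2)
print("b2 then a1:", p_b2_then_a1)
```

The second factor is the same in both products, since it only involves the angle between $|u_1\rangle$ and $|v_2\rangle$; the difference comes entirely from the first projection, which is the content of (C-48) and (C-49).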
b. PREPARATION OF A STATE


Let us consider a physical system in the state $|\psi\rangle$ and measure the observable
$A$ (whose spectrum we assume to be discrete).

If the measurement yields a non-degenerate eigenvalue $a_n$, the state of the
system immediately after this measurement is the corresponding eigenvector $|u_n\rangle$.
In this case, it suffices to know the result of this measurement to be able to determine
unambiguously the state of the system after this measurement, as it does not
depend on the initial ket. As we have already noted, at the end of § B-3-c, this is
due to the fact that $\frac{c_n}{|c_n|}|u_n\rangle$ represents the same physical state as $|u_n\rangle$ itself.

The same does not hold true when the eigenvalue $a_n$ found in the measurement
is degenerate. In:

$$|\psi'_n\rangle = \frac{1}{\sqrt{\sum_{i=1}^{g_n} |c_n^i|^2}} \sum_{i=1}^{g_n} c_n^i\,|u_n^i\rangle$$    (C-50)

the absolute values of the coefficients $c_n^i$ and their relative phases are significant
(§ B-3-b-γ). Since the $c_n^i$ are fixed when the initial state $|\psi\rangle$ is specified, the
state $|\psi'_n\rangle$ after the measurement therefore depends on $|\psi\rangle$.

However, we saw in § a that two compatible observables $A$ and $B$ can be
measured simultaneously. If the result $(a_n, b_p)$ of this combined measurement
corresponds to only one eigenvector $|a_n, b_p\rangle$ common to $A$ and $B$, there is no
summation over $i$ in formula (C-37), which becomes:

$$|\psi''_{n,p}\rangle = \frac{c_{n,p}}{|c_{n,p}|}\,|a_n, b_p\rangle$$    (C-51)

This state is physically equivalent to $|a_n, b_p\rangle$. Again, specifying the result of the
measurement uniquely determines the final state of the system, which is therefore
independent of the initial ket $|\psi\rangle$.

If there are associated with $(a_n, b_p)$ several eigenvectors $|a_n, b_p, i\rangle$ of $A$
and $B$, we can go back to the first argument, measuring, at the same time as $A$
and $B$, a third observable $C$ which is compatible with both of them. We then arrive
at the following conclusion: for the state of the system after a measurement to
be completely defined uniquely by the result obtained, this measurement must be
made on a complete set of commuting observables (§D-3-b of chapter II). This is
the property which justifies physically the introduction of the concept of a C.S.C.O.

The methods that can be used to prepare a system in a well-defined quantum 
state are analogous, in principle, to those used to obtain polarized light. When 
a polarizer is placed in the path of a light beam, the outgoing light is polarized 
along a direction which is characteristic of the polarizer and therefore independent 
of the state of polarization of the incoming light. Similarly, we can construct devices, 
intended to prepare a quantum system, in such a way that they only allow the 
passage of one state, corresponding to a particular eigenvalue of each of the 
observables of the complete set chosen. We shall study a concrete example of the 
preparation of a quantum system in chapter IV (§B-1). 




CHAPTER III THE POSTULATES OF QUANTUM MECHANICS


comment: 

The measurement of a C.S.C.O. enables us to prepare only one of the 
basis states associated with this C.S.C.O. However, it is obvious that changing 
the set of observables allows us to obtain other states of the system. We shall 
see explicitly in a concrete example, in § B-1 of chapter IV, that we can 
prepare in this way any state of the space $\mathcal{E}$.


D. THE PHYSICAL IMPLICATIONS OF THE SCHRÖDINGER EQUATION


The Schrödinger equation plays a fundamental role in quantum mechanics since,
according to the sixth postulate stated above, it is the equation which governs
the time evolution of the physical system. In this section, we shall study in
detail the most important properties of this equation.


1. General properties of the Schrödinger equation

a. DETERMINISM IN THE EVOLUTION OF PHYSICAL SYSTEMS 

The Schrödinger equation:

$$i\hbar\frac{d}{dt}|\psi(t)\rangle = H(t)|\psi(t)\rangle$$    (D-1)

is of first order in $t$. From this it follows that, given the initial state $|\psi(t_0)\rangle$, the
state $|\psi(t)\rangle$ at any subsequent time $t$ is determined. There is no indeterminacy in
the time evolution of a quantum system. Indeterminacy appears only when a
physical quantity is measured, the state vector then undergoing an unpredictable
modification (cf. fifth postulate). However, between two measurements, the state
vector evolves in a perfectly deterministic way, in accordance with equation (D-1).

b. THE SUPERPOSITION PRINCIPLE 

Equation (D-l) is linear and homogeneous. It follows that its solutions are 
linearly superposable. 

Let $|\psi_1(t)\rangle$ and $|\psi_2(t)\rangle$ be two solutions of (D-1). If the initial state of the
system is $|\psi(t_0)\rangle = \lambda_1|\psi_1(t_0)\rangle + \lambda_2|\psi_2(t_0)\rangle$ (where $\lambda_1$ and $\lambda_2$ are two complex
constants), to it corresponds, at time $t$, the state $|\psi(t)\rangle = \lambda_1|\psi_1(t)\rangle + \lambda_2|\psi_2(t)\rangle$.
The correspondence between $|\psi(t_0)\rangle$ and $|\psi(t)\rangle$ is therefore linear. We shall later
study (complement $F_{III}$) the properties of the linear operator $U(t, t_0)$ which
transforms $|\psi(t_0)\rangle$ into $|\psi(t)\rangle$:

$$|\psi(t)\rangle = U(t, t_0)|\psi(t_0)\rangle$$    (D-2)
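
The superposition property can be observed by integrating equation (D-1) numerically for a two-level system. Everything specific in the sketch below is an assumption made for illustration: the 2 x 2 time-dependent Hamiltonian, the units (hbar = 1), the coefficients, and the use of a fixed-step fourth-order Runge-Kutta scheme in place of the exact evolution operator.

```python
import math

def rhs(h, psi, hbar=1.0):
    """Right-hand side of (D-1) solved for d|psi>/dt: H|psi> / (i hbar)."""
    return [sum(h[i][j] * psi[j] for j in range(2)) / (1j * hbar) for i in range(2)]

def evolve(hamiltonian, psi0, t0, t1, steps=2000):
    """Integrate the Schrodinger equation from t0 to t1 with classical RK4."""
    dt = (t1 - t0) / steps
    psi, t = list(psi0), t0
    for _ in range(steps):
        k1 = rhs(hamiltonian(t), psi)
        k2 = rhs(hamiltonian(t + dt / 2), [p + dt / 2 * k for p, k in zip(psi, k1)])
        k3 = rhs(hamiltonian(t + dt / 2), [p + dt / 2 * k for p, k in zip(psi, k2)])
        k4 = rhs(hamiltonian(t + dt), [p + dt * k for p, k in zip(psi, k3)])
        psi = [p + dt / 6 * (a + 2 * b + 2 * c + d)
               for p, a, b, c, d in zip(psi, k1, k2, k3, k4)]
        t += dt
    return psi

def hamiltonian(t):
    # Hypothetical Hermitian H(t), chosen only for illustration.
    return [[1.0, 0.5 * math.cos(t)], [0.5 * math.cos(t), -1.0]]

psi1, psi2 = [1 + 0j, 0j], [0j, 1 + 0j]
lam1, lam2 = 0.6, 0.8j
combo0 = [lam1 * a + lam2 * b for a, b in zip(psi1, psi2)]

out1 = evolve(hamiltonian, psi1, 0.0, 3.0)
out2 = evolve(hamiltonian, psi2, 0.0, 3.0)
out_combo = evolve(hamiltonian, combo0, 0.0, 3.0)
expected = [lam1 * a + lam2 * b for a, b in zip(out1, out2)]

print(max(abs(x - y) for x, y in zip(out_combo, expected)))
```

Evolving the superposition gives the same ket as superposing the two evolved kets, up to rounding error, which is the linearity expressed by (D-2).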
c. CONSERVATION OF PROBABILITY

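α. The norm of the state vector remains constant

Since the Hamiltonian operator $H(t)$ which appears in (D-1) is Hermitian,
the square of the norm of the state vector, $\langle\psi(t)|\psi(t)\rangle$, does not depend on $t$, as we
shall now show:

$$\frac{d}{dt}\langle\psi(t)|\psi(t)\rangle = \left[\frac{d}{dt}\langle\psi(t)|\right]|\psi(t)\rangle + \langle\psi(t)|\left[\frac{d}{dt}|\psi(t)\rangle\right]$$    (D-3)

According to (D-1), we can write:

$$\frac{d}{dt}|\psi(t)\rangle = \frac{1}{i\hbar}H(t)|\psi(t)\rangle$$    (D-4)

Taking the Hermitian conjugates of both sides of (D-4), we find:

$$\frac{d}{dt}\langle\psi(t)| = -\frac{1}{i\hbar}\langle\psi(t)|H^\dagger(t) = -\frac{1}{i\hbar}\langle\psi(t)|H(t)$$    (D-5)

since $H(t)$ is Hermitian (it is an observable). Substituting (D-4) and (D-5) into (D-3),
we obtain:

$$\frac{d}{dt}\langle\psi(t)|\psi(t)\rangle = -\frac{1}{i\hbar}\langle\psi(t)|H(t)|\psi(t)\rangle + \frac{1}{i\hbar}\langle\psi(t)|H(t)|\psi(t)\rangle = 0$$    (D-6)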
The property of norm conservation is very useful in quantum
mechanics. For example, it becomes indispensable when we interpret the square
of the modulus $|\psi(\mathbf{r}, t)|^2$ of the wave function of a spinless particle as being the
position probability density. The fact that the state $|\psi(t_0)\rangle$ of the particle is normalized
at time $t_0$ is expressed by the relation:

$$\langle\psi(t_0)|\psi(t_0)\rangle = \int d^3r\,|\psi(\mathbf{r}, t_0)|^2 = 1$$    (D-7)

where $\psi(\mathbf{r}, t_0) = \langle\mathbf{r}|\psi(t_0)\rangle$ is the wave function associated with $|\psi(t_0)\rangle$.
Equation (D-7) means that the total probability of finding the particle in all space is
equal to 1. The property of conservation of the norm which we have just proved is
expressed by the equation:

$$\langle\psi(t)|\psi(t)\rangle = \int d^3r\,|\psi(\mathbf{r}, t)|^2 = \langle\psi(t_0)|\psi(t_0)\rangle = 1$$    (D-8)

where $|\psi(t)\rangle$ is the solution of (D-1) which corresponds to the initial state $|\psi(t_0)\rangle$.
In other words, time evolution does not modify the global probability of finding the
particle in all space, which always remains equal to 1. Thus $|\psi(\mathbf{r}, t)|^2$ can be
interpreted as a probability density.
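
The role played by Hermiticity in (D-6) can be illustrated with a deliberately simple model. The sketch assumes a time-independent operator that is diagonal in an orthonormal basis, so that each component simply acquires a factor exp(-i E_n t / hbar); the energies, coefficients and the small imaginary part added in the second case are invented for illustration.

```python
import cmath

def norm_squared(coeffs, energies, t, hbar=1.0):
    """<psi(t)|psi(t)> for psi(t) = sum_n c_n exp(-i E_n t / hbar) |n>,
    where the |n> are orthonormal eigenvectors of a time-independent H."""
    return sum(abs(c * cmath.exp(-1j * e * t / hbar)) ** 2
               for c, e in zip(coeffs, energies))

coeffs = [0.6, 0.8j]                          # normalized: 0.36 + 0.64 = 1
hermitian_energies = [1.0, 2.5]               # real eigenvalues: H is Hermitian
lossy_energies = [1.0 - 0.1j, 2.5 - 0.1j]     # complex eigenvalues: H is not Hermitian

for t in (0.0, 2.5, 5.0):
    print(t,
          norm_squared(coeffs, hermitian_energies, t),
          norm_squared(coeffs, lossy_energies, t))
```

With real eigenvalues the squared norm stays equal to 1 at all times; with the complex eigenvalues above it decays as exp(-0.2 t), so that the probabilistic interpretation would no longer be consistent.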
β. Local conservation of probability. Probability densities and probability currents

In this paragraph, we shall confine ourselves to the case of a physical system 
composed of only one (spinless) particle. 

In this case, if $\psi(\mathbf{r}, t)$ is normalized,

$$\rho(\mathbf{r}, t) = |\psi(\mathbf{r}, t)|^2$$    (D-9)

is a probability density: the probability $d\mathcal{P}(\mathbf{r}, t)$ of finding, at time $t$, the particle
in an infinitesimal volume $d^3r$ located at the point $\mathbf{r}$ is equal to:

$$d\mathcal{P}(\mathbf{r}, t) = \rho(\mathbf{r}, t)\,d^3r$$    (D-10)

We have just shown that the integral of $\rho(\mathbf{r}, t)$ over all space remains constant
for all time (and equal to 1 if $\psi$ is normalized). This does not mean that $\rho(\mathbf{r}, t)$ must
be independent of $t$ at every point $\mathbf{r}$. The situation is analogous to the one encountered
in electromagnetism. If, in an isolated physical system, there is a charge
distributed in space with the volume density $\rho(\mathbf{r}, t)$, the total charge [the integral
of $\rho(\mathbf{r}, t)$ over all space] is conserved over time. However, within the system, the
spatial distribution of this charge may vary, giving rise to electric currents.

In fact, this analogy can be carried further. Global conservation of electrical
charge is based on local conservation. If the charge $Q$ contained within a fixed
volume $V$ varies over time, the closed surface $S$ which limits $V$ must be traversed by
an electric current. More precisely, the variation $dQ$ during a time $dt$ of the charge
contained within $V$ is equal to $-I\,dt$, where $I$ is the intensity of the current which
traverses $S$, that is, the flux of the vector current density $\mathbf{J}(\mathbf{r}, t)$ leaving $S$. Using
classical vector analysis, we can express local conservation of electrical charge in the
form:

$$\frac{\partial}{\partial t}\rho(\mathbf{r}, t) + \mathrm{div}\,\mathbf{J}(\mathbf{r}, t) = 0$$    (D-11)

We are going to show here that it is possible to find a vector $\mathbf{J}(\mathbf{r}, t)$, a probability
current, which satisfies an equation identical to (D-11): there is then local conservation
of probability. Thus, it is as if we were dealing with a "probability fluid"
whose density and motion are described by $\rho(\mathbf{r}, t)$ and $\mathbf{J}(\mathbf{r}, t)$. If the probability of
finding the particle in the (fixed) volume $d^3r$ about $\mathbf{r}$ varies over time, it means that
the probability current has a non-zero flux across the surface which limits this
volume element.

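First of all, let us assume that the particle under study is subject to a scalar
potential $V(\mathbf{r}, t)$. Its Hamiltonian is then:

$$H = \frac{\mathbf{P}^2}{2m} + V(\mathbf{R}, t)$$    (D-12)

and the Schrödinger equation is written, in the $\{|\mathbf{r}\rangle\}$ representation (see
complement $D_{II}$):

$$i\hbar\frac{\partial}{\partial t}\psi(\mathbf{r}, t) = -\frac{\hbar^2}{2m}\Delta\psi(\mathbf{r}, t) + V(\mathbf{r}, t)\,\psi(\mathbf{r}, t)$$    (D-13)

$V(\mathbf{r}, t)$ must be real for $H$ to be Hermitian. The complex conjugate equation
of (D-13) is therefore:

$$-i\hbar\frac{\partial}{\partial t}\psi^*(\mathbf{r}, t) = -\frac{\hbar^2}{2m}\Delta\psi^*(\mathbf{r}, t) + V(\mathbf{r}, t)\,\psi^*(\mathbf{r}, t)$$    (D-14)

Multiply both sides of (D-13) by $\psi^*(\mathbf{r}, t)$ and both sides of (D-14) by $-\psi(\mathbf{r}, t)$.
Add the two equations thus obtained. It follows that:

$$i\hbar\frac{\partial}{\partial t}[\psi^*(\mathbf{r}, t)\,\psi(\mathbf{r}, t)] = -\frac{\hbar^2}{2m}[\psi^*\Delta\psi - \psi\Delta\psi^*]$$    (D-15)

that is:

$$\frac{\partial}{\partial t}\rho(\mathbf{r}, t) + \frac{\hbar}{2mi}[\psi^*(\mathbf{r}, t)\,\Delta\psi(\mathbf{r}, t) - \psi(\mathbf{r}, t)\,\Delta\psi^*(\mathbf{r}, t)] = 0$$    (D-16)

If we set:

$$\mathbf{J}(\mathbf{r}, t) = \frac{\hbar}{2mi}[\psi^*\nabla\psi - \psi\nabla\psi^*] = \frac{1}{m}\,\mathrm{Re}\left[\psi^*\frac{\hbar}{i}\nabla\psi\right]$$    (D-17)

equation (D-16) can be put into the form of (D-11) since:

$$\mathrm{div}\,\mathbf{J}(\mathbf{r}, t) = \nabla\cdot\mathbf{J} = \frac{\hbar}{2mi}[(\nabla\psi^*)\cdot(\nabla\psi) + \psi^*(\Delta\psi) - (\nabla\psi)\cdot(\nabla\psi^*) - \psi(\Delta\psi^*)] = \frac{\hbar}{2mi}[\psi^*\Delta\psi - \psi\Delta\psi^*]$$    (D-18)

We have therefore proved the equation of local conservation of probability and
have found the expression for the probability current in terms of the normalized
wave function $\psi(\mathbf{r}, t)$.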
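
The local conservation equation can be checked numerically at a single instant. The sketch below makes several assumptions that are not in the text: one dimension, hbar = m = 1, a free particle (V = 0), a Gaussian wave packet with mean wave number K as the wave function, and finite-difference derivatives. The time derivative of the density is obtained from (D-13) rather than by propagating the packet.

```python
import cmath
import math

HBAR = MASS = 1.0
DX = 0.01
xs = [-8 + DX * i for i in range(1601)]

def central(f):
    """Central first difference; values at interior points only."""
    return [(f[i + 1] - f[i - 1]) / (2 * DX) for i in range(1, len(f) - 1)]

def second(f):
    """Three-point second difference at interior points."""
    return [(f[i + 1] - 2 * f[i] + f[i - 1]) / DX**2 for i in range(1, len(f) - 1)]

# Assumed wave function: normalized Gaussian packet with mean wave number K.
K, SIGMA = 2.0, 1.0
psi = [(2 * math.pi * SIGMA**2) ** -0.25
       * cmath.exp(-x * x / (4 * SIGMA**2) + 1j * K * x) for x in xs]

# Probability current (D-17) in one dimension: J = (hbar/m) Im(psi* dpsi/dx).
dpsi = central(psi)
current = [HBAR / MASS * (p.conjugate() * d).imag
           for p, d in zip(psi[1:-1], dpsi)]
div_j = central(current)

# (D-13) with V = 0: d psi/dt = (i hbar / 2m) psi'', so d rho/dt = 2 Re(psi* d psi/dt).
d2psi = second(psi)
drho_dt = [2 * (p.conjugate() * 1j * HBAR / (2 * MASS) * d2).real
           for p, d2 in zip(psi[1:-1], d2psi)]

# Both lists must refer to the same grid points before they are added.
residual = max(abs(a + b) for a, b in zip(drho_dt[1:-1], div_j))
print("max |d rho/dt + dJ/dx| =", residual)
print("max |dJ/dx|           =", max(abs(d) for d in div_j))
```

The residual is small compared with the individual terms, which are of order one half here, so the density changes locally exactly as the divergence of the current requires, within discretization error.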


COMMENT: 

The form of the probability current (D-17) can be interpreted as follows.
$\mathbf{J}(\mathbf{r}, t)$ appears as the mean value, in the state $|\psi(t)\rangle$, of an operator $\mathbf{K}(\mathbf{r})$ given by:

$$\mathbf{K}(\mathbf{r}) = \frac{1}{2m}\left[|\mathbf{r}\rangle\langle\mathbf{r}|\,\mathbf{P} + \mathbf{P}\,|\mathbf{r}\rangle\langle\mathbf{r}|\right]$$    (D-19)

Now the mean value of the operator $|\mathbf{r}\rangle\langle\mathbf{r}|$ is $|\psi(\mathbf{r}, t)|^2$, that is, the probability
density $\rho(\mathbf{r}, t)$, and $\mathbf{P}/m$ is the velocity operator $\mathbf{V}$. Therefore, $\mathbf{K}$ is the quantum operator
constructed, with the help of an appropriate symmetrization, from the product of the
probability density and the velocity of the particle. This indeed corresponds to the vector
current density of a classical fluid (it is well known, for example, that the electrical
current density associated with a fluid of charged particles is equal to the product of the
charge volume density and the velocity of the particles).
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If the particle is placed in an electromagnetic field described by the poten¬ 
tials U(r,t) and A(r, /), we can use the preceding argument, starting with the 
Hamiltonian (B-46). We then find, in this case : 


J(r, t) =j- Re 


{**[; 


" V- qA 


I* 


(D-20) 


We see that this expression can be obtained from (D-17), using the same rule as was 
used for the Hamiltonian: P is simply replaced by P — q\. 

Example of a plane wave. Consider a wave function of the form:

$$\psi(\mathbf{r}, t) = A\,e^{i(\mathbf{k}\cdot\mathbf{r} - \omega t)} \tag{D-21}$$

with $\hbar\omega = \dfrac{\hbar^2\mathbf{k}^2}{2m}$. The corresponding probability density:

$$\rho(\mathbf{r}, t) = |\psi(\mathbf{r}, t)|^2 = |A|^2 \tag{D-22}$$

is uniform throughout all space and does not depend on time. The calculation
of $\mathbf{J}(\mathbf{r}, t)$ from (D-17) presents no difficulties and leads to:

$$\mathbf{J}(\mathbf{r}, t) = |A|^2\,\frac{\hbar\mathbf{k}}{m} = \rho(\mathbf{r}, t)\,\mathbf{v}_G \tag{D-23}$$

where $\mathbf{v}_G = \dfrac{\hbar\mathbf{k}}{m}$ is the group velocity associated with the momentum $\hbar\mathbf{k}$ (chap. I,
§C-4). We see that the probability current is indeed equal to the product of the
probability density and the group velocity of the particle. In this case, $\rho$ and $\mathbf{J}$ are
time-independent: the flow of the probability fluid associated with a plane wave is
in a steady state condition (since $\rho$ and $\mathbf{J}$ do not depend on $\mathbf{r}$ either, this state is also
homogeneous and uniform).
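
As a numerical illustration of (D-23), the following minimal sketch evaluates the one-dimensional form of (D-17), $J = (\hbar/m)\,\mathrm{Im}(\psi^*\,\partial\psi/\partial x)$, for a plane wave by finite differences. The names `plane_wave` and `probability_current`, the choice of units $\hbar = m = 1$, and the values of the amplitude and of $k$ are assumptions made only for this example.

```python
import cmath

HBAR = 1.0  # assumption: natural units, hbar = 1
MASS = 1.0  # assumption: unit mass


def plane_wave(x, t, amplitude=0.5 + 0.5j, k=2.0):
    """One-dimensional version of (D-21), with hbar*omega = hbar^2 k^2 / 2m."""
    omega = HBAR * k ** 2 / (2.0 * MASS)
    return amplitude * cmath.exp(1j * (k * x - omega * t))


def probability_current(psi, x, t, dx=1e-5):
    """One-dimensional version of (D-17): J = (hbar/m) Im(psi* dpsi/dx).

    The derivative is approximated by a central difference (illustrative choice).
    """
    dpsi = (psi(x + dx, t) - psi(x - dx, t)) / (2.0 * dx)
    return (HBAR / MASS) * (psi(x, t).conjugate() * dpsi).imag


if __name__ == "__main__":
    rho = abs(plane_wave(0.3, 0.7)) ** 2
    current = probability_current(plane_wave, 0.3, 0.7)
    # Compare J with rho * v_G, where v_G = hbar k / m and k = 2.0 here.
    print(rho, current, rho * HBAR * 2.0 / MASS)
```

Evaluating the current at other values of x and t should give the same number, in agreement with the statement that the flow is steady and uniform.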


d. EVOLUTION OF THE MEAN VALUE OF AN OBSERVABLE; 

RELATIONSHIP WITH CLASSICAL MECHANICS 

Let A be an observable. If the state $|\psi(t)\rangle$ of the system is normalized (and
we have just seen that this normalization is conserved for all t), the mean value of
the observable A at the instant t is equal to*:

$$\langle A\rangle(t) = \langle\psi(t)|A|\psi(t)\rangle \tag{D-24}$$

We see that $\langle A\rangle(t)$ depends on t through $|\psi(t)\rangle$ [and $\langle\psi(t)|$], which evolve
over time according to the Schrödinger equation (D-4) [and (D-5)]. Moreover,
the observable A may depend explicitly on time, causing an additional variation
of $\langle A\rangle(t)$ with respect to t.

We intend to study, in this section, the evolution of $\langle A\rangle(t)$ and to show
how this enables us to relate classical mechanics to quantum mechanics.


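* The notation $\langle A\rangle(t)$ means that the mean value $\langle A\rangle$ of A is a number which depends on t.

α. General formula

Differentiating (D-24) with respect to t, we obtain:

$$\frac{d}{dt}\langle\psi(t)|A(t)|\psi(t)\rangle = \left[\frac{d}{dt}\langle\psi(t)|\right]A(t)|\psi(t)\rangle + \langle\psi(t)|A(t)\left[\frac{d}{dt}|\psi(t)\rangle\right] + \langle\psi(t)|\frac{\partial A}{\partial t}|\psi(t)\rangle \tag{D-25}$$

Using (D-4) and (D-5) for $|\psi(t)\rangle$ and $\langle\psi(t)|$, we find:

$$\frac{d}{dt}\langle\psi(t)|A(t)|\psi(t)\rangle = \frac{1}{i\hbar}\langle\psi(t)|\left[A(t)H(t) - H(t)A(t)\right]|\psi(t)\rangle + \langle\psi(t)|\frac{\partial A}{\partial t}|\psi(t)\rangle \tag{D-26}$$

that is:

$$\frac{d}{dt}\langle A\rangle = \frac{1}{i\hbar}\langle[A, H(t)]\rangle + \left\langle\frac{\partial A}{\partial t}\right\rangle \tag{D-27}$$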


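COMMENT:

The mean value $\langle A\rangle$ is a number which depends only on t. It is essential
to understand how this dependence arises. For example, consider the case of
a spinless particle. Let $\mathcal{A}(\mathbf{r}, \mathbf{p}, t)$ be a classical quantity. In classical mechanics,
$\mathbf{r}$ and $\mathbf{p}$ depend on time (they evolve according to the Hamilton equations),
so that $\mathcal{A}(\mathbf{r}, \mathbf{p}, t)$ depends on t explicitly, and implicitly through $\mathbf{r}$
and $\mathbf{p}$. To the classical quantity $\mathcal{A}(\mathbf{r}, \mathbf{p}, t)$ corresponds the Hermitian operator
$A = \mathcal{A}(\mathbf{R}, \mathbf{P}, t)$, obtained by replacing, in $\mathcal{A}$, $\mathbf{r}$ and $\mathbf{p}$ by the operators
$\mathbf{R}$ and $\mathbf{P}$ (quantization rules, see § B-5). The eigenstates and eigenvalues
of $\mathbf{R}$ and $\mathbf{P}$ and, consequently, these observables themselves, no longer depend
on t. The time dependence of $\mathbf{r}$ and $\mathbf{p}$, which characterizes the time evolution
of the classical state, no longer appears in $\mathbf{R}$ and $\mathbf{P}$, but in the quantum
state vector $|\psi(t)\rangle$, associated in the $\{|\mathbf{r}\rangle\}$ representation with the wave
function $\psi(\mathbf{r}, t) = \langle\mathbf{r}|\psi(t)\rangle$. In this representation, the mean value of A is
written:

$$\langle A\rangle = \int d^3r\;\psi^*(\mathbf{r}, t)\,\mathcal{A}\!\left(\mathbf{r}, \frac{\hbar}{i}\nabla, t\right)\psi(\mathbf{r}, t) \tag{D-28}$$

It is clear that integration over $\mathbf{r}$ leads to a number which only depends on t.
With regard to classical mechanics, it is this number and not the operator
$\mathcal{A}(\mathbf{r}, \frac{\hbar}{i}\nabla, t)$ which is to be compared with the value taken on by the
classical quantity $\mathcal{A}(\mathbf{r}, \mathbf{p}, t)$ at time t (cf. § γ below).

β. Application to the observables R and P (Ehrenfest’s theorem)

Now let us apply the general formula (D-27) to the observables $\mathbf{R}$ and $\mathbf{P}$.
We shall consider, for simplicity, the case of a spinless particle in a scalar stationary
potential $V(\mathbf{r})$. We then have:

$$H = \frac{\mathbf{P}^2}{2m} + V(\mathbf{R}) \tag{D-29}$$

so that we can write:

$$\frac{d}{dt}\langle\mathbf{R}\rangle = \frac{1}{i\hbar}\langle[\mathbf{R}, H]\rangle = \frac{1}{i\hbar}\left\langle\left[\mathbf{R}, \frac{\mathbf{P}^2}{2m}\right]\right\rangle \tag{D-30}$$

$$\frac{d}{dt}\langle\mathbf{P}\rangle = \frac{1}{i\hbar}\langle[\mathbf{P}, H]\rangle = \frac{1}{i\hbar}\langle[\mathbf{P}, V(\mathbf{R})]\rangle \tag{D-31}$$

The commutator which appears in (D-30) can easily be calculated from the canonical
commutation relations; we obtain:

$$\left[\mathbf{R}, \frac{\mathbf{P}^2}{2m}\right] = \frac{i\hbar}{m}\,\mathbf{P} \tag{D-32}$$

For the one in formula (D-31), the following generalization of formula (B-33) must
be used [cf. complement B_II, formula (48)]:

$$[\mathbf{P}, V(\mathbf{R})] = -i\hbar\,\nabla V(\mathbf{R}) \tag{D-33}$$

where $\nabla V(\mathbf{R})$ denotes the set of three operators obtained by replacing $\mathbf{r}$ by $\mathbf{R}$ in the
three components of the gradient of the function $V(\mathbf{r})$. Therefore:

$$\frac{d}{dt}\langle\mathbf{R}\rangle = \frac{1}{m}\langle\mathbf{P}\rangle \tag{D-34}$$

$$\frac{d}{dt}\langle\mathbf{P}\rangle = -\langle\nabla V(\mathbf{R})\rangle \tag{D-35}$$

These two equations express Ehrenfest’s theorem. Their form recalls that of the
classical Hamilton-Jacobi equations for a particle (appendix III, §3):

$$\frac{d\mathbf{r}}{dt} = \frac{\mathbf{p}}{m} \tag{D-36-a}$$

$$\frac{d\mathbf{p}}{dt} = -\nabla V(\mathbf{r}) \tag{D-36-b}$$

which reduce, in this simple case, to Newton’s well-known equation:

$$\frac{d\mathbf{p}}{dt} = m\frac{d^2\mathbf{r}}{dt^2} = -\nabla V(\mathbf{r}) \tag{D-37}$$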


γ. Discussion of Ehrenfest’s theorem; classical limit

Let us analyze the physical meaning of Ehrenfest’s theorem, that is,
equations (D-34) and (D-35). We shall assume that the wave function $\psi(\mathbf{r}, t)$
which describes the state of the particle is a wave packet like the ones
we studied in chapter I. $\langle\mathbf{R}\rangle$ then represents a set of three time-dependent
numbers $\{\langle X\rangle, \langle Y\rangle, \langle Z\rangle\}$. We shall call the point $\langle\mathbf{R}\rangle(t)$ the center of the
wave packet * at the instant t. The set of those points which correspond to the
various values of t constitutes the trajectory followed by the center of the wave
packet. Recall, however, that one can never rigorously speak of the trajectory
of the particle itself, whose state is described by the wave packet as a whole, which
inevitably has a certain spatial extension. We see, nevertheless, that if this extension
is much smaller than the other distances involved in the problem, we can
approximate the wave packet by its center. In this limiting case, there is no
appreciable difference between the quantum and classical descriptions of the
particle.

It is therefore important to know the answer to the following question:
does the motion of the center of the wave packet obey the laws of classical mechanics?
This answer is supplied by Ehrenfest’s theorem. Equation (D-34) expresses
the fact that the velocity of the center of the wave packet is equal to the average
momentum of this wave packet divided by m. Consequently, the left-hand side
of (D-35) can be written $m\dfrac{d^2}{dt^2}\langle\mathbf{R}\rangle$, so that the answer to the preceding question
will be affirmative if the right-hand side of (D-35) is equal to the classical force $\mathbf{F}_{cl}$
at the point where the center of the wave packet is situated:

$$\mathbf{F}_{cl} = \left[-\nabla V(\mathbf{r})\right]_{\mathbf{r} = \langle\mathbf{R}\rangle} \tag{D-38}$$

In fact, the right-hand side of (D-35) is equal to the average of the force over the
whole wave packet, and, in general:

$$\langle\nabla V(\mathbf{R})\rangle \neq \left[\nabla V(\mathbf{r})\right]_{\mathbf{r} = \langle\mathbf{R}\rangle} \tag{D-39}$$

(in other words, the mean value of a function is not equal to its value for the mean
value of the variable). If we are rigorous, the answer to the question we asked is
therefore negative.


COMMENT : 

It is easy to convince ourselves of (D-39) if we consider a concrete example.
Let us choose, for simplicity, a one-dimensional model, and assume that:

$$V(x) = \lambda x^n \tag{D-40}$$

where $\lambda$ is a real constant and n, a positive integer. From this we deduce the operator
associated with $V(x)$:

$$V(X) = \lambda X^n \tag{D-41}$$

The left-hand side of (D-39) can be written (replacing $\nabla$ by $d/dx$) $\lambda n\langle X^{n-1}\rangle$. As for the
right-hand side, it is equal to:

$$\left[\frac{dV}{dx}\right]_{x = \langle X\rangle} = \left[\lambda n\,x^{n-1}\right]_{x = \langle X\rangle} = \lambda n\,\langle X\rangle^{n-1} \tag{D-42}$$

Now we know that in general $\langle X^{n-1}\rangle \neq \langle X\rangle^{n-1}$; for example, for n = 3, we
have $\langle X^2\rangle \neq \langle X\rangle^2$ (since the difference between these two quantities enters into the
calculation of the root-mean-square deviation $\Delta X$).

Note however that for n = 1 or 2, $\langle X^{n-1}\rangle = \langle X\rangle^{n-1}$. The two sides of (D-39)
are then equal. The same holds true, moreover, for n = 0, in which case both sides are
equal to zero. For a free particle (n = 0), or a particle placed in a uniform force field (n = 1)
or in a parabolic potential well (n = 2; the case of a harmonic oscillator), the motion
of the center of the wave packet therefore rigorously obeys the laws of classical mechanics.
We have already, moreover, established this result for the free particle (n = 0)
(cf. chapter I, §C-4).

* The center and the maximum of a wave packet are, in general, distinct. They coincide, however,
if the wave packet has a symmetrical shape (§05, fig. 2).
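
The inequality (D-39) for the potential (D-40) can be checked numerically. The sketch below assumes, purely for illustration, a Gaussian probability density $|\psi(x)|^2$ of center `x0` and width `sigma`, and computes $\langle X^{n-1}\rangle - \langle X\rangle^{n-1}$ by a simple Riemann sum. The function names, the Gaussian shape and the integration grid are assumptions of this example, not part of the argument above.

```python
import math


def moment(power, x0, sigma, half_width=10.0, steps=20001):
    """Mean value <X**power> for a Gaussian density centered at x0 (assumed shape).

    The integral is approximated by a Riemann sum over [x0 - 10 sigma, x0 + 10 sigma].
    """
    start = x0 - half_width * sigma
    dx = 2.0 * half_width * sigma / (steps - 1)
    weighted_sum = 0.0
    norm = 0.0
    for i in range(steps):
        x = start + i * dx
        weight = math.exp(-((x - x0) ** 2) / (2.0 * sigma ** 2))
        weighted_sum += weight * x ** power
        norm += weight
    return weighted_sum / norm


def force_mismatch(n, x0, sigma):
    """Return <X^(n-1)> - <X>^(n-1) for V(x) = lambda x^n, with n >= 1.

    Up to the factor lambda*n, this is the difference between the two
    sides of (D-39).
    """
    if n < 1:
        raise ValueError("this sketch only handles n >= 1")
    return moment(n - 1, x0, sigma) - moment(1, x0, sigma) ** (n - 1)


if __name__ == "__main__":
    for n in (1, 2, 3):
        print(n, force_mismatch(n, x0=1.0, sigma=0.5))
    # For n = 3 the mismatch is the variance sigma**2, which shrinks
    # as the packet becomes more localized.
    print(force_mismatch(3, x0=1.0, sigma=0.01))
```

For n = 1 and n = 2 the mismatch vanishes, while for n = 3 it equals the variance of the assumed density, so that it becomes negligible when the packet is narrow.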


Although the two sides of (D-39) are not, in general, equal, there exist
situations (called quasi-classical) where the difference between these two quantities
is negligible: this is the case when the wave packet is sufficiently localized. To see
this, let us write explicitly, in the $\{|\mathbf{r}\rangle\}$ representation, the left-hand side of this
equation:

$$\langle\nabla V(\mathbf{R})\rangle = \int d^3r\;\psi^*(\mathbf{r}, t)\left[\nabla V(\mathbf{r})\right]\psi(\mathbf{r}, t) = \int d^3r\;|\psi(\mathbf{r}, t)|^2\,\nabla V(\mathbf{r}) \tag{D-43}$$

Let us assume the wave packet to be highly localized: more precisely, $|\psi(\mathbf{r}, t)|^2$
takes on non-negligible values only within a domain whose dimensions are much
smaller than the distances over which $V(\mathbf{r})$ varies appreciably. Then, within this
domain, centered about $\langle\mathbf{R}\rangle$, $\nabla V(\mathbf{r})$ is practically constant. Therefore, in (D-43),
$\nabla V(\mathbf{r})$ can be replaced by its value for $\mathbf{r} = \langle\mathbf{R}\rangle$ and taken outside the integral,
which is then equal to 1, since $\psi(\mathbf{r}, t)$ is normalized. Thus we find that for sufficiently
localized wave packets:

$$\langle\nabla V(\mathbf{R})\rangle \simeq \left[\nabla V(\mathbf{r})\right]_{\mathbf{r} = \langle\mathbf{R}\rangle} \tag{D-44}$$

In the macroscopic limit (where the de Broglie wavelengths are much smaller than
the distances over which the potential varies*), wave packets can be made sufficiently
small to satisfy (D-44) while retaining a good degree of definition for the
momentum. The motion of the wave packet is then practically that of a classical
particle of mass m placed in the potential $V(\mathbf{r})$. The result that we have thus established
is very important since it enables us to show that the equations of classical mechanics
follow from the Schrödinger equation in certain limiting conditions satisfied,
in particular, by most macroscopic systems.

* See the order of magnitude of the de Broglie wavelengths associated with a macroscopic system
in complement A_I.


2. The case of conservative systems

When the Hamiltonian of a physical system does not depend explicitly on
time, the system is said to be conservative. In classical mechanics, the most important
consequence of such a situation is the conservation of energy over time. It can
also be said that the total energy of the system is a constant of the motion. We shall
see in this section that in quantum mechanics as well, conservative systems possess
important special properties in addition to the general properties of the preceding
section.


a. SOLUTION OF THE SCHRODINGER EQUATION 

First, let us consider the eigenvalue equation for H:

$$H\,|\varphi_{n,\tau}\rangle = E_n\,|\varphi_{n,\tau}\rangle \tag{D-45}$$

For simplicity, we assume the spectrum of H to be discrete. $\tau$ denotes the set of
indices other than n which are necessary for characterizing a unique vector $|\varphi_{n,\tau}\rangle$
(in general, these indices will fix the eigenvalues of operators which form a C.S.C.O.
with H). Since, by hypothesis, H does not depend explicitly on time, neither the
eigenvalue $E_n$ nor the eigenket $|\varphi_{n,\tau}\rangle$ is t dependent.

First, we are going to show that given the $E_n$ and the $|\varphi_{n,\tau}\rangle$, it is very simple
to solve the Schrödinger equation, that is, to determine the time evolution of any
state. Since the $|\varphi_{n,\tau}\rangle$ form a basis (H is an observable), it is always possible, for
every value of t, to expand any state $|\psi(t)\rangle$ of the system in terms of the $|\varphi_{n,\tau}\rangle$:

$$|\psi(t)\rangle = \sum_{n,\tau} c_{n,\tau}(t)\,|\varphi_{n,\tau}\rangle \tag{D-46}$$

with:

$$c_{n,\tau}(t) = \langle\varphi_{n,\tau}|\psi(t)\rangle \tag{D-47}$$

Since the $|\varphi_{n,\tau}\rangle$ do not depend on t, all the time dependence of $|\psi(t)\rangle$ is contained
within the $c_{n,\tau}(t)$. To calculate the $c_{n,\tau}(t)$, let us project the Schrödinger equation
onto each of the states $|\varphi_{n,\tau}\rangle$. This yields*:

$$i\hbar\frac{d}{dt}\langle\varphi_{n,\tau}|\psi(t)\rangle = \langle\varphi_{n,\tau}|H|\psi(t)\rangle \tag{D-48}$$

Since H is Hermitian, it can be deduced from (D-45) that:

$$\langle\varphi_{n,\tau}|\,H = E_n\,\langle\varphi_{n,\tau}| \tag{D-49}$$

so that (D-48) can be written in the form:

$$i\hbar\frac{d}{dt}c_{n,\tau}(t) = E_n\,c_{n,\tau}(t) \tag{D-50}$$

This equation can be integrated directly to give:

$$c_{n,\tau}(t) = c_{n,\tau}(t_0)\,e^{-iE_n(t - t_0)/\hbar} \tag{D-51}$$

* In (D-48), $\langle\varphi_{n,\tau}|$ can be placed to the right of $\dfrac{d}{dt}$, since $\langle\varphi_{n,\tau}|$ does not depend on t.

When H does not depend explicitly on time, to find $|\psi(t)\rangle$, given $|\psi(t_0)\rangle$,
proceed as follows:

(i) Expand $|\psi(t_0)\rangle$ in terms of a basis of eigenstates of H:

$$|\psi(t_0)\rangle = \sum_n\sum_\tau c_{n,\tau}(t_0)\,|\varphi_{n,\tau}\rangle \tag{D-52}$$

where $c_{n,\tau}(t_0)$ is given by the usual formula:

$$c_{n,\tau}(t_0) = \langle\varphi_{n,\tau}|\psi(t_0)\rangle \tag{D-53}$$

(ii) Now, to obtain $|\psi(t)\rangle$ for arbitrary t, multiply each coefficient $c_{n,\tau}(t_0)$
of the expansion (D-52) by $e^{-iE_n(t - t_0)/\hbar}$, where $E_n$ is the eigenvalue of H associated
with the state $|\varphi_{n,\tau}\rangle$:

$$|\psi(t)\rangle = \sum_n\sum_\tau c_{n,\tau}(t_0)\,e^{-iE_n(t - t_0)/\hbar}\,|\varphi_{n,\tau}\rangle \tag{D-54}$$

The preceding argument can easily be generalized to the case where the
spectrum of H is continuous; formula (D-54) then becomes, with obvious notation:

$$|\psi(t)\rangle = \sum_\tau\int dE\;c_\tau(E, t_0)\,e^{-iE(t - t_0)/\hbar}\,|\varphi_{E,\tau}\rangle \tag{D-55}$$
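
Rule (ii) above, formula (D-54), lends itself to a very short numerical sketch for a discrete, finite set of eigenstates. The function name `evolve`, the units $\hbar = 1$ and the sample energies and coefficients are assumptions chosen only for illustration; the coefficients are taken as already computed from (D-53).

```python
import cmath
import math

HBAR = 1.0  # assumption: natural units, hbar = 1


def evolve(coeffs, energies, t, t0=0.0):
    """Apply (D-54): multiply each c_n(t0) by exp(-i E_n (t - t0) / hbar).

    coeffs[n] is the coefficient on the eigenstate of energy energies[n].
    """
    return [
        c * cmath.exp(-1j * energy * (t - t0) / HBAR)
        for c, energy in zip(coeffs, energies)
    ]


if __name__ == "__main__":
    energies = [1.0, 3.0]   # illustrative eigenvalues E_1, E_2
    initial = [0.6, 0.8j]   # illustrative coefficients, normalized to 1
    later = evolve(initial, energies, t=0.4)
    # Each modulus is unchanged; only the phases rotate, at different rates.
    print([abs(c) for c in later])
    # Relative phase between the two components at t0 and at t.
    print(cmath.phase(initial[1] / initial[0]), cmath.phase(later[1] / later[0]))
```

The moduli of the coefficients are conserved, so the norm of the state is conserved as well; what changes over time is the relative phase between components of different energies.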


b. STATIONARY STATES 

An important special case is that in which $|\psi(t_0)\rangle$ is itself an eigenstate of H.
Expansion (D-52) of $|\psi(t_0)\rangle$ then involves only eigenstates of H with the same
eigenvalue (for example, $E_n$):

$$|\psi(t_0)\rangle = \sum_\tau c_{n,\tau}(t_0)\,|\varphi_{n,\tau}\rangle \tag{D-56}$$

In formula (D-56), there is no summation over n, and the passage from $|\psi(t_0)\rangle$
to $|\psi(t)\rangle$ involves only one factor $e^{-iE_n(t - t_0)/\hbar}$, which can be taken outside the
summation over $\tau$:

$$|\psi(t)\rangle = \sum_\tau c_{n,\tau}(t_0)\,e^{-iE_n(t - t_0)/\hbar}\,|\varphi_{n,\tau}\rangle
= e^{-iE_n(t - t_0)/\hbar}\sum_\tau c_{n,\tau}(t_0)\,|\varphi_{n,\tau}\rangle
= e^{-iE_n(t - t_0)/\hbar}\,|\psi(t_0)\rangle \tag{D-57}$$

$|\psi(t)\rangle$ and $|\psi(t_0)\rangle$ therefore differ only by the global phase factor $e^{-iE_n(t - t_0)/\hbar}$.
These two states are physically indistinguishable (cf. discussion in §B-3-b-γ). From
this we conclude that all the physical properties of a system which is in an
eigenstate of H do not vary over time; the eigenstates of H are called, for this
reason, stationary states.

It is also interesting to see how conservation of energy in a conservative
system appears in quantum mechanics. Let us assume that, at time $t_0$, we measure
the energy of such a system and we find, for example, $E_k$. Immediately after the
measurement, the system is in an eigenstate of H, with an eigenvalue of $E_k$ (the
postulate of the reduction of the wave packet). We have just seen that the
eigenstates of H are stationary states. Therefore, the state of the system will no
longer evolve after the first measurement and will always remain an eigenstate of H
with an eigenvalue of $E_k$. It follows that a second measurement of the energy of the
system, at any subsequent time t, will always yield the same result $E_k$ as the first one.

comment: 

One passes from (D-52) to (D-54) by multiplying each coefficient $c_{n,\tau}(t_0)$
of (D-52) by $e^{-iE_n(t - t_0)/\hbar}$. The fact that $e^{-iE_n(t - t_0)/\hbar}$ is a phase factor should
not lead us to believe that $|\psi(t)\rangle$ and $|\psi(t_0)\rangle$ are always physically
indistinguishable. Actually, expansion (D-52) involves, in general, several eigenstates
of H with different eigenvalues. To these different possible values of $E_n$
correspond different phase factors. This modifies the relative phases of the
expansion coefficients of the state vector and leads, consequently, to a
state $|\psi(t)\rangle$ which is physically distinct from $|\psi(t_0)\rangle$.

Only in the case where only one value of n enters into (D-52) [the case
where $|\psi(t_0)\rangle$ is an eigenstate of H] is the time evolution described by a
single phase factor, which is then a global one, of no physical importance.
In other words, there is physical evolution over time only if the energy of the
initial state is not known with certainty. We shall come back later to the
relation between time evolution and energy uncertainty (cf. §D-2-e).


c. CONSTANTS OF THE MOTION 

By definition, a constant of the motion is an observable A which does not
depend explicitly on time and which commutes with H:

$$\frac{\partial A}{\partial t} = 0, \qquad [A, H] = 0 \tag{D-58}$$

For a conservative system, H is therefore itself a constant of the motion.

Constants of the motion possess important properties which we are now
going to derive.

(i) If we substitute (D-58) into the general formula (D-27), we find:

$$\frac{d}{dt}\langle A\rangle = \frac{d}{dt}\langle\psi(t)|A|\psi(t)\rangle = 0 \tag{D-59}$$


Whatever the state $|\psi(t)\rangle$ of the physical system, the mean value of A in this state
does not evolve over time (hence the term “constant of the motion”).

(ii) Since A and H are two observables which commute, we can always find
for them a system of common eigenvectors, which we shall denote by $\{|\varphi_{n,p,\tau}\rangle\}$:

$$H\,|\varphi_{n,p,\tau}\rangle = E_n\,|\varphi_{n,p,\tau}\rangle, \qquad A\,|\varphi_{n,p,\tau}\rangle = a_p\,|\varphi_{n,p,\tau}\rangle \tag{D-60}$$

We shall assume for simplicity that the spectra of H and A are discrete. The
index $\tau$ fixes the eigenvalues of observables which form a C.S.C.O. with H and A.
Since the states $|\varphi_{n,p,\tau}\rangle$ are eigenstates of H, they are stationary states. If the
system is in the state $|\varphi_{n,p,\tau}\rangle$ at the initial instant, it will therefore remain there
indefinitely (to within a global phase factor). But the state $|\varphi_{n,p,\tau}\rangle$ is an eigenstate
of A as well. When A is a constant of the motion, there therefore exist stationary
states of the physical system (the states $|\varphi_{n,p,\tau}\rangle$) which always remain, for all t,
eigenstates of A with the same eigenvalue ($a_p$). The eigenvalues of A are called, for
this reason, good quantum numbers.

(iii) Finally, let us show that for an arbitrary state $|\psi(t)\rangle$, the probability
of finding the eigenvalue $a_p$, when the constant of the motion A is measured, is not
time-dependent. $|\psi(t_0)\rangle$ can always be expanded on the $\{|\varphi_{n,p,\tau}\rangle\}$ basis
introduced above:

$$|\psi(t_0)\rangle = \sum_n\sum_p\sum_\tau c_{n,p,\tau}(t_0)\,|\varphi_{n,p,\tau}\rangle \tag{D-61}$$

From this we directly deduce:

$$|\psi(t)\rangle = \sum_n\sum_p\sum_\tau c_{n,p,\tau}(t)\,|\varphi_{n,p,\tau}\rangle \tag{D-62}$$

with:

$$c_{n,p,\tau}(t) = c_{n,p,\tau}(t_0)\,e^{-iE_n(t - t_0)/\hbar} \tag{D-63}$$

According to the postulate of spectral decomposition, the probability $\mathcal{P}(a_p, t_0)$
of finding $a_p$ when A is measured at time $t_0$, on the system in the state $|\psi(t_0)\rangle$, is
equal to:

$$\mathcal{P}(a_p, t_0) = \sum_n\sum_\tau |c_{n,p,\tau}(t_0)|^2 \tag{D-64}$$

Similarly:

$$\mathcal{P}(a_p, t) = \sum_n\sum_\tau |c_{n,p,\tau}(t)|^2 \tag{D-65}$$

Now we see from (D-63) that $c_{n,p,\tau}(t)$ and $c_{n,p,\tau}(t_0)$ have the same modulus.
Therefore, $\mathcal{P}(a_p, t) = \mathcal{P}(a_p, t_0)$, which proves the property stated above.

COMMENT:

If all but one of the probabilities $\mathcal{P}(a_p, t_0)$ are zero [leaving for
example $\mathcal{P}(a_k, t_0)$ non-zero and, moreover, necessarily equal to 1], the
physical system at time $t_0$ is in an eigenstate of A with an eigenvalue of $a_k$.
Since the $\mathcal{P}(a_p, t)$ do not depend on t, the state of the system at time t
remains an eigenstate of A with an eigenvalue of $a_k$.

d. BOHR FREQUENCIES OF A SYSTEM. SELECTION RULES


Let B be an arbitrary observable of the system under consideration (it does
not necessarily commute with H). Formula (D-27) enables us to calculate the
derivative $\dfrac{d}{dt}\langle B\rangle$ of the mean value of B:

$$\frac{d}{dt}\langle B\rangle = \frac{1}{i\hbar}\langle[B, H]\rangle + \left\langle\frac{\partial B}{\partial t}\right\rangle \tag{D-66}$$

For a conservative system, we know the general form (D-54) of $|\psi(t)\rangle$. Therefore,
in this case, we can calculate explicitly $\langle\psi(t)|B|\psi(t)\rangle$ [and not merely $\dfrac{d}{dt}\langle B\rangle$].
The Hermitian conjugate expression of (D-54) is written (changing the summation
indices):

$$\langle\psi(t)| = \sum_{n'}\sum_{\tau'} c^*_{n',\tau'}(t_0)\,e^{iE_{n'}(t - t_0)/\hbar}\,\langle\varphi_{n',\tau'}| \tag{D-67}$$

We can then, in $\langle\psi(t)|B|\psi(t)\rangle$, replace $|\psi(t)\rangle$ and $\langle\psi(t)|$ by expansions (D-54)
and (D-67), respectively. Thus we obtain:

$$\langle\psi(t)|B|\psi(t)\rangle = \langle B\rangle(t)
= \sum_n\sum_\tau\sum_{n'}\sum_{\tau'} c^*_{n',\tau'}(t_0)\,c_{n,\tau}(t_0)\,\langle\varphi_{n',\tau'}|B|\varphi_{n,\tau}\rangle\,e^{i(E_{n'} - E_n)(t - t_0)/\hbar} \tag{D-68}$$

From now on, we shall assume that B does not depend explicitly on time: the matrix
elements $\langle\varphi_{n',\tau'}|B|\varphi_{n,\tau}\rangle$ are therefore constant. Formula (D-68) then shows that
the evolution of $\langle B\rangle(t)$ is described by a series of oscillating terms, whose frequencies
$\dfrac{1}{2\pi}\dfrac{|E_{n'} - E_n|}{\hbar} = \dfrac{|E_{n'} - E_n|}{h} = \nu_{n'n}$ are characteristic of the system under
consideration but independent of B and of the initial state of the system. The
frequencies $\nu_{n'n}$ are called the Bohr frequencies of the system. Thus, for an atom, the
mean values of all the atomic quantities (electric and magnetic dipole moments, etc.)
oscillate at the various Bohr frequencies of the atom. It is reasonable to imagine that
only these frequencies can be radiated or absorbed by the atom. This remark
allows us to understand intuitively the Bohr relation between the spectral frequencies
emitted or absorbed and the differences in atomic energies.

It can also be seen from (D-68) that, while the frequencies involved in the
motion of $\langle B\rangle(t)$ are independent of B, the same does not hold true for the
respective weights of these frequencies in the variation of $\langle B\rangle$. The importance of
each frequency $\nu_{n'n}$ depends on the matrix elements $\langle\varphi_{n',\tau'}|B|\varphi_{n,\tau}\rangle$. In particular,
if these matrix elements are zero for certain values of n and n', the corresponding
frequencies $\nu_{n'n}$ are absent from the expansion of $\langle B\rangle(t)$, whatever
the initial state of the system. This is the origin of the selection rules which indicate
what frequencies can be emitted or absorbed under given conditions. To establish
these rules, one must study the non-diagonal matrix elements ($n \neq n'$) of the various
atomic operators such as the electric and magnetic dipoles, etc.

Finally, the weights of the various Bohr frequencies also depend on the initial
state, via $c^*_{n',\tau'}(t_0)\,c_{n,\tau}(t_0)$. In particular, if the initial state is a stationary state of
energy $E_k$, the expansion of $|\psi(t_0)\rangle$ contains only one value of n (n = k) and
$c^*_{n',\tau'}(t_0)\,c_{n,\tau}(t_0)$ can be non-zero only for n = n' = k. In this case, $\langle B\rangle$ is not
time-dependent.
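
Formula (D-68) can be illustrated on a two-level system without degeneracy (no index $\tau$). The sketch below is only an example under stated assumptions: the function name `mean_value`, the units $\hbar = 1$, the energies and the matrix of B in the eigenbasis of H are all chosen arbitrarily for illustration.

```python
import cmath
import math

HBAR = 1.0  # assumption: natural units, hbar = 1


def mean_value(b_matrix, coeffs0, energies, t, t0=0.0):
    """Evaluate (D-68) for non-degenerate levels.

    b_matrix[m][n] stands for the matrix element <phi_m|B|phi_n>,
    coeffs0[n] for c_n(t0), and energies[n] for E_n.
    """
    total = 0.0 + 0.0j
    for m, (c_m, e_m) in enumerate(zip(coeffs0, energies)):
        for n, (c_n, e_n) in enumerate(zip(coeffs0, energies)):
            phase = cmath.exp(1j * (e_m - e_n) * (t - t0) / HBAR)
            total += c_m.conjugate() * c_n * b_matrix[m][n] * phase
    return total.real  # B Hermitian, so the sum is real


if __name__ == "__main__":
    energies = [0.0, 2.0]             # illustrative E_1, E_2
    b = [[0.0, 1.0], [1.0, 0.0]]      # purely non-diagonal Hermitian B
    s = 1.0 / math.sqrt(2.0)
    superposed = [s, s]
    stationary = [1.0, 0.0]
    for t in (0.0, math.pi / 4, math.pi / 2, math.pi):
        print(t, mean_value(b, superposed, energies, t),
              mean_value(b, stationary, energies, t))
```

With these values the superposition gives $\langle B\rangle(t) = \cos[(E_2 - E_1)(t - t_0)/\hbar]$, an oscillation at the single Bohr frequency $|E_2 - E_1|/h$, while the stationary initial state gives a constant mean value.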


comment : 

It can be directly verified, using (D-68), that the mean value of a constant
of the motion is always time-independent. We see that if B commutes with H,
the matrix elements of B are zero between two eigenstates of H which correspond
to different eigenvalues (cf. chap. II, §D-3-a). It follows that
$\langle\varphi_{n',\tau'}|B|\varphi_{n,\tau}\rangle$ is zero for $n' \neq n$. The only terms of $\langle B\rangle(t)$ which are non-zero
are thus constant.


e. THE TIME-ENERGY UNCERTAINTY RELATION 

We shall now see that for a conservative system, the greater the energy
uncertainty, the more rapid the time evolution. More precisely, if $\Delta t$ is a time
interval at the end of which the system has evolved to an appreciable extent,
and if $\Delta E$ denotes the energy uncertainty, $\Delta t$ and $\Delta E$ satisfy the relation:

$$\Delta t \cdot \Delta E \gtrsim \hbar \tag{D-69}$$

First, if the system is in an eigenstate of H, its energy is perfectly well-defined:
$\Delta E = 0$. But we have seen that such a state is stationary; that is, it does
not evolve. It can be said that, in this case, the evolution time $\Delta t$ is, in a sense,
infinite [relation (D-69) indicates that when $\Delta E = 0$, $\Delta t$ must be infinite].

Now let us assume that $|\psi(t_0)\rangle$ is a linear superposition of two eigenstates
of H, $|\varphi_1\rangle$ and $|\varphi_2\rangle$, with different eigenvalues $E_1$ and $E_2$:

$$|\psi(t_0)\rangle = c_1\,|\varphi_1\rangle + c_2\,|\varphi_2\rangle \tag{D-70}$$

Then:

$$|\psi(t)\rangle = c_1\,e^{-iE_1(t - t_0)/\hbar}\,|\varphi_1\rangle + c_2\,e^{-iE_2(t - t_0)/\hbar}\,|\varphi_2\rangle \tag{D-71}$$

If we measure the energy, we find either $E_1$ or $E_2$. The uncertainty of E is therefore
of the order of:

$$\Delta E \simeq |E_2 - E_1| \tag{D-72}$$

Now consider an arbitrary observable B which does not commute with H. The
probability of finding, in a measurement of B at time t, the eigenvalue $b_m$ associated
with the eigenvector $|u_m\rangle$ (we assume, for simplicity, $b_m$ to be non-degenerate) is
given by:

$$\mathcal{P}(b_m, t) = |\langle u_m|\psi(t)\rangle|^2 = |c_1|^2\,|\langle u_m|\varphi_1\rangle|^2 + |c_2|^2\,|\langle u_m|\varphi_2\rangle|^2
+ 2\operatorname{Re}\left[c_2^*\,c_1\,e^{i(E_2 - E_1)(t - t_0)/\hbar}\,\langle u_m|\varphi_2\rangle^*\,\langle u_m|\varphi_1\rangle\right] \tag{D-73}$$

This equation shows that $\mathcal{P}(b_m, t)$ oscillates between two extreme values, with the
Bohr frequency $\nu_{21} = \dfrac{|E_2 - E_1|}{h}$. The characteristic evolution time of the system is
therefore:

$$\Delta t \simeq \frac{h}{|E_2 - E_1|} \tag{D-74}$$

and comparison with (D-72) shows that: $\Delta E \cdot \Delta t \simeq h$.

Let us now assume that the spectrum of H is continuous (and non-degenerate).
The most general state $|\psi(t_0)\rangle$ can be written:

$$|\psi(t_0)\rangle = \int dE\;c(E)\,|\varphi_E\rangle \tag{D-75}$$

where $|\varphi_E\rangle$ is the eigenstate of H with the eigenvalue E. Let us assume that $|c(E)|^2$
has non-negligible values only in a domain of width $\Delta E$ about $E_0$ (fig. 4). $\Delta E$ then
represents the uncertainty of the energy of the system.

FIGURE 4

By superposing the stationary states $|\varphi_E\rangle$
with the coefficients $c(E)$, we obtain a
state $|\psi\rangle$ of the system where the energy
is not perfectly well-defined. The corresponding
uncertainty $\Delta E$ is given by the
width of the curve which represents $|c(E)|^2$.
According to the fourth uncertainty relation,
the evolution of the state $|\psi(t)\rangle$ will
be significant after a time $\Delta t$ such that
$\Delta E \cdot \Delta t \gtrsim \hbar$.

$|\psi(t)\rangle$ is obtained by using (D-55):

$$|\psi(t)\rangle = \int dE\;c(E)\,e^{-iE(t - t_0)/\hbar}\,|\varphi_E\rangle \tag{D-76}$$

The quantity $\mathcal{P}(b_m, t)$ introduced above, which represents the probability of finding
the eigenvalue $b_m$ when the observable B is measured on the system in the
state $|\psi(t)\rangle$, is here equal to:

$$\mathcal{P}(b_m, t) = |\langle u_m|\psi(t)\rangle|^2 = \left|\int dE\;c(E)\,e^{-iE(t - t_0)/\hbar}\,\langle u_m|\varphi_E\rangle\right|^2 \tag{D-77}$$

In general, $\langle u_m|\varphi_E\rangle$ does not vary rapidly with E when E varies about $E_0$. If $\Delta E$ is
sufficiently small, the variation of $\langle u_m|\varphi_E\rangle$, in integral (D-77), can be neglected
relative to that of $c(E)$. One can then replace $\langle u_m|\varphi_E\rangle$ by $\langle u_m|\varphi_{E_0}\rangle$ and take this
quantity outside integral (D-77):

$$\mathcal{P}(b_m, t) \simeq |\langle u_m|\varphi_{E_0}\rangle|^2\,\left|\int dE\;c(E)\,e^{-iE(t - t_0)/\hbar}\right|^2 \tag{D-78}$$

If this approximation is valid, we thus see that $\mathcal{P}(b_m, t)$ is, to within a coefficient, the
square of the modulus of the Fourier transform of $c(E)$. According to the properties
of the Fourier transform (cf. appendix I, §2-b), the width in t of $\mathcal{P}(b_m, t)$, that
is, $\Delta t$, is therefore related to the width $\Delta E$ of $|c(E)|^2$ by relation (D-69).


comment : 

(D-69) can be established directly for a free one-dimensional wave packet. One
can associate with the momentum uncertainty $\Delta p$ of this wave packet an energy
uncertainty $\Delta E = \dfrac{dE}{dp}\Delta p$. Since $E = \hbar\omega$ and $p = \hbar k$, we have $\dfrac{dE}{dp} = \dfrac{d\omega}{dk} = v_G$, where $v_G$
is the group velocity of the wave packet (chap. I, §C-4). Consequently:

$$\Delta E = v_G\,\Delta p \tag{D-79}$$

Now the characteristic evolution time $\Delta t$ is the time taken by this wave packet, travelling
at the velocity $v_G$, to “pass” a point in space. If $\Delta x$ is the spatial extension of the wave
packet, we therefore have:

$$\Delta t \simeq \frac{\Delta x}{v_G} \tag{D-80}$$

From this we deduce, combining (D-79) and (D-80):

$$\Delta E \cdot \Delta t \simeq \Delta x \cdot \Delta p \gtrsim \hbar \tag{D-81}$$

Relation (D-69) is often called the fourth Heisenberg uncertainty relation.
It is clearly different, however, from the other three uncertainty relations which
relate to the three components of $\mathbf{R}$ and $\mathbf{P}$ [formulas (14) of complement F_I].
In (D-69), only the energy is a physical quantity like $\mathbf{R}$ and $\mathbf{P}$; t, on the other hand,
is a parameter, with which no quantum mechanical operator is associated.


E. THE SUPERPOSITION PRINCIPLE 
AND PHYSICAL PREDICTIONS 


The physical meaning of the first postulate remains to be examined. According to 
this postulate, the states of a physical system belong to a vector space and are, 
consequently, linearly superposable. 

One of the important consequences of the first postulate, when it is combined
with the others, is the appearance of interference effects such as those which led us
to wave-particle duality (chap. I). Our understanding of these phenomena is based
on the concept of probability amplitudes, which we shall examine here with the aid
of some simple examples.

1. Probability amplitudes and interference effects

a. THE PHYSICAL MEANING OF A LINEAR SUPERPOSITION OF STATES 

α. The difference between a linear superposition and a statistical mixture

Let $|\psi_1\rangle$ and $|\psi_2\rangle$ be two orthogonal normalized states:

$$\langle\psi_1|\psi_1\rangle = \langle\psi_2|\psi_2\rangle = 1, \qquad \langle\psi_1|\psi_2\rangle = 0 \tag{E-1}$$

($|\psi_1\rangle$ and $|\psi_2\rangle$ could be, for example, two eigenstates of the same observable B
associated with two different eigenvalues $b_1$ and $b_2$).

If the system is in the state $|\psi_1\rangle$, we can calculate all the probabilities
concerning the measurement results for a given observable A. For example, if
$|u_n\rangle$ is the (normalized) eigenvector of A which corresponds to the eigenvalue $a_n$
(assumed to be non-degenerate), the probability $\mathcal{P}_1(a_n)$ of finding $a_n$ when A is
measured on the system in the state $|\psi_1\rangle$ is:

$$\mathcal{P}_1(a_n) = |\langle u_n|\psi_1\rangle|^2 \tag{E-2}$$

An analogous quantity $\mathcal{P}_2(a_n)$ can be defined for the state $|\psi_2\rangle$:

$$\mathcal{P}_2(a_n) = |\langle u_n|\psi_2\rangle|^2 \tag{E-3}$$

Now consider a normalized state $|\psi\rangle$ which is a linear superposition
of $|\psi_1\rangle$ and $|\psi_2\rangle$:

$$|\psi\rangle = \lambda_1\,|\psi_1\rangle + \lambda_2\,|\psi_2\rangle, \qquad |\lambda_1|^2 + |\lambda_2|^2 = 1 \tag{E-4}$$

It is often said that, when the system is in the state $|\psi\rangle$, one has a probability
$|\lambda_1|^2$ of finding it in the state $|\psi_1\rangle$ and a probability $|\lambda_2|^2$ of finding it in the
state $|\psi_2\rangle$. The exact meaning of this manner of speaking is the following:
if $|\psi_1\rangle$ and $|\psi_2\rangle$ are two eigenvectors (here assumed to be normalized) of
the observable B corresponding to different eigenvalues $b_1$ and $b_2$, the probability of
finding $b_1$ when B is measured is $|\lambda_1|^2$ and that of finding $b_2$ is $|\lambda_2|^2$.

This could lead us to believe (wrongly, as we shall see), that a state such
as (E-4) is a statistical mixture of the states $|\psi_1\rangle$ and $|\psi_2\rangle$ with the weights
$|\lambda_1|^2$ and $|\lambda_2|^2$. In other words, if we consider a large number N of identical systems,
all in the state (E-4), we might imagine that this set of N systems in the state $|\psi\rangle$
was completely equivalent to another set composed of $N|\lambda_1|^2$ systems in the state
$|\psi_1\rangle$ and $N|\lambda_2|^2$ systems in the state $|\psi_2\rangle$. Such an interpretation of the state
$|\psi\rangle$ is erroneous and leads to inaccurate physical predictions as we shall see.

Assume that we are actually trying to calculate the probability a n ) 
of finding the eigenvalue a„ when the observable A is measured on the system in 
the state 1 1 {/ > given by (E-4). If we interpret the state | ^ > as being a statistical 
mixture of the states | \j/ l ) and | i // 2 ) with the weights |A,| 2 and U 2 | 2 , then we 
can obtain ^(a n ) by taking the weighted sum of the probabilities 0 > ,(a„) and 
0 > 2 (a n ) calculated above [formulas (E-2) and (E-3)]: 

0>(a n )l UJ 2 & l (a n )+ U 2 I 2 ^ 2 («J (E-5) 
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Actually, the postulates of quantum mechanics unambiguously indicate how to 
calculate a n ). The correct expression for this probability is: 

&{a n ) = |< u n I ^ >| 2 ( E ' 6 ) 

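$\mathscr{P}(a_n)$ is therefore the square of the modulus of the probability amplitude
$\langle u_n | \psi \rangle$. We see from (E-4) that this amplitude is the sum of two terms:

$$\langle u_n | \psi \rangle = \lambda_1 \langle u_n | \psi_1 \rangle + \lambda_2 \langle u_n | \psi_2 \rangle \tag{E-7}$$

Thus we obtain:

$$\begin{aligned}
\mathscr{P}(a_n) &= |\lambda_1 \langle u_n | \psi_1 \rangle + \lambda_2 \langle u_n | \psi_2 \rangle|^2 \\
&= |\lambda_1|^2 |\langle u_n | \psi_1 \rangle|^2 + |\lambda_2|^2 |\langle u_n | \psi_2 \rangle|^2 \\
&\quad + 2\,\mathrm{Re}\{ \lambda_1 \lambda_2^* \langle u_n | \psi_1 \rangle \langle u_n | \psi_2 \rangle^* \}
\end{aligned} \tag{E-8}$$

Taking (E-2) and (E-3) into account, we find that the correct expression for
$\mathscr{P}(a_n)$ is therefore written:

$$\mathscr{P}(a_n) = |\lambda_1|^2 \mathscr{P}_1(a_n) + |\lambda_2|^2 \mathscr{P}_2(a_n) + 2\,\mathrm{Re}\{ \lambda_1 \lambda_2^* \langle u_n | \psi_1 \rangle \langle u_n | \psi_2 \rangle^* \} \tag{E-9}$$

This result is different from that of formula (E-5).

It is therefore wrong to consider $|\psi\rangle$ to be a statistical mixture of states.
Such an interpretation eliminates all the interference effects contained in the double
product of formula (E-9). We see that it is not only the moduli of $\lambda_1$ and $\lambda_2$
which play a role; the relative phase* of $\lambda_1$ and $\lambda_2$ is just as important, since
it enters explicitly, through the intermediary of $\lambda_1 \lambda_2^*$, into the physical predictions.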
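The comparison between the weighted sum of probabilities and the squared modulus of the summed amplitudes can be checked numerically. The following is only a minimal sketch: the two-dimensional space, the particular vectors chosen for the two superposed states and for the eigenvector of $A$, and all function names are assumptions made for the illustration.

```python
import cmath
import math

def inner(bra, ket):
    """Scalar product <bra|ket> of two vectors given as lists of components."""
    return sum(complex(b).conjugate() * k for b, k in zip(bra, ket))

# Illustrative choice (an assumption): |psi_1> and |psi_2> are the two basis
# vectors of a two-dimensional space, and |u_n> is their normalized sum.
psi1 = [1 + 0j, 0j]
psi2 = [0j, 1 + 0j]
u_n = [1 / math.sqrt(2), 1 / math.sqrt(2)]

def mixture_probability(lam1, lam2):
    """Weighted sum of probabilities, i.e. the incorrect formula (E-5)."""
    p1 = abs(inner(u_n, psi1)) ** 2          # (E-2)
    p2 = abs(inner(u_n, psi2)) ** 2          # (E-3)
    return abs(lam1) ** 2 * p1 + abs(lam2) ** 2 * p2

def superposition_probability(lam1, lam2):
    """Squared modulus of the summed amplitudes, formulas (E-6) and (E-7)."""
    amplitude = lam1 * inner(u_n, psi1) + lam2 * inner(u_n, psi2)
    return abs(amplitude) ** 2

def interference_term(lam1, lam2):
    """Double product 2 Re{lam1 lam2* <u_n|psi_1> <u_n|psi_2>*} of (E-9)."""
    product = (lam1 * complex(lam2).conjugate()
               * inner(u_n, psi1) * inner(u_n, psi2).conjugate())
    return 2 * product.real

# Same moduli |lam1| = |lam2|, three different relative phases.
for phase in (0.0, math.pi / 2, math.pi):
    lam1 = 1 / math.sqrt(2) + 0j
    lam2 = cmath.exp(1j * phase) / math.sqrt(2)
    print(phase,
          mixture_probability(lam1, lam2),
          superposition_probability(lam1, lam2),
          interference_term(lam1, lam2))
```

With this choice the weighted sum does not depend on the relative phase, whereas the correct probability does; the difference between the two is exactly the interference term.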


β. A concrete example

Consider photons propagating along $Oz$ whose polarization state is represented
by the unit vector (fig. 5):

$$\mathbf{e} = \frac{1}{\sqrt{2}} (\mathbf{e}_x + \mathbf{e}_y) \tag{E-10}$$

This state is a linear superposition of two orthogonal polarization states $\mathbf{e}_x$ and $\mathbf{e}_y$.
It represents light which is linearly polarized at an angle of 45° with respect to
$\mathbf{e}_x$ and $\mathbf{e}_y$. It would be absurd to assume that $N$ photons in the state $\mathbf{e}$ are
equivalent to $N \times \left| \frac{1}{\sqrt{2}} \right|^2 = \frac{N}{2}$ photons in the state $\mathbf{e}_x$
and $N \times \left| \frac{1}{\sqrt{2}} \right|^2 = \frac{N}{2}$ photons in the state $\mathbf{e}_y$.
If we place in the beam’s trajectory an analyzer whose
axis $\mathbf{e}'$ is perpendicular to $\mathbf{e}$, we know that none of the $N$ photons in the
state $\mathbf{e}$ will pass through this analyzer. But, for the statistical mixture
$\left\{ \frac{N}{2} \text{ photons in the state } \mathbf{e}_x,\ \frac{N}{2} \text{ photons in the state } \mathbf{e}_y \right\}$,
half the photons will pass through the analyzer.


* Multiplying $|\psi\rangle$ by a global phase factor $e^{i\theta}$ is equivalent to changing $\lambda_1$ and $\lambda_2$ to $\lambda_1 e^{i\theta}$ and
$\lambda_2 e^{i\theta}$. It can be verified from (E-9) that such an operation does not modify the physical predictions, which
depend only on $|\lambda_1|^2$, $|\lambda_2|^2$ and $\lambda_1 \lambda_2^*$.

FIGURE 5

A simple experiment which illustrates the difference between a linear superposition and a
statistical mixture of states. If all the incident photons are in the polarization state
$\mathbf{e} = \frac{1}{\sqrt{2}} (\mathbf{e}_x + \mathbf{e}_y)$,
none of them will pass through an analyzer whose axis $\mathbf{e}'$ is perpendicular to $\mathbf{e}$. If we had,
on the contrary, a statistical mixture of photons polarized either along $\mathbf{e}_x$ or along $\mathbf{e}_y$ (in
equal proportions; i.e., natural light), half of them would pass through the analyzer.


In this concrete example, it is clear that a linear superposition such as (E-10),
associated with light polarized at an angle of 45° with respect to $\mathbf{e}_x$ and $\mathbf{e}_y$,
is physically different from a statistical mixture of equal proportions of the states
$\mathbf{e}_x$ and $\mathbf{e}_y$ associated with natural light (an unpolarized beam).
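The analyzer argument can be reproduced with a few lines of arithmetic. This is a minimal sketch, assuming an ideal linear analyzer whose transmission probability is the squared modulus of the projection of the polarization vector onto its axis; the function name and the photon number are illustrative.

```python
import math

def transmission(polarization, axis):
    """Probability that a photon with the given polarization vector passes
    an ideal linear analyzer whose (real, unit) axis is `axis`."""
    amplitude = axis[0] * polarization[0] + axis[1] * polarization[1]
    return abs(amplitude) ** 2

s = 1 / math.sqrt(2)
e_x, e_y = (1.0, 0.0), (0.0, 1.0)
e = (s, s)             # the superposition (E-10)
e_prime = (-s, s)      # analyzer axis e', perpendicular to e

N = 1000               # number of incident photons (illustrative)

# All N photons in the state e: a single amplitude, then squared.
passed_superposition = N * transmission(e, e_prime)

# Statistical mixture: N/2 photons along e_x and N/2 along e_y,
# each sub-ensemble treated separately and the results added.
passed_mixture = (N / 2) * transmission(e_x, e_prime) \
               + (N / 2) * transmission(e_y, e_prime)

print(passed_superposition, passed_mixture)
```

The first number vanishes, while the second is half the incident photons, in agreement with the discussion above.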

We can also understand the importance of the relative phase of the
expansion coefficients of the state vector by considering the four states:

$$\mathbf{e}_1 = \frac{1}{\sqrt{2}} (\mathbf{e}_x + \mathbf{e}_y) \tag{E-11}$$

$$\mathbf{e}_2 = \frac{1}{\sqrt{2}} (\mathbf{e}_x - \mathbf{e}_y) \tag{E-12}$$

$$\mathbf{e}_3 = \frac{1}{\sqrt{2}} (\mathbf{e}_x + i\,\mathbf{e}_y) \tag{E-13}$$

$$\mathbf{e}_4 = \frac{1}{\sqrt{2}} (\mathbf{e}_x - i\,\mathbf{e}_y) \tag{E-14}$$

which differ only by the relative phase of the coefficients (this phase being
equal to $0$, $\pi$, $+\frac{\pi}{2}$ and $-\frac{\pi}{2}$, respectively). These four states are physically quite
different: the first two represent light which is polarized linearly along the
bisectors of $(\mathbf{e}_x, \mathbf{e}_y)$; the second two represent circularly polarized light (right and
left respectively).
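A short numerical check makes the role of the relative phase visible. The sketch below assumes an ideal linear analyzer with a real unit axis (the helper name and the choice of the two analyzer orientations are illustrative) and computes, for each of the four states, the probability of passing an analyzer along $\mathbf{e}_x$ and one along the first bisector.

```python
import math

s = 1 / math.sqrt(2)

# The four polarization vectors (E-11) to (E-14), as (x, y) components.
states = {
    "e_1": (s, s),
    "e_2": (s, -s),
    "e_3": (s, 1j * s),
    "e_4": (s, -1j * s),
}

def transmission(polarization, axis):
    """Probability of passing an ideal linear analyzer with real unit axis."""
    amplitude = axis[0] * polarization[0] + axis[1] * polarization[1]
    return abs(amplitude) ** 2

along_x = (1.0, 0.0)
along_bisector = (s, s)

for name, vector in states.items():
    print(name,
          round(transmission(vector, along_x), 3),
          round(transmission(vector, along_bisector), 3))
```

An analyzer along $\mathbf{e}_x$ is sensitive only to the modulus of the first coefficient and cannot tell the four states apart; the analyzer along the bisector separates the two linear polarizations from each other and from the circular ones.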


b. SUMMATION OVER THE INTERMEDIATE STATES 

α. Prediction of measurement results in two simple experiments

(i) Experiment 1. Assume that the observable $A$ has been measured, at a
given time, on a physical system, and that the non-degenerate eigenvalue $a$ has
been found. If $|u_a\rangle$ is the eigenvector associated with $a$, the physical system,
immediately after the measurement, is in the state $|u_a\rangle$.

Before the system has had time to evolve, we measure another observable $C$
which does not commute with $A$. Using the notation introduced in
§C-6-a, we denote by $\mathscr{P}_a(c)$ the probability that this second measurement will yield
the result $c$. Immediately before the measurement of $C$, the system is in the
state $|u_a\rangle$. Therefore, if $|v_c\rangle$ is the eigenvector of $C$ associated with the eigenvalue
$c$ (assumed to be non-degenerate), the postulates of quantum mechanics lead
to:

$$\mathscr{P}_a(c) = |\langle v_c | u_a \rangle|^2 \tag{E-15}$$

(ii) Experiment 2. We now imagine another experiment, in which we measure
successively and very rapidly three observables $A$, $B$, $C$, which do not commute
with each other (the time separating two measurements is too short for the system
to evolve). Denote by $\mathscr{P}_a(b, c)$ the probability, given that the result of the first
measurement is $a$, that the results of the second and third will be $b$ and $c$ respectively.
$\mathscr{P}_a(b, c)$ is equal to the product of $\mathscr{P}_a(b)$ (the probability that, the measurement
of $A$ having yielded $a$, that of $B$ will yield $b$) and $\mathscr{P}_b(c)$ (the probability that,
the measurement of $B$ having yielded $b$, that of $C$ will yield $c$):

$$\mathscr{P}_a(b, c) = \mathscr{P}_a(b)\, \mathscr{P}_b(c) \tag{E-16}$$

If all the eigenvalues of $B$ are assumed to be non-degenerate and if $|w_b\rangle$
denotes the corresponding eigenvectors, it follows that [using for $\mathscr{P}_a(b)$ and
$\mathscr{P}_b(c)$ formulas analogous to (E-15)]:

$$\mathscr{P}_a(b, c) = |\langle v_c | w_b \rangle|^2\, |\langle w_b | u_a \rangle|^2 \tag{E-17}$$

β. The fundamental difference between these two experiments

In both of these experiments, the state of the system after the measurement
of the observable $A$ is $|u_a\rangle$ (the role of this measurement being to fix this
initial state). It then becomes $|v_c\rangle$ after the last measurement, that of the
observable $C$ (for this reason, $|v_c\rangle$ will be called the “final state”). It is possible
in both cases to decompose the state of the system just before the measurement
of $C$ in terms of the eigenvectors $|w_b\rangle$ of $B$, and to say that between the state
$|u_a\rangle$ and the state $|v_c\rangle$, the system “can pass” through several different
“intermediate states” $|w_b\rangle$. Each of these intermediate states defines a possible
“path” between the initial state $|u_a\rangle$ and the final state $|v_c\rangle$ (fig. 6).



FIGURE 6

Different possible “paths” for the state vector of the system when the system is
allowed to evolve freely (without undergoing any measurement) between the
initial state $|u_a\rangle$ and the final state $|v_c\rangle$. In this case, we must add together
the probability amplitudes associated with these different paths, and not the
probabilities.

The difference between the two experiments described above is the following.
In the first one, the path that the system has taken between the state $|u_a\rangle$ and
the state $|v_c\rangle$ is not determined experimentally [we measure only the probability
$\mathscr{P}_a(c)$ that, starting from $|u_a\rangle$, it ends up in $|v_c\rangle$]. On the other hand, in the
second experiment, this path is determined, by measuring the observable $B$
[thus enabling us to obtain the probability $\mathscr{P}_a(b, c)$ that the system, starting from
$|u_a\rangle$, passes through a given intermediate state $|w_b\rangle$ and ends up finally in
$|v_c\rangle$].

We could then be tempted, in order to relate $\mathscr{P}_a(c)$ to $\mathscr{P}_a(b, c)$, to use the following
argument: in experiment 1, the system is “free to pass” through all intermediate
states $|w_b\rangle$; it would then seem that the global probability $\mathscr{P}_a(c)$ should be
equal to the sum of all the probabilities $\mathscr{P}_a(b, c)$ associated with each of the possible
“paths”. Can we not then write:

$$\mathscr{P}_a(c) \stackrel{?}{=} \sum_b \mathscr{P}_a(b, c) \tag{E-18}$$

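As we shall see, this formula is wrong. Let us go back to the exact formula
(E-15) for $\mathscr{P}_a(c)$; this formula brings in the probability amplitude $\langle v_c | u_a \rangle$ which
we can write, using the closure relation for the states $|w_b\rangle$:

$$\langle v_c | u_a \rangle = \sum_b \langle v_c | w_b \rangle \langle w_b | u_a \rangle \tag{E-19}$$

Substitute this expression into (E-15):

$$\begin{aligned}
\mathscr{P}_a(c) &= \Big| \sum_b \langle v_c | w_b \rangle \langle w_b | u_a \rangle \Big|^2 \\
&= \sum_b |\langle v_c | w_b \rangle|^2\, |\langle w_b | u_a \rangle|^2 \\
&\quad + \sum_b \sum_{b' \neq b} \langle v_c | w_b \rangle \langle w_b | u_a \rangle \langle v_c | w_{b'} \rangle^* \langle w_{b'} | u_a \rangle^*
\end{aligned} \tag{E-20}$$

Using (E-17), we therefore obtain:

$$\mathscr{P}_a(c) = \sum_b \mathscr{P}_a(b, c) + \sum_b \sum_{b' \neq b} \langle v_c | w_b \rangle \langle w_b | u_a \rangle \langle v_c | w_{b'} \rangle^* \langle w_{b'} | u_a \rangle^* \tag{E-21}$$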

This equation enables us to understand why formula (E-18) is wrong: all the
“cross terms” which appear in the square of the modulus of sum (E-19) are
absent in (E-18). All the interference effects between the different possible paths
are thus missing in (E-18).

If, therefore, we want to establish a relation between these two experiments,
we see that it is necessary to reason in terms of probability amplitudes.
When the intermediate states of the system are not determined experimentally,
it is the probability amplitudes, and not the probabilities, which must be summed.
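The two ways of combining the paths can be compared on a small example. The following sketch assumes a two-dimensional state space and a particular choice of the three bases (all of which are illustrative, as are the variable names); it evaluates the sum of amplitudes of experiment 1, the sum of probabilities of experiment 2, and the cross terms that separate them.

```python
import math

def inner(bra, ket):
    """Scalar product <bra|ket> of two vectors given as tuples of components."""
    return sum(complex(b).conjugate() * k for b, k in zip(bra, ket))

s = 1 / math.sqrt(2)
u_a = (1.0, 0.0)              # initial state fixed by the measurement of A
v_c = (0.0, 1.0)              # final state selected by the measurement of C
w = [(s, s), (s, -s)]         # orthonormal eigenvectors |w_b> of B

# Experiment 1: B is not measured. The amplitudes of the paths are added
# (closure relation, E-19) and the sum is squared (E-15).
amplitude = sum(inner(v_c, w_b) * inner(w_b, u_a) for w_b in w)
p_experiment_1 = abs(amplitude) ** 2

# Experiment 2: B is measured. Each path has the probability (E-17),
# and the probabilities of the different paths are added.
p_paths = [abs(inner(v_c, w_b)) ** 2 * abs(inner(w_b, u_a)) ** 2 for w_b in w]
p_experiment_2 = sum(p_paths)

# Cross terms of (E-21), summed over b and b' different from b.
cross_terms = sum(
    inner(v_c, w[b]) * inner(w[b], u_a)
    * (inner(v_c, w[bp]) * inner(w[bp], u_a)).conjugate()
    for b in range(len(w)) for bp in range(len(w)) if bp != b
).real

print(p_experiment_1, p_experiment_2, cross_terms)
```

In this example the initial and final states are orthogonal, so the direct probability vanishes although each individual path has a non-zero probability: the cross terms cancel the sum of the path probabilities exactly.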

The error in the reasoning which led to the wrong relation (E-18) is
obvious, moreover, if we remember the fifth postulate (reduction of the wave
packet). In the second experiment, the measurement of the observable $B$ must,
in fact, involve a perturbation of the system under study: during the measurement
its state vector undergoes an abrupt change (projection onto one of the states
$|w_b\rangle$). It is this unavoidable perturbation which is responsible for the disappearance
of interference effects. In the first experiment, on the other hand,
it is incorrect to say that the physical system “passes through one or another
of the states $|w_b\rangle$”; it would be more accurate to say that it passes through
all the states $|w_b\rangle$.

comments: 

(i) The preceding discussion resembles in every respect that of §A-2-a of
chapter I concerning Young’s double-slit experiment. To determine the
probability that a photon emitted by the source will arrive at a given point $M$
of the screen, one must first calculate the total electric field at $M$. In this
problem, the electric field plays the role of a probability amplitude. When
one is not trying to determine through which slit the photon passes,
it is the electric fields radiated by the two slits, and not their intensities,
that must be added together to obtain the total field at $M$ (whose square
yields the desired probability). In other words, the field radiated by one of
the slits at the point $M$ represents the amplitude for a photon, emitted
by the source, to pass through this slit before arriving at $M$.

(ii) It is not necessary to retain the assumption that the measurements of $A$
and $C$ in experiment 1 and of $A$, $B$, $C$ in experiment 2 are performed
very close together in time. If the system has had time to evolve between
two of these measurements, we can use the Schrödinger equation to
determine the modification of the state of the system due to this evolution
[cf. complement F$_{\mathrm{III}}$, comment (ii) of §2].


c. CONCLUSION: THE IMPORTANCE OF THE CONCEPT OF PROBABILITY AMPLITUDES

The two examples studied in §§a and b demonstrate the importance of the 
concept of probability amplitudes. Formulas (E-5) and (E-18), as well as the 
arguments which lead to them, are incorrect since they represent an attempt to 
calculate a probability directly without first considering the corresponding proba¬ 
bility amplitude. In both cases, the correct expression (E-8) or (E-20) has the 
form of a square of a sum (more precisely, the square of the modulus of this sum), 
while the incorrect formula (E-5) or (E-18) contains only a sum of squares (all 
the cross terms, responsible for interference effects, being omitted). 

From the preceding discussion, we shall therefore retain the following ideas:


(i) The probabilistic predictions of quantum theory are always obtained by 
squaring the modulus of a probability amplitude. 

(ii) When, in a particular experiment, no measurement is made at an intermediate
stage, one must never reason in terms of the probabilities of the various
results that could have been obtained in such a measurement, but rather in terms
of their probability amplitudes.

(iii) The fact that the states of a physical system are linearly superposable means
that a probability amplitude often presents the form of a sum of partial amplitudes.
The corresponding probability is then equal to the square of the modulus of a
sum of terms, and the various partial amplitudes interfere with each other.


2. Case in which several states can be associated with the same measurement result

In the preceding section, we stressed and illustrated the fact that, in 
certain cases, the probability of an event is given by the postulates of quantum 
mechanics in the form of a square of a sum of terms (more precisely, the square 
of the modulus of such a sum). Now the statement of the fourth postulate 
[formula (B-7)] involves a sum of squares (the sum of the squares of the moduli) 
when the measurement result whose probability is sought is associated with a 
degenerate eigenvalue. It is important to understand that these two rules are not 
contradictory but, on the contrary, complementary: each term of the sum of 
squares (B-7) can itself be the square of a sum. This is the first point on which 
we shall focus our attention in this section. Furthermore, this discussion will 
enable us to complete the statement of the postulates: we shall consider measurement 
devices whose accuracy is limited (as is always, of course, the case) and see how
to predict theoretically the possible results. Finally, we shall extend to the case 
of continuous spectra the fifth postulate of reduction of the wave packet. 


a. DEGENERATE EIGENVALUES 

In the examples treated in §E-1, we always assumed that the results
of the various measurements envisaged were simple, i.e. non-degenerate, eigenvalues
of the corresponding observables. This hypothesis was intended to simplify
these examples so that the origin of the interference effects appeared as clearly
as possible.

Now let us consider a degenerate eigenvalue $a_n$ of an observable $A$. The
eigenstates associated with $a_n$ form a vector subspace of dimension $g_n$, in which an
orthonormal basis $\{ |u_n^i\rangle;\ i = 1, 2, \ldots, g_n \}$ can be chosen.

The discussion of §C-6-b shows that knowing a measurement of $A$ has
yielded $a_n$ is not sufficient to determine the state of the physical system after this
measurement. We shall say that several final states can be associated with the same
result $a_n$: if the initial state (the state before the measurement) is given, the final
state after the measurement is perfectly well-defined; but if the initial state is
changed, the final state is, in general, different (for the same measurement result $a_n$).
All final states associated with $a_n$ are linear combinations of the $g_n$ orthonormal
vectors $|u_n^i\rangle$, with $i = 1, 2, \ldots, g_n$.

Formula (B-7) indicates unambiguously how to find the probability $\mathscr{P}(a_n)$
that a measurement of $A$ on a system in the state $|\psi\rangle$ will yield the result $a_n$. One
chooses an orthonormal basis, for example $\{ |u_n^i\rangle;\ i = 1, 2, \ldots, g_n \}$, in the
eigensubspace which corresponds to $a_n$; one calculates the probability $|\langle u_n^i | \psi \rangle|^2$ of
finding the system in each of the states of this basis; $\mathscr{P}(a_n)$ is then the sum of these $g_n$
probabilities. However, it must not be forgotten that each probability $|\langle u_n^i | \psi \rangle|^2$
can be the square of the modulus of a sum of terms. Consider, for example, the
case envisaged in §E-1-a-α and assume now that the eigenvalue $a_n$ of the
observable $A$, whose probability $\mathscr{P}(a_n)$ is to be calculated, is $g_n$-fold degenerate.
Formula (E-6) is then replaced by:

$$\mathscr{P}(a_n) = \sum_{i=1}^{g_n} |\langle u_n^i | \psi \rangle|^2 \tag{E-22}$$

with:

$$\langle u_n^i | \psi \rangle = \lambda_1 \langle u_n^i | \psi_1 \rangle + \lambda_2 \langle u_n^i | \psi_2 \rangle \tag{E-23}$$


The discussion of §E-1-a-α remains valid for each of the terms of formula (E-22):
$|\langle u_n^i | \psi \rangle|^2$, which is obtained from (E-23), is the square of a sum; $\mathscr{P}(a_n)$ is then
the sum of these squares. §E-1-b can similarly be generalized to the case where
the eigenvalues of the observables measured are degenerate.
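The structure "sum of squares, each of which is the square of a sum" can be made explicit on a small example. The sketch below assumes a three-dimensional state space in which the eigenvalue considered is twofold degenerate; the vectors and the function names are illustrative choices.

```python
import math

def inner(bra, ket):
    """Scalar product <bra|ket> of two vectors given as tuples of components."""
    return sum(complex(b).conjugate() * k for b, k in zip(bra, ket))

s = 1 / math.sqrt(2)

# Orthonormal basis of the eigensubspace of a_n (g_n = 2), chosen arbitrarily.
subspace_basis = [(1, 0, 0), (0, 1, 0)]

# Two orthonormal states to be superposed.
psi1 = (s, 0.5, 0.5)
psi2 = (s, -0.5, -0.5)

def probability(lam1, lam2):
    """Formula (E-22): sum over i of |<u_n^i|psi>|^2, where each amplitude
    is itself the sum (E-23) of two partial amplitudes."""
    total = 0.0
    for u in subspace_basis:
        amplitude = lam1 * inner(u, psi1) + lam2 * inner(u, psi2)
        total += abs(amplitude) ** 2
    return total

def weighted_sum(lam1, lam2):
    """Weighted sum of the probabilities for psi1 and psi2 taken separately,
    i.e. the degenerate analogue of the incorrect formula (E-5)."""
    p1 = sum(abs(inner(u, psi1)) ** 2 for u in subspace_basis)
    p2 = sum(abs(inner(u, psi2)) ** 2 for u in subspace_basis)
    return abs(lam1) ** 2 * p1 + abs(lam2) ** 2 * p2

print(probability(s, s), probability(s, -s), weighted_sum(s, s))
```

The two relative signs give different probabilities, neither of which coincides with the weighted sum: the interference occurs inside each term of the sum over the degeneracy index.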

Before summarizing the preceding discussions, we are going to study another
important situation where several final states are associated with the same
measurement result.


b. INSUFFICIENTLY SELECTIVE MEASUREMENT DEVICES 


α. Definition

Assume that, in order to measure the observable $A$ in a given physical system, we have
at our disposal a device which works in the following way:

(i) This device can give only two different answers*, which we shall denote, for
convenience, by “yes” and “no”.

(ii) If the system is in an eigenstate of $A$ whose eigenvalue is included in a given
interval $\Delta$ of the real axis, the answer is always “yes”; this is also the case when the state of the
system is any linear combination of eigenstates of $A$ associated with eigenvalues which are
all included in $\Delta$.

(iii) If the state of the system is an eigenstate of $A$ whose eigenvalue falls outside $\Delta$,
or any linear combination of such eigenstates, the answer is always “no”.

$\Delta$ therefore characterizes the resolving power of the measurement device under
consideration. If there exists only one eigenvalue $a_n$ of $A$ in the interval $\Delta$, the resolving power is
infinite: when the system is in an arbitrary state, the probability $\mathscr{P}(\text{yes})$ of obtaining
the answer “yes” is equal to the probability of finding $a_n$ in a measurement of $A$; the probability
$\mathscr{P}(\text{no})$ of obtaining “no” is obviously equal to $1 - \mathscr{P}(\text{yes})$. If, on the other hand,
$\Delta$ contains several eigenvalues of $A$, the device does not have a sufficient resolution to distinguish
between these various eigenvalues: we shall say that it is insufficiently selective. We shall see
how to calculate $\mathscr{P}(\text{yes})$ and $\mathscr{P}(\text{no})$ in this case.

To be able to study the perturbation created by such a measurement on the state of the
system, we are going to add the following hypothesis: the device transmits without perturbation
the eigenstates of $A$ associated with the eigenvalues of the interval $\Delta$ (as well as any linear
combination of these eigenstates), while it “blocks” the eigenstates of $A$ associated with the
eigenvalues outside $\Delta$ (as well as all their linear combinations). The device thus behaves like
a perfect filter for all states associated with $\Delta$.


* The following arguments can easily be generalized to cases where the device can give several
different answers having characteristics similar to those described in (ii) and (iii).

β. Example

Most measurement devices used in practice are insufficiently selective.

For example, to measure the abscissa of an electron propagating parallel to the $Oz$ axis,
one can (fig. 7) place in the $xOy$ plane ($Oy$ is perpendicular to the plane of the figure) a plate
with a slit whose axis is parallel to $Oy$, the abscissas of the edges being $x_1$ and $x_2$. It can then
be seen that any wave packet which is entirely included between the $x = x_1$ and $x = x_2$ planes
(a superposition of eigenstates of $X$ having eigenvalues $x$ contained within the interval $[x_1, x_2]$)
will enter the region to the right of the slit (“yes” answer); in this case, it will not undergo any
modification. On the other hand, any wave packet situated below the $x = x_1$ plane or above
the $x = x_2$ plane will be blocked by the plate and will not pass to the right (“no” answer).



FIGURE 7

Schematic drawing of a device for measuring the abscissa $x$ of a particle. Since the
interval $[x_1, x_2]$ is necessarily non-zero, such a device is always imperfectly selective.


γ. Quantum description

For such an insufficiently selective device, several final states are possible after a
measurement which has yielded the answer yes, for example, the various eigenstates of $A$ which
correspond to the eigenvalues of the interval $\Delta$.

The physical problem posed by such devices, and which we are now going to consider,
consists of predicting the answer which will be obtained when a system in an arbitrary state
enters the device. For example, for the apparatus of figure 7, what happens when we are dealing
with a wave packet which is neither entirely contained between the $x = x_1$ and $x = x_2$ planes
(in which case the answer is certainly yes) nor entirely situated outside this region (in which
case the answer is certainly no)? We shall see that this is equivalent to measuring an observable
whose spectrum is degenerate.

Consider the subspace $\mathscr{E}_\Delta$ spanned by all the eigenstates of $A$ whose eigenvalues $a_n$ are
contained within the interval $\Delta$. The projector $P_\Delta$ onto this subspace is written (cf. §B-3-b-γ
of chap. II):

$$P_\Delta = \sum_{a_n \in \Delta} \sum_{i=1}^{g_n} |u_n^i\rangle \langle u_n^i| \tag{E-24}$$

(the eigenvalues $a_n$ of the interval $\Delta$ can be degenerate, hence the additional index $i$; the
vectors $|u_n^i\rangle$ are assumed to be orthonormal). $\mathscr{E}_\Delta$ is the subspace formed by all the possible states
of the system after a measurement which has given the result yes.

Referring to the definition of the measurement device, we see that the response will
certainly be yes for any state belonging to $\mathscr{E}_\Delta$, that is, for any eigenstate of $P_\Delta$ with the eigenvalue
$+1$. The answer will certainly be no for any state belonging to the supplement of $\mathscr{E}_\Delta$,
that is, for any eigenstate of $P_\Delta$ with the eigenvalue $0$. The yes and no answers which can
be furnished by the measurement device therefore correspond to the eigenvalues $+1$ and $0$ of the
observable $P_\Delta$: it could be said that the device is actually measuring the observable $P_\Delta$ rather
than $A$.

In the light of this interpretation, the case of an insufficiently selective measurement
device can be treated in the framework of the postulates which we have stated. The probability
$\mathscr{P}(\text{yes})$ of obtaining the answer yes is equal to the probability of finding the (degenerate)
eigenvalue $+1$ of $P_\Delta$. Now an orthonormal basis in the corresponding eigensubspace is
constituted by the set of states $|u_n^i\rangle$ which are eigenstates of $A$ with eigenvalues contained within
the interval $\Delta$. Applying formula (B-7) to the eigenvalue $+1$ of the observable $P_\Delta$, we therefore
obtain (for a system in the state $|\psi\rangle$):

$$\mathscr{P}(\text{yes}) = \sum_{a_n \in \Delta} \sum_{i=1}^{g_n} |\langle u_n^i | \psi \rangle|^2 \tag{E-25}$$

Since there are only two possible answers, we obviously have:

$$\mathscr{P}(\text{no}) = 1 - \mathscr{P}(\text{yes}) \tag{E-26}$$

The projector onto the eigensubspace associated with the eigenvalue $+1$ of the observable
$P_\Delta$ is $P_\Delta$ itself; formula (B-14) therefore gives here:

$$\mathscr{P}(\text{yes}) = \langle \psi | P_\Delta | \psi \rangle \tag{E-27}$$

[this formula is equivalent to (E-25)].

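Similarly, since the device does not perturb states belonging to $\mathscr{E}_\Delta$ and blocks those of the
supplement of $\mathscr{E}_\Delta$, we find that the state of the system after a measurement which has given
the result yes is:

$$|\psi'\rangle = \frac{1}{\sqrt{\displaystyle\sum_{a_n \in \Delta} \sum_{i=1}^{g_n} |\langle u_n^i | \psi \rangle|^2}} \sum_{a_n \in \Delta} \sum_{i=1}^{g_n} |u_n^i\rangle \langle u_n^i | \psi \rangle \tag{E-28}$$

that is:

$$|\psi'\rangle = \frac{1}{\sqrt{\langle \psi | P_\Delta | \psi \rangle}}\, P_\Delta |\psi\rangle \tag{E-29}$$

When $\Delta$ contains only one eigenvalue $a_n$, $P_\Delta$ reduces to $P_n$: formulas (B-14) and (B-31) are then
seen to be special cases of formulas (E-27) and (E-29).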
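For a finite-dimensional system the action of such a filter reduces to a few list operations. The sketch below assumes that the state is given by its components in an orthonormal eigenbasis of $A$; the function name `measure_yes`, the spectrum and the state are hypothetical choices made for the illustration.

```python
import math

def measure_yes(eigenvalues, components, low, high):
    """Return (P(yes), state after a 'yes' answer) for a filter accepting
    the interval [low, high].

    `components[k]` is the coefficient <u_k|psi> of a normalized state on the
    eigenvector |u_k> of A, whose eigenvalue is `eigenvalues[k]`.
    """
    # Action of the projector P_Delta (E-24) on |psi>.
    projected = [c if low <= a <= high else 0j
                 for a, c in zip(eigenvalues, components)]
    # (E-25), equivalently <psi|P_Delta|psi> (E-27).
    p_yes = sum(abs(c) ** 2 for c in projected)
    if p_yes == 0:
        return 0.0, None          # the answer 'yes' never occurs
    norm = math.sqrt(p_yes)
    return p_yes, [c / norm for c in projected]      # (E-29)

eigenvalues = [1.0, 2.0, 2.0, 3.0]     # the eigenvalue 2.0 is twofold degenerate
psi = [0.5, 0.5j, -0.5, 0.5]           # normalized: 4 x 0.25 = 1

p_yes, psi_after = measure_yes(eigenvalues, psi, 1.5, 3.5)
p_no = 1 - p_yes                       # (E-26)
print(p_yes, p_no, psi_after)
```

The components inside the accepted interval keep their relative moduli and phases; only the overall normalization changes.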

c. RECAPITULATION: MUST ONE SUM THE AMPLITUDES OR THE PROBABILITIES?

There are therefore cases (§E-1) where, to calculate a probability, one takes
the square of a sum, because several probability amplitudes must be added together.
In other cases (§E-2), one takes a sum of squares, because several probabilities
must be added together. It is clearly important not to confuse these different cases
and to know, in a given situation, if it is the probability amplitudes or the
probabilities themselves which must be summed.

Young’s double-slit experiment will again furnish us with a very convenient
physical example which will enable us to illustrate and summarize the preceding
discussions. Assume that we want to calculate the probability for a particular
photon to strike the plate between two points $M_1$ and $M_2$ having abscissas of $x_1$
and $x_2$ (fig. 8). This probability is proportional to the total light intensity received
by this portion of the plate. It is therefore a “sum of squares”; more precisely,
it is the integral of the intensity $I(x)$ between $x_1$ and $x_2$. But each term $I(x)$ of this
sum is obtained by squaring the electric field $\mathcal{E}(x)$ at $x$, which is equal to the sum
of the electric fields $\mathcal{E}_A(x)$ and $\mathcal{E}_B(x)$ radiated at $M$ by the slits $A$ and $B$. $I(x)$ is
therefore proportional to $|\mathcal{E}_A(x) + \mathcal{E}_B(x)|^2$, that is, to the square of a sum.
$\mathcal{E}_A(x)$ and $\mathcal{E}_B(x)$ are the amplitudes associated with the two possible paths $SAM$
and $SBM$ which end at the same point $M$; they are added to obtain the amplitude
at $M$ since one is not trying to determine through which slit the photon passes.
Then, to calculate the total light intensity received by the interval $M_1 M_2$, one
adds the intensities which arrive at the various points of this interval.


FIGURE 8

Young’s double-slit experiment. To calculate the probability density for detecting a
photon at the point $M$, it is necessary to add the electric fields radiated by the
slits $A$ and $B$, then to square the field thus obtained (“square of the sum”). The
probability of finding a photon in the interval $[x_1, x_2]$ is now obtained by summing this
probability density between $x_1$ and $x_2$ (“sum of squares”).


To sum up, the fundamental idea which must be retained from the discussions 
of this section can be expressed schematically in the following way: 


Add the amplitudes corresponding to the same final state, then the probabilities 
corresponding to orthogonal final states. 
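The rule can be sketched numerically for the double-slit arrangement. The model below is deliberately crude and rests on assumptions not made in the text: scalar fields of unit modulus (the decrease of the field with distance is ignored), and illustrative values for the wavelength and the geometry. The result is only proportional to the detection probability.

```python
import cmath
import math

WAVELENGTH = 0.5e-6        # metres (illustrative value)
SLIT_SEPARATION = 1.0e-4   # distance between the slits A and B
DISTANCE = 1.0             # distance from the slits to the plate

def field(x, slit_x):
    """Unit-modulus field radiated at abscissa x of the plate by a slit at slit_x."""
    path = math.hypot(DISTANCE, x - slit_x)
    return cmath.exp(2j * math.pi * path / WAVELENGTH)

def intensity(x):
    """Square of the sum: the two amplitudes ending at the same point are added."""
    field_a = field(x, +SLIT_SEPARATION / 2)
    field_b = field(x, -SLIT_SEPARATION / 2)
    return abs(field_a + field_b) ** 2

def interval_weight(x1, x2, steps=2000):
    """Sum of squares: the intensities reaching the different points of
    [x1, x2] are added (midpoint rule)."""
    dx = (x2 - x1) / steps
    return sum(intensity(x1 + (k + 0.5) * dx) for k in range(steps)) * dx

# Interval centred on the first dark fringe (at about 2.5 mm for these values).
weight = interval_weight(2.0e-3, 3.0e-3)

# What one would get by adding the intensities of the two slits at each point:
# |E_A|^2 + |E_B|^2 = 2 everywhere in this model.
no_interference = 2 * (3.0e-3 - 2.0e-3)

print(weight, no_interference)
```

Around a dark fringe the correct weight is much smaller than the value obtained by adding the intensities of the two slits, because the amplitudes are summed before squaring at each point and only then are the contributions of the different points added.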


d. APPLICATION TO THE TREATMENT OF CONTINUOUS SPECTRA 


When the observable we want to measure has a continuous spectrum, only insufficiently 
selective devices can be used: it is impossible to imagine a physical device which could isolate 
a single eigenvalue belonging to a continuous set. We shall see here how the study of §E-2-b 
enables us to be more precise and complete in our treatment of observables with continuous 
spectra. 


α. Example: measurement of the position of a particle

Let $\psi(\mathbf{r}) = \langle \mathbf{r} | \psi \rangle$ be the wave function of a (spinless) particle. What is the probability
of finding the abscissa of this particle within the interval $[x_1, x_2]$ of the $x$-axis, using, for
example, a measurement device like the one in figure 7?

The subspace $\mathscr{E}_\Delta$ associated with this measurement result is the space spanned by the
kets $|\mathbf{r}\rangle = |x, y, z\rangle$ which are such that $x_1 \leq x \leq x_2$. Since these kets are orthonormal in the
extended sense, application of the rule stated in §c above yields:

$$\begin{aligned}
\mathscr{P}(x_1 \leq x \leq x_2) &= \int_{x_1}^{x_2} dx \int_{-\infty}^{+\infty} dy \int_{-\infty}^{+\infty} dz\, |\langle x, y, z | \psi \rangle|^2 \\
&= \int_{x_1}^{x_2} dx \int_{-\infty}^{+\infty} dy \int_{-\infty}^{+\infty} dz\, |\psi(\mathbf{r})|^2
\end{aligned} \tag{E-30}$$
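
Formula (E-27) obviously leads to the same result, since the projector $P_\Delta$ is written here:

$$P_\Delta = \int_{x_1}^{x_2} dx \int_{-\infty}^{+\infty} dy \int_{-\infty}^{+\infty} dz\, |x, y, z\rangle \langle x, y, z| \tag{E-31}$$

and we therefore have:

$$\mathscr{P}(x_1 \leq x \leq x_2) = \langle \psi | P_\Delta | \psi \rangle = \int_{x_1}^{x_2} dx \int_{-\infty}^{+\infty} dy \int_{-\infty}^{+\infty} dz\, \langle \psi | x, y, z \rangle \langle x, y, z | \psi \rangle \tag{E-32}$$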


To know the state $|\psi'\rangle$ of the particle after such a measurement, which has yielded
the result yes, it suffices to apply formula (E-29):

$$|\psi'\rangle = \frac{1}{N} P_\Delta |\psi\rangle = \frac{1}{N} \int_{x_1}^{x_2} dx' \int_{-\infty}^{+\infty} dy' \int_{-\infty}^{+\infty} dz'\, |x', y', z'\rangle \langle x', y', z' | \psi \rangle \tag{E-33}$$

where the normalization factor $N = \sqrt{\langle \psi | P_\Delta | \psi \rangle}$ is known [formula (E-32)]. Let us calculate
the wave function $\psi'(\mathbf{r}) = \langle \mathbf{r} | \psi' \rangle$ associated with the ket $|\psi'\rangle$:

$$\langle \mathbf{r} | \psi' \rangle = \frac{1}{N} \int_{x_1}^{x_2} dx' \int_{-\infty}^{+\infty} dy' \int_{-\infty}^{+\infty} dz'\, \langle \mathbf{r} | \mathbf{r}' \rangle\, \psi(\mathbf{r}') \tag{E-34}$$

Now $\langle \mathbf{r} | \mathbf{r}' \rangle = \delta(\mathbf{r} - \mathbf{r}') = \delta(x - x')\, \delta(y - y')\, \delta(z - z')$. The integrations over $y'$ and $z'$ can
therefore be performed immediately: they amount to replacing $y'$ and $z'$ by $y$ and $z$ in the
function to be integrated. Equation (E-34) thus becomes:

$$\psi'(x, y, z) = \frac{1}{N} \int_{x_1}^{x_2} dx'\, \delta(x - x')\, \psi(x', y, z) \tag{E-35}$$

If the point $x' = x$ is situated inside the interval of integration $[x_1, x_2]$, the result is the same
as if we were integrating from $-\infty$ to $+\infty$:

$$\psi'(x, y, z) = \frac{1}{N}\, \psi(x, y, z) \quad \text{for } x_1 \leq x \leq x_2 \tag{E-36}$$

On the other hand, if $x' = x$ falls outside the interval of integration, $\delta(x - x')$ is zero for all
values of $x'$ included in this interval, and:

$$\psi'(x, y, z) = 0 \quad \text{for } x > x_2 \text{ and } x < x_1 \tag{E-37}$$


The part of $\psi(\mathbf{r})$ which corresponds to the interval accepted by the measurement device
therefore persists, undeformed, immediately after the measurement [the factor $1/N$ simply
ensures that $\psi'(\mathbf{r})$ remains normalized]; the rest is suppressed by the measurement. The wave
packet $\psi(\mathbf{r})$ representing the initial state of the particle is, as it were, “truncated” by the edges
of the slit.
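The truncation can be visualized on a grid. The sketch below is a one-dimensional analogue only (the integrals over $y$ and $z$ are dropped), with a Gaussian wave packet and a slit chosen for the illustration; the function name `truncate` and the grid parameters are assumptions.

```python
import math

def truncate(xs, psi, x1, x2):
    """Return (P(yes), psi') for a slit accepting x1 <= x <= x2.

    `psi` lists the values of a normalized wave function on the uniform grid `xs`.
    """
    dx = xs[1] - xs[0]
    kept = [p if x1 <= x <= x2 else 0.0 for x, p in zip(xs, psi)]
    p_yes = sum(abs(p) ** 2 for p in kept) * dx    # 1D analogue of (E-30)
    if p_yes == 0:
        raise ValueError("the wave packet does not overlap the slit")
    norm = math.sqrt(p_yes)                        # the factor N of (E-33)
    return p_yes, [p / norm for p in kept]         # (E-36) inside, (E-37) outside

dx = 0.01
xs = [-8.0 + k * dx for k in range(1601)]
psi = [math.pi ** -0.25 * math.exp(-x * x / 2) for x in xs]   # normalized Gaussian

p_yes, psi_after = truncate(xs, psi, -1.0, 1.0)
print(p_yes)
```

Inside the slit the new wave function is the old one multiplied by the constant $1/N$, which is larger than 1; outside it vanishes.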


comments:

(i) This example clearly reveals the concrete meaning of the “reduction of the wave packet”.

(ii) If a large number of particles, all in the same state $|\psi\rangle$, enter the device successively,
the result will sometimes be yes and sometimes be no [with the probabilities $\mathscr{P}(\text{yes})$
and $\mathscr{P}(\text{no})$]. If the result is yes, the particle continues on its way, starting from the
“truncated” state $|\psi'\rangle$; if the result is no, the particle is absorbed by the screen.

In the example we are considering here, the measurement device becomes all the more
selective as $x_2 - x_1$ becomes smaller. We see, however, that it is impossible to make it
perfectly selective because the spectrum of $X$ is continuous: however narrow the slit may be, the
interval $[x_1, x_2]$ which it defines always contains an infinity of eigenvalues. Nevertheless, in the
limiting case of a slit of an infinitely small width $\Delta x$, we find the equivalent of formula (B-17),
which was the expression of the fourth postulate in the case of a continuous spectrum. Let us
choose $x_1 = x_0 - \frac{\Delta x}{2}$ and $x_2 = x_0 + \frac{\Delta x}{2}$ (a slit of width $\Delta x$ centered at $x_0$), and assume that
the wave function $\psi(\mathbf{r})$ varies very little within the interval $\Delta x$. Then, in (E-30), we can replace
$|\psi(\mathbf{r})|^2$ by $|\psi(x_0, y, z)|^2$ and perform the integration over $x$:

$$\mathscr{P}\left( x_0 - \frac{\Delta x}{2} \leq x \leq x_0 + \frac{\Delta x}{2} \right) \simeq \Delta x \int_{-\infty}^{+\infty} dy \int_{-\infty}^{+\infty} dz\, |\psi(x_0, y, z)|^2 \tag{E-38}$$

We indeed find a probability equal to the product of $\Delta x$ and a positive quantity which plays
the role of a probability density at the point $x_0$. The difference with formula (B-17) lies in the
fact that the latter applies to the case of a continuous but non-degenerate spectrum, while here
the eigenvalues of $X$ are infinitely degenerate in $\mathscr{E}_{\mathbf{r}}$; this is the origin of the integrals over $y$ and $z$
which appear in (E-38) (summation over the indices associated with the degeneracy).


β. Postulate of reduction of wave packets in the case of a continuous spectrum

In §B-3-c, we confined ourselves, in the statement of the fifth postulate, to the case of a
discrete spectrum. Formula (E-33) and its accompanying discussion enable us to understand the
form assumed by this postulate when a continuous spectrum is considered: it is sufficient to
apply the results of §E-2-b concerning insufficiently selective devices.

Let $A$ be an observable with a continuous spectrum (assumed, for simplicity, to be
non-degenerate). The notation is the same as in §B-3-b-β.


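If a measurement of A on a system in the state $|\psi\rangle$ has yielded the result $\alpha_0$
to within $\Delta\alpha$, the state of the system immediately after this measurement is described by:

$$|\psi'\rangle = \frac{P_{\Delta\alpha}(\alpha_0)\,|\psi\rangle}{\sqrt{\langle\psi|\,P_{\Delta\alpha}(\alpha_0)\,|\psi\rangle}} \tag{E-39}$$

with:

$$P_{\Delta\alpha}(\alpha_0) = \int_{\alpha_0 - \Delta\alpha/2}^{\alpha_0 + \Delta\alpha/2} d\alpha\; |v_\alpha\rangle\langle v_\alpha| \tag{E-40}$$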

Figures 9-a and 9-b illustrate this statement. If the function $\langle v_\alpha|\psi\rangle$ representing $|\psi\rangle$
in the $\{|v_\alpha\rangle\}$ basis has the form indicated in figure 9-a, the state of the system immediately
after the measurement is represented, to within a normalization factor, by the function of
figure 9-b [the calculation is analogous in all respects to the one which derives (E-36) and (E-37)
from (E-33)].

We see that, even if $\Delta\alpha$ is very small, one can never actually prepare the system in the
state $|v_{\alpha_0}\rangle$, which would be represented, in the $\{|v_\alpha\rangle\}$ basis, by $\langle v_\alpha|v_{\alpha_0}\rangle = \delta(\alpha - \alpha_0)$. We can
only obtain a narrow function centered at $\alpha_0$, since $\Delta\alpha$ is never exactly zero.

FIGURE 9

Illustration of the postulate of reduction of wave packets in the case of a continuous spectrum: one
measures the observable A, with eigenvectors $|v_\alpha\rangle$ and eigenvalues $\alpha$. The measurement device has
a selectivity $\Delta\alpha$. If the value found is $\alpha_0$ to within $\Delta\alpha$, the effect of the measurement on the wave
function $\langle v_\alpha|\psi\rangle$ is to "truncate" it about the value $\alpha_0$ (to normalize the new wave function, it is obviously
necessary to multiply it by a factor larger than 1).


References and suggestions for further reading : 

Development of quantum mechanical concepts: references of section 4 of the 
bibliography, particularly Jammer (4.8). 

Discussion and interpretation of the postulates: references of section 5 of the 
bibliography; Von Neumann (10.10), chaps. V and VI; Feynman III (1.2), §2.6, chap. 3 
and §8.3. 

Quantization rules using Poisson brackets: Dirac (1.13), §21; Schiff (1.18), §24.

Probability and statistics : see the corresponding subsection of section 10 of the 
bibliography. 
COMPLEMENTS OF CHAPTER III

A_III: PARTICLE IN AN INFINITE POTENTIAL WELL
B_III: STUDY OF THE PROBABILITY CURRENT IN SOME SPECIAL CASES

A_III, B_III: direct applications of chapter III to simple cases. The accent is placed on the
physical discussion of the results (elementary level).

C_III: ROOT-MEAN-SQUARE DEVIATIONS OF TWO CONJUGATE OBSERVABLES

C_III: a little more formal; general proof of the Heisenberg relations; may be skipped on a
first reading.

D_III: MEASUREMENTS BEARING ON ONLY ONE PART OF A PHYSICAL SYSTEM

D_III: discussion of measurements bearing only on part of a system; a rather simple but
somewhat formal application of chapter III; may be skipped on a first reading.

E_III: THE DENSITY OPERATOR
F_III: THE EVOLUTION OPERATOR
G_III: THE SCHRÖDINGER AND HEISENBERG PICTURES
H_III: GAUGE INVARIANCE
J_III: PROPAGATOR FOR THE SCHRÖDINGER EQUATION

E_III, F_III, G_III, H_III, J_III: complements which serve as an introduction to a more advanced
quantum mechanics course. Aside from F_III, which is simple, they are on a higher level
than the rest of this book, but they are comprehensible if chapter III has been read. To be
reserved for subsequent study.

E_III: definition and properties of the density operator, which is used in the quantum
mechanical description of systems whose state is imperfectly known (statistical mixture of
states); fundamental tool of quantum statistical mechanics.

F_III: introduction of the evolution operator, which gives the quantum state of a system at an
arbitrary instant t in terms of its state at the instant $t_0$.

G_III: describes the evolution of a quantum system in a way which is different from, but
equivalent to, that of chapter III. The time dependence now appears in the observables and
not in the state of the system.

H_III: discussion of the quantum formalism in the case where the system is subject to an
electromagnetic field. Although the description of the system involves the electromagnetic
potentials, the physical properties depend only on the values of the electric and magnetic
fields; they remain invariant when the potentials describing the electromagnetic field are
changed.

J_III: an introduction to a different way of approaching quantum mechanics, based on a
principle analogous to Huygens' principle in classical wave optics.
COMPLEMENTS OF CHAPTER III (continued)

K_III: UNSTABLE LEVELS; LIFETIMES

K_III: simple introduction to the important physical concepts of instability and lifetimes; easy,
but can be skipped on a first reading.

L_III: EXERCISES

M_III: BOUND STATES OF A PARTICLE IN A "POTENTIAL WELL" OF ARBITRARY SHAPE
N_III: UNBOUND STATES OF A PARTICLE IN THE PRESENCE OF A POTENTIAL WELL OR BARRIER OF ARBITRARY SHAPE
O_III: QUANTUM PROPERTIES OF A PARTICLE IN A ONE-DIMENSIONAL PERIODIC STRUCTURE

M_III, N_III, O_III: return to one-dimensional problems, considered from a more general point of
view than in chapter I and its complements.

M_III: generalization to an arbitrary potential well of the principal results obtained in §2-c of
complement H_I; recommended, since easy and physically important.

N_III: study of unbound stationary states in an arbitrary potential; a little more formal; the
definitions and results of this complement are necessary for complement O_III.

O_III: introduction of the concept (which is fundamental to solid state physics) of energy
bands in a potential having a periodic structure (this concept will be treated differently in
complement F_XI); rather difficult, can be reserved for later reading.
Complement A_III

PARTICLE IN AN INFINITE POTENTIAL WELL


1. Distribution of the momentum values in a stationary state

a. Calculation of the function $\bar\varphi_n(p)$, of $\langle P\rangle$ and of $\Delta P$

b. Discussion

2. Evolution of the particle's wave function

a. Wave function at the instant t

b. Evolution of the shape of the wave packet

c. Motion of the center of the wave packet

3. Perturbation created by a position measurement


In complement H_I (§2-c-β), we studied the stationary states of a particle in a
one-dimensional infinite potential well. Here we intend to re-examine this subject from
a physical point of view. This will allow us to apply some of the postulates of
chapter III to a concrete case. We shall be particularly interested in the results that
can be obtained when the position or momentum of the particle is measured.


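1. Distribution of the momentum values in a stationary state


a. CALCULATION OF THE FUNCTION $\bar\varphi_n(p)$, OF $\langle P\rangle$ AND OF $\Delta P$

We have seen that the stationary states of the particle correspond to the
energies*:

$$E_n = \frac{n^2\pi^2\hbar^2}{2ma^2} \tag{1}$$

and to the wave functions:

$$\varphi_n(x) = \sqrt{\frac{2}{a}}\,\sin\left(\frac{n\pi x}{a}\right) \tag{2}$$

(where a is the width of the well and n is any positive integer).

Consider a particle in the state $|\varphi_n\rangle$, with energy $E_n$. The probability of a
measurement of the momentum P of the particle yielding a result between p
and p + dp is:

$$\bar{\mathcal{P}}_n(p)\,dp = |\bar\varphi_n(p)|^2\,dp \tag{3}$$

with:

$$\bar\varphi_n(p) = \frac{1}{\sqrt{2\pi\hbar}} \int_0^a \sqrt{\frac{2}{a}}\,\sin\left(\frac{n\pi x}{a}\right) e^{-ipx/\hbar}\,dx \tag{4}$$


* We shall use the notation of complement H_I.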
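This integral is easy to calculate; it is equal to:

$$\bar\varphi_n(p) = \frac{1}{2i\sqrt{\pi\hbar a}} \int_0^a \left[ e^{i\left(\frac{n\pi}{a} - \frac{p}{\hbar}\right)x} - e^{-i\left(\frac{n\pi}{a} + \frac{p}{\hbar}\right)x} \right] dx \tag{5}$$

that is:

$$\bar\varphi_n(p) = \frac{1}{2i}\sqrt{\frac{a}{\pi\hbar}}\; e^{i\left(\frac{n\pi}{2} - \frac{pa}{2\hbar}\right)} \left[ F\left(p - \frac{n\pi\hbar}{a}\right) + (-1)^{n+1}\, F\left(p + \frac{n\pi\hbar}{a}\right) \right] \tag{6}$$

with:

$$F(p) = \frac{\sin(pa/2\hbar)}{pa/2\hbar} \tag{7}$$

To within a proportionality factor, the function $\bar\varphi_n(p)$ is the sum (or the
difference) of two "diffraction functions" $F(p \pm n\pi\hbar/a)$, centered at $p = \mp n\pi\hbar/a$.

The "width" of these functions (the distance between the first two zeros, symmetrical
with respect to the central value) does not depend on n and is equal
to $4\pi\hbar/a$. Their "amplitude" does not depend on n either.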

The function inside brackets in expression (6) is even if n is odd, and odd if n
is even. The probability density $\bar{\mathcal{P}}_n(p)$ given in (3) is therefore an even function of p
in all cases, so that:

$$\langle P\rangle_n = \int_{-\infty}^{+\infty} \bar{\mathcal{P}}_n(p)\, p\, dp = 0 \tag{8}$$

The mean value of the momentum of the particle in the energy state $E_n$ is therefore
zero.

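Let us calculate, in the same way, the mean value $\langle P^2\rangle_n$ of the square of the
momentum. Using the fact that in the $\{|x\rangle\}$ representation P acts like $\frac{\hbar}{i}\frac{d}{dx}$,
and performing an integration by parts, we obtain*:

$$\langle P^2\rangle_n = \hbar^2 \int_0^a \left|\frac{d\varphi_n}{dx}\right|^2 dx = \hbar^2 \int_0^a \frac{2}{a}\left(\frac{n\pi}{a}\right)^2 \cos^2\left(\frac{n\pi x}{a}\right) dx = \left(\frac{n\pi\hbar}{a}\right)^2 \tag{9}$$


* Result (9) could also be derived from (6) by performing the integral
$\langle P^2\rangle_n = \int_{-\infty}^{+\infty} |\bar\varphi_n(p)|^2\, p^2\, dp$.
This calculation, which presents no theoretical difficulties, is nevertheless not as direct as the one which
is given here.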
From (8) and (9), we get:

$$\Delta P_n = \sqrt{\langle P^2\rangle_n - \left(\langle P\rangle_n\right)^2} = \frac{n\pi\hbar}{a} \tag{10}$$

The root-mean-square deviation therefore increases linearly with n.
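Results (8) to (10) can be checked numerically. The following is a minimal sketch, not part of the original text: it assumes units in which $\hbar = a = 1$, evaluates the momentum density by squaring the bracket of formula (6), and computes $\langle P^2\rangle_n$ in the position representation as in (9). The function names (`momentum_density`, `mean_p_squared`, `trapezoid`) are illustrative choices.

```python
import numpy as np

HBAR = 1.0  # assumed units: hbar = 1
A = 1.0     # assumed well width a = 1


def trapezoid(y, x):
    """Trapezoidal rule written out by hand."""
    return float(np.sum((y[1:] + y[:-1]) * np.diff(x)) / 2.0)


def F(p, a=A, hbar=HBAR):
    """Diffraction function F(p) = sin(pa/2hbar) / (pa/2hbar) of formula (7)."""
    # np.sinc(x) is sin(pi x)/(pi x), hence the rescaled argument.
    return np.sinc(p * a / (2.0 * np.pi * hbar))


def momentum_density(p, n, a=A, hbar=HBAR):
    """Momentum probability density obtained by squaring the bracket of (6)."""
    shift = n * np.pi * hbar / a
    bracket = F(p - shift, a, hbar) + (-1) ** (n + 1) * F(p + shift, a, hbar)
    return a / (4.0 * np.pi * hbar) * bracket ** 2


def mean_p_squared(n, a=A, hbar=HBAR, points=20001):
    """<P^2>_n = hbar^2 * integral over the well of |d(phi_n)/dx|^2, as in (9)."""
    x = np.linspace(0.0, a, points)
    dphi = np.sqrt(2.0 / a) * (n * np.pi / a) * np.cos(n * np.pi * x / a)
    return hbar ** 2 * trapezoid(dphi ** 2, x)


if __name__ == "__main__":
    p = np.linspace(-200.0, 200.0, 400001)
    for n in (1, 2, 5):
        rho = momentum_density(p, n)
        norm = trapezoid(rho, p)        # should be close to 1
        mean = trapezoid(rho * p, p)    # should vanish, as in (8)
        delta_p = np.sqrt(mean_p_squared(n) - mean ** 2)
        print(n, norm, mean, delta_p, n * np.pi * HBAR / A)
```

The momentum grid is truncated at $|p| = 200$; since the density falls off like $1/p^4$, the normalization is only slightly affected by this cut-off.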


b. DISCUSSION 

Let us trace, for different values of n, the curves which give the probability
density $\bar{\mathcal{P}}_n(p)$. To do this, let us begin by studying the function inside brackets
in expression (6). For the ground state (n = 1), it is the sum of two functions F,
the centers of these two diffraction curves being separated by half their width
(fig. 1-a). For the first excited level (n = 2), the distance between these centers is
twice as large, and in this case, moreover, the difference of two functions F must
be taken (fig. 2-a). Finally, for an excited level corresponding to a large value of n,
the centers of the two diffraction curves are separated by a distance much greater
than their width.



FIGURE 1

The wave function $\bar\varphi_1(p)$, associated in the $\{|p\rangle\}$ representation with the ground state of a
particle in an infinite well, is obtained by adding two diffraction functions F (curves in dashed lines in
figure a). Since the centers of these two functions F are separated by half their width, their sum has
the shape represented by the solid-line curve in figure a. Squaring this sum, one obtains the
probability density $\bar{\mathcal{P}}_1(p)$ associated with a measurement of the momentum of the particle (fig. b).


FIGURE 2

For the first excited level, the function $\bar\varphi_2(p)$ is obtained by taking the difference between two
functions F, which have the same width as in figure 1-a but are now more widely separated (dashed-line
curves in figure a). The curve obtained is the solid line in figure a. The probability density $\bar{\mathcal{P}}_2(p)$
has two maxima located in the neighborhood of $p = \pm 2\pi\hbar/a$ (fig. b).


Squaring these functions, one obtains the probability density $\bar{\mathcal{P}}_n(p)$ (cf. fig. 1-b
and 2-b). Note that for large n the interference term between $F(p - n\pi\hbar/a)$ and
$F(p + n\pi\hbar/a)$ is negligible (because of the separation of the centers of the two
curves):

$$\bar{\mathcal{P}}_n(p) = \frac{a}{4\pi\hbar}\left[ F\left(p - \frac{n\pi\hbar}{a}\right) + (-1)^{n+1} F\left(p + \frac{n\pi\hbar}{a}\right) \right]^2 \simeq \frac{a}{4\pi\hbar}\left[ F^2\left(p - \frac{n\pi\hbar}{a}\right) + F^2\left(p + \frac{n\pi\hbar}{a}\right) \right] \tag{11}$$

The function $\bar{\mathcal{P}}_n(p)$ then has the shape shown in figure 3.

It can be seen that when n is large, the probability density has two symmetrical
peaks, of width $4\pi\hbar/a$, centered at $p = \pm n\pi\hbar/a$. It is then possible to predict with
almost complete certainty the results of a measurement of the momentum of the
particle in the state $|\varphi_n\rangle$: the value found will be nearly equal to $+n\pi\hbar/a$ or
$-n\pi\hbar/a$, the relative accuracy* improving as n increases (the two opposite
values $\pm n\pi\hbar/a$ being equally probable). This is simple to understand: for large n,
the function $\varphi_n(x)$, which varies sinusoidally, performs numerous oscillations inside
the well; it can then be considered to be practically the sum of two progressive
waves corresponding to opposite momenta $p = \pm n\pi\hbar/a$.

* The absolute accuracy is independent of n, since the width of the curves is always $4\pi\hbar/a$.



FIGURE 3 


When n is large (a very excited level), the probability density has two pronounced peaks, centered at
the values $p = \pm n\pi\hbar/a$, which are the momenta associated with the classical motion at the same
energy.


When n decreases, the relative accuracy with which one can predict the possible
values of the momentum diminishes. We see, for example, in figure 2-b, that
when n = 2, the function $\bar{\mathcal{P}}_n(p)$ has two peaks whose widths are comparable to their
distance from the origin. In this case, the wave function undergoes only one
oscillation inside the well. It is not surprising that, for this sinusoid "truncated"
at x = 0 and x = a, the wavelength (and therefore, the momentum of the particle)
is poorly defined. Finally, for the ground state, the wave function is represented
by half a sinusoidal arc: the relative values of the wavelength and momentum of the
particle are then very poorly known (fig. 1-b).

comments:

(i) Let us calculate the momentum of a classical particle of energy $E_n$ given
in (1); we have:

$$\frac{p_{cl}^2}{2m} = \frac{n^2\pi^2\hbar^2}{2ma^2} \tag{12}$$

that is:

$$p_{cl} = \pm\frac{n\pi\hbar}{a} \tag{13}$$

When n is large, the two peaks of $\bar{\mathcal{P}}_n(p)$ therefore correspond to the classical
values of the momentum.
(ii) We see that, for large n, although the absolute value of the momentum is
well-defined, its sign is not. This is why $\Delta P_n$ is large: for probability
distributions with two maxima like that of figure 3, the root-mean-square
deviation reflects the distance between the two peaks; it is no longer related
to their widths.


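2. Evolution of the particle's wave function

Each of the states $|\varphi_n\rangle$, with its wave function $\varphi_n(x)$, describes a stationary
state, which leads to time-independent physical predictions. Time evolution
appears only when the state vector is a linear combination of several kets $|\varphi_n\rangle$.
We shall consider here a very simple case, for which at time t = 0 the state vector
$|\psi(0)\rangle$ is:

$$|\psi(0)\rangle = \frac{1}{\sqrt{2}}\left[\,|\varphi_1\rangle + |\varphi_2\rangle\,\right] \tag{14}$$


a. WAVE FUNCTION AT THE INSTANT t

Each component of (14) acquires the phase factor $e^{-iE_n t/\hbar}$, with $E_n$ given in (1); we obtain:

$$|\psi(t)\rangle = \frac{1}{\sqrt{2}}\left[\, e^{-i\frac{\pi^2\hbar}{2ma^2}t}\,|\varphi_1\rangle + e^{-2i\frac{\pi^2\hbar}{ma^2}t}\,|\varphi_2\rangle \,\right] \tag{15}$$

or, omitting a global phase factor of $|\psi(t)\rangle$:

$$|\psi(t)\rangle \propto \frac{1}{\sqrt{2}}\left[\,|\varphi_1\rangle + e^{-i\omega_{21}t}\,|\varphi_2\rangle\,\right] \tag{16}$$

with:

$$\omega_{21} = \frac{E_2 - E_1}{\hbar} = \frac{3\pi^2\hbar}{2ma^2} \tag{17}$$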

b. EVOLUTION OF THE SHAPE OF THE WAVE PACKET

The shape of the wave packet is given by the probability density:

$$|\psi(x, t)|^2 = \frac{1}{2}\varphi_1^2(x) + \frac{1}{2}\varphi_2^2(x) + \varphi_1(x)\,\varphi_2(x)\cos\omega_{21}t \tag{18}$$

We see that the time variation of the probability density is due to the interference
term in $\varphi_1\varphi_2$. Only one Bohr frequency appears, $\nu_{21} = (E_2 - E_1)/h$, since the
initial state (14) is composed only of the two states $|\varphi_1\rangle$ and $|\varphi_2\rangle$. The curves
corresponding to the variation of the functions $\varphi_1^2$, $\varphi_2^2$ and $\varphi_1\varphi_2$ are traced in
figures 4-a, b and c.
FIGURE 4

Graphical representation of the functions $\varphi_1^2$ (the probability density of the particle in the
ground state), $\varphi_2^2$ (the probability density of the particle in the first excited state) and $\varphi_1\varphi_2$
(the cross term responsible for the evolution of the shape of the wave packet).


Using these figures and relation (18), it is not difficult to represent graphically 
the variation in time of the shape of the wave packet (cf. fig. 5): we see that the 
wave packet oscillates between the two walls of the well. 







FIGURE 5 

Periodic motion of a wave packet obtained by superposing the ground state and the first excited state
of a particle in an infinite well. The frequency of the motion is the Bohr frequency $\omega_{21}/2\pi$.
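c. MOTION OF THE CENTER OF THE WAVE PACKET

Let us calculate the mean value $\langle X\rangle(t)$ of the position of the particle at
time t. It is convenient to take:

$$X' = X - a/2 \tag{19}$$

since, by symmetry, the diagonal matrix elements of X' are zero:

$$\langle\varphi_1|X'|\varphi_1\rangle \propto \int_0^a \left(x - \frac{a}{2}\right)\sin^2\left(\frac{\pi x}{a}\right)dx = 0$$

$$\langle\varphi_2|X'|\varphi_2\rangle \propto \int_0^a \left(x - \frac{a}{2}\right)\sin^2\left(\frac{2\pi x}{a}\right)dx = 0 \tag{20}$$

We then have:

$$\langle X'\rangle(t) = \mathrm{Re}\left\{ e^{-i\omega_{21}t}\,\langle\varphi_1|X'|\varphi_2\rangle \right\} \tag{21}$$

with:

$$\langle\varphi_1|X'|\varphi_2\rangle = \langle\varphi_1|X|\varphi_2\rangle - \frac{a}{2}\langle\varphi_1|\varphi_2\rangle = \frac{2}{a}\int_0^a x\,\sin\left(\frac{\pi x}{a}\right)\sin\left(\frac{2\pi x}{a}\right)dx = -\frac{16a}{9\pi^2} \tag{22}$$

Therefore:

$$\langle X\rangle(t) = \frac{a}{2} - \frac{16a}{9\pi^2}\cos\omega_{21}t \tag{23}$$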

FIGURE 6 

Time variation of the mean value < X > corresponding to the wave packet of figure 5. The dashed 
line represents the position of a classical particle moving with the same period. Quantum mechanics 
predicts that the center of the wave packet will turn back before reaching the wall, as explained by 
the action of the potential on the “edges” of the wave packet. 
The variation of $\langle X\rangle(t)$ is represented in figure 6. In dashed lines, the
variation of the position of a classical particle has been traced, for a particle moving
to and fro in the well with an angular frequency of $\omega_{21}$ (since it is not subjected to
any force except at the walls, its position varies linearly with t between 0
and a during each half-period).

We immediately notice a very clear difference between these two types of
motion, classical and quantum mechanical. The center of the quantum wave
packet, instead of turning back at the walls of the well, executes a movement of
smaller amplitude and retraces its steps before reaching the regions where the
potential is not zero. We see again here a result of §D-2 of chapter I: since the
potential varies infinitely quickly at x = 0 and x = a, its variation within a domain
of the order of the dimension of the wave packet is not negligible, and the motion
of the center of the wave packet does not obey the laws of classical mechanics
(see also chapter III, §D-1-d-γ). The physical explanation of this phenomenon
is the following: before the center of the wave packet has touched the wall, the
action of the potential on the "edges" of this packet is sufficient to make it turn
back.
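The motion of the center of the wave packet can be reproduced numerically. The sketch below is an illustration under stated assumptions ($\hbar = m = a = 1$; the helper names are hypothetical): it integrates $x\,|\psi(x,t)|^2$ using the density (18) and compares the result with the closed form $\langle X\rangle(t) = a/2 - (16a/9\pi^2)\cos\omega_{21}t$.

```python
import numpy as np

A = 1.0      # assumed well width
HBAR = 1.0   # assumed units
MASS = 1.0   # assumed particle mass


def trapezoid(y, x):
    """Trapezoidal rule written out by hand."""
    return float(np.sum((y[1:] + y[:-1]) * np.diff(x)) / 2.0)


def phi(n, x, a=A):
    """Stationary wave function sqrt(2/a) sin(n pi x / a)."""
    return np.sqrt(2.0 / a) * np.sin(n * np.pi * x / a)


def omega_21(a=A, hbar=HBAR, m=MASS):
    """Bohr angular frequency (E_2 - E_1)/hbar = 3 pi^2 hbar / (2 m a^2)."""
    return 3.0 * np.pi ** 2 * hbar / (2.0 * m * a ** 2)


def density(x, t, a=A):
    """Probability density (18) for the equal-weight superposition (14)."""
    p1, p2 = phi(1, x, a), phi(2, x, a)
    return 0.5 * p1 ** 2 + 0.5 * p2 ** 2 + p1 * p2 * np.cos(omega_21(a) * t)


def mean_x_numeric(t, a=A, points=4001):
    """<X>(t) obtained by integrating x |psi(x, t)|^2 over the well."""
    x = np.linspace(0.0, a, points)
    return trapezoid(x * density(x, t, a), x)


def mean_x_closed_form(t, a=A):
    """Closed form a/2 - (16 a / 9 pi^2) cos(omega_21 t)."""
    return a / 2.0 - 16.0 * a / (9.0 * np.pi ** 2) * np.cos(omega_21(a) * t)


if __name__ == "__main__":
    period = 2.0 * np.pi / omega_21()
    for t in np.linspace(0.0, period, 5):
        print(t, mean_x_numeric(t), mean_x_closed_form(t))
```

The amplitude $16a/9\pi^2 \approx 0.18\,a$ is well below $a/2$, which is the quantitative content of the statement that the center of the packet turns back before reaching the walls.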


comment:

The mean value of the energy of the particle in the state $|\psi(t)\rangle$
calculated in (15) is easy to obtain:

$$\langle H\rangle = \frac{1}{2}E_1 + \frac{1}{2}E_2 = \frac{5}{2}E_1 \tag{24}$$

as is:

$$\langle H^2\rangle = \frac{1}{2}E_1^2 + \frac{1}{2}E_2^2 = \frac{17}{2}E_1^2 \tag{25}$$

which gives:

$$\Delta H = \frac{3}{2}E_1 \tag{26}$$

Note in particular that $\langle H\rangle$, $\langle H^2\rangle$ and $\Delta H$ are not time-dependent; since H
is a constant of the motion, this could have been foreseen. In addition, we see from
the preceding discussion that the wave packet evolves appreciably over a time of
the order of:

$$\Delta t \simeq \frac{1}{\omega_{21}} \tag{27}$$

Using (26) and (27), we find:

$$\Delta H \cdot \Delta t \simeq \frac{3}{2}E_1 \times \frac{\hbar}{3E_1} = \frac{\hbar}{2} \tag{28}$$

We again find the time-energy uncertainty relation.
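The arithmetic behind (24) to (28) is short enough to be checked exactly. The following sketch is illustrative only: it measures energies in units of $E_1$ (so that $E_n = n^2$), and the function name `energy_moments` is a hypothetical helper.

```python
from fractions import Fraction
import math


def energy_moments(weights):
    """Return (<H>, <H^2>, variance) in units of E_1, E_1^2 and E_1^2.

    `weights` maps the level index n to its probability; E_n = n^2 E_1.
    """
    mean = sum(w * Fraction(n * n) for n, w in weights.items())
    mean_sq = sum(w * Fraction(n * n) ** 2 for n, w in weights.items())
    return mean, mean_sq, mean_sq - mean ** 2


if __name__ == "__main__":
    half = Fraction(1, 2)
    mean, mean_sq, variance = energy_moments({1: half, 2: half})
    delta_h = math.sqrt(variance)   # Delta H in units of E_1
    omega_21 = 4 - 1                # (E_2 - E_1)/hbar in units of E_1/hbar
    # Delta H * Delta t with Delta t = 1/omega_21, expressed in units of hbar.
    print(mean, mean_sq, delta_h, delta_h / omega_21)
```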
3. Perturbation created by a position measurement


Consider a particle in the state $|\varphi_1\rangle$. Assume that the position of the particle
is measured at time t = 0, with the result x = a/2. What are the probabilities of the
different results that can be obtained in a measurement of the energy, performed
immediately after this first measurement?

One must beware of the following false argument: after the measurement,
the particle is in the eigenstate of X corresponding to the result found, and its wave
function is therefore proportional to $\delta(x - a/2)$; if a measurement of the energy
is then performed, the various values $E_n$ can be found, with probabilities
proportional to:

$$\left|\int_0^a \delta\left(x - \frac{a}{2}\right)\varphi_n^*(x)\,dx\right|^2 = \left|\varphi_n\left(\frac{a}{2}\right)\right|^2 = \begin{cases} 2/a & \text{if } n \text{ is odd} \\ 0 & \text{if } n \text{ is even} \end{cases} \tag{29}$$

Using this incorrect argument, one would find the probabilities of all values of $E_n$
corresponding to odd n to be equal. This is absurd, since the sum of these
probabilities would then be infinite.

This error results from the fact that we have not taken the norm of the wave
function into account. To apply the fourth postulate of chapter III correctly, it is
necessary to write the wave function as normalized just after the first measurement.
However, it is not possible to normalize the function $\delta(x - a/2)$*. The problem
posed above must be stated more precisely.

As we saw in §E-2-b of chapter III, an experiment in which the measurement
of an observable with a continuous spectrum is performed never yields any result
with complete accuracy. For the case with which we are concerned, we can only
say that:

$$\frac{a}{2} - \frac{\varepsilon}{2} \le x \le \frac{a}{2} + \frac{\varepsilon}{2} \tag{30}$$

where $\varepsilon$ depends on the measurement device used but is never zero.

If we assume $\varepsilon$ to be much smaller than the extension of the wave function
before the measurement (here a), the wave function after the measurement will be
practically $\sqrt{\varepsilon}\,\delta^{(\varepsilon)}(x - a/2)$ [$\delta^{(\varepsilon)}(x - a/2)$ is the function which is zero everywhere except in the
interval defined in (30), where it takes on the value $1/\varepsilon$; cf. appendix II, §1-a].
This wave function is indeed normalized since:

$$\int dx\, \left|\sqrt{\varepsilon}\,\delta^{(\varepsilon)}\left(x - \frac{a}{2}\right)\right|^2 = 1 \tag{31}$$


* We see concretely in this example that a δ-function cannot represent a physically realizable state.
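What happens now if the energy is measured? Each value $E_n$ can be found
with the probability:

$$\mathcal{P}(E_n) = \left|\int \varphi_n^*(x)\,\sqrt{\varepsilon}\,\delta^{(\varepsilon)}\left(x - \frac{a}{2}\right)dx\right|^2 = \begin{cases} \dfrac{8a}{\varepsilon}\left(\dfrac{1}{n\pi}\right)^2 \sin^2\left(\dfrac{n\pi\varepsilon}{2a}\right) & \text{if } n \text{ is odd} \\[2ex] 0 & \text{if } n \text{ is even} \end{cases} \tag{32}$$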


The variation with respect to n of $\mathcal{P}(E_n)$, for fixed $\varepsilon$ and odd n, is shown in
figure 7. This figure shows that the probability $\mathcal{P}(E_n)$ becomes negligible when n
is much larger than $a/\varepsilon$. Therefore, however small $\varepsilon$ may be, the distribution of
probabilities $\mathcal{P}(E_n)$ depends strongly on $\varepsilon$. This is why, in the first argument,
where we set $\varepsilon = 0$ at the beginning, we could not obtain the correct result. We also
see from the figure that the smaller $\varepsilon$ is, the more the curve extends towards large
values of n. The interpretation of this result is the following: according to
Heisenberg's uncertainty relations (cf. chap. I, §C-3), if one measures the position
of the particle with great accuracy, one drastically changes its momentum. Thus
kinetic energy is transferred to the particle, the amount increasing as $\varepsilon$ decreases.
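The dependence of the distribution (32) on $\varepsilon$ can be made concrete with a short numerical sketch. It is an illustration only: lengths are measured in units of a (so the parameter is $\varepsilon/a$), and `energy_probabilities` is a hypothetical helper name. Because the wave function after the measurement is normalized, the probabilities must sum to 1; the code checks this on a long but finite sum.

```python
import numpy as np


def energy_probabilities(eps_over_a, n_max):
    """Probabilities P(E_n) of formula (32) for n = 1 .. n_max."""
    n = np.arange(1, n_max + 1)
    prob = np.zeros(n_max)
    odd = n % 2 == 1
    n_odd = n[odd].astype(float)
    prob[odd] = (8.0 / eps_over_a) / (n_odd * np.pi) ** 2 \
        * np.sin(n_odd * np.pi * eps_over_a / 2.0) ** 2
    return n, prob


if __name__ == "__main__":
    for eps in (0.1, 0.01):
        n, prob = energy_probabilities(eps, 2_000_000)
        # Total probability, and weight carried by levels with n <= a/eps.
        low = prob[n <= 1.0 / eps].sum()
        print(eps, prob.sum(), prob[0], low)
```

As $\varepsilon$ decreases, the weight of the ground level drops and a growing number of excited levels is needed to exhaust the total probability, in line with the discussion of figure 7.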



FIGURE 7 

Variation with n of the probability $\mathcal{P}(E_n)$ of finding the energy $E_n$
after a measurement of the particle's position has yielded the
result a/2 with an accuracy of $\varepsilon$ ($\varepsilon \ll a$). The smaller $\varepsilon$, the greater
the probability of finding high energy values.
Complement B_III

STUDY OF THE PROBABILITY CURRENT
IN SOME SPECIAL CASES


1. Expression for the current in constant potential regions

2. Application to potential barrier problems

a. Case where $E > V_0$

b. Case where $E < V_0$

3. Probability current of incident and evanescent waves, in the case of reflection from a two-dimensional
potential step


The probability current associated with a particle having a wave function $\psi(\mathbf{r}, t)$
was defined in chapter III by the relation:

$$\mathbf{J}(\mathbf{r}, t) = \frac{\hbar}{2mi}\left[\psi^*(\mathbf{r}, t)\,\nabla\psi(\mathbf{r}, t) - \text{c.c.}\right] \tag{1}$$

(where c.c. is an abbreviation for complex conjugate). In this complement, we
shall study this probability current in greater detail in some special cases: one- and
two-dimensional "square" potentials.

1. Expression for the current in constant potential regions 


Consider a one-dimensional problem, with a particle of energy E placed in a
constant potential $V_0$. In complement H_I we distinguished between several cases.

(i) When $E > V_0$, the wave function is written:

$$\psi(x) = A\,e^{ikx} + A'\,e^{-ikx} \tag{2}$$

with:

$$E - V_0 = \frac{\hbar^2 k^2}{2m} \tag{3}$$

Substituting (2) into (1), we obtain:

$$J_x = \frac{\hbar k}{m}\left[\,|A|^2 - |A'|^2\,\right] \tag{4}$$

The interpretation of this result is simple: the wave function given in (2) corresponds
to two plane waves of opposite momenta $p = \pm\hbar k$ with probability densities $|A|^2$
and $|A'|^2$.

(ii) When $E < V_0$, we have:

$$\psi(x) = B\,e^{\rho x} + B'\,e^{-\rho x} \tag{5}$$

with:

$$V_0 - E = \frac{\hbar^2\rho^2}{2m} \tag{6}$$

Substituting (5) into (1), we obtain:

$$J_x = \frac{\hbar\rho}{m}\left[\,iB^*B' + \text{c.c.}\,\right] \tag{7}$$

In this case, we see that the two exponential waves must both have non-zero
coefficients for the probability current to be non-zero.
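Both expressions for the current can be verified by evaluating definition (1) directly. The sketch below is a minimal check under the assumption $\hbar = m = 1$; the function names are illustrative. It uses the identity $(\hbar/2mi)[\psi^*\psi' - \text{c.c.}] = (\hbar/m)\,\mathrm{Im}(\psi^*\psi')$ and analytic derivatives of the two wave functions.

```python
import numpy as np

HBAR = 1.0  # assumed units
MASS = 1.0  # assumed particle mass


def current(psi, dpsi_dx, hbar=HBAR, m=MASS):
    """One-dimensional form of definition (1): J = (hbar/m) Im(psi* dpsi/dx)."""
    return hbar / m * np.imag(np.conj(psi) * dpsi_dx)


def current_oscillating(x, amp, amp_prime, k):
    """Current of A e^{ikx} + A' e^{-ikx} (case E > V_0)."""
    forward = amp * np.exp(1j * k * x)
    backward = amp_prime * np.exp(-1j * k * x)
    return current(forward + backward, 1j * k * (forward - backward))


def current_exponential(x, b, b_prime, rho):
    """Current of B e^{rho x} + B' e^{-rho x} (case E < V_0)."""
    growing = b * np.exp(rho * x)
    decaying = b_prime * np.exp(-rho * x)
    return current(growing + decaying, rho * (growing - decaying))


if __name__ == "__main__":
    x = np.linspace(-1.0, 1.0, 5)
    print(current_oscillating(x, 1.0 + 0.5j, 0.3 - 0.2j, 2.0))
    print(current_exponential(x, 0.7 + 0.1j, 0.2 - 0.4j, 1.5))
    print(current_exponential(x, 0.0, 0.2 - 0.4j, 1.5))  # single exponential
```

In each case the printed current is independent of x, as expected for a stationary state in one dimension, and a single real exponential carries no current.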


2. Application to potential barrier problems 

Let us apply these results to the potential barrier problems studied in
complements H_I and J_I. We shall therefore consider a particle of mass m and
energy E propagating in the Ox direction and arriving at x = 0 at a potential step
of height $V_0$ (fig. 1).



a. CASE WHERE $E > V_0$

Apply formula (4) to wave functions (11) and (12) of complement H_I, setting,
as in that complement:

$$A'_2 = 0 \tag{8}$$

In region I, the probability current is:

$$J_I = \frac{\hbar k_1}{m}\left[\,|A_1|^2 - |A'_1|^2\,\right] \tag{9}$$

and in region II:

$$J_{II} = \frac{\hbar k_2}{m}\,|A_2|^2 \tag{10}$$

$J_I$ is the difference of two terms, the first one corresponding to the incident current
and the second, to the reflected current. The ratio of these two currents gives the
reflection coefficient R of the barrier:

$$R = \left|\frac{A'_1}{A_1}\right|^2 \tag{11}$$

which is precisely formula (15) of complement H_I.



Similarly, the transmission coefficient T of the barrier is the ratio of the
transmitted current $J_{II}$ to the incident current; we therefore have:

$$T = \frac{k_2}{k_1}\left|\frac{A_2}{A_1}\right|^2 \tag{12}$$

and we again find relation (16) of complement H_I.
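Current conservation across the step requires R + T = 1, which can be checked with a few lines of code. The sketch below is an illustration, not a transcription of complement H_I: it assumes the standard amplitude ratios obtained by matching the wave function and its derivative at x = 0, namely $A'_1/A_1 = (k_1 - k_2)/(k_1 + k_2)$ and $A_2/A_1 = 2k_1/(k_1 + k_2)$, with $\hbar = m = 1$ by default. The function name `step_coefficients` is hypothetical.

```python
import math


def step_coefficients(energy, v0, m=1.0, hbar=1.0):
    """Reflection and transmission coefficients (11) and (12) for E > V_0 > 0.

    Assumes the standard matching conditions at x = 0 for the amplitude ratios.
    """
    if energy <= v0:
        raise ValueError("this sketch only covers the case E > V_0")
    k1 = math.sqrt(2.0 * m * energy) / hbar
    k2 = math.sqrt(2.0 * m * (energy - v0)) / hbar
    reflected_ratio = (k1 - k2) / (k1 + k2)      # A'_1 / A_1
    transmitted_ratio = 2.0 * k1 / (k1 + k2)     # A_2 / A_1
    r = reflected_ratio ** 2                      # ratio of reflected to incident current
    t = (k2 / k1) * transmitted_ratio ** 2        # ratio of transmitted to incident current
    return r, t


if __name__ == "__main__":
    for energy in (1.01, 2.0, 5.0, 50.0):
        r, t = step_coefficients(energy, 1.0)
        print(energy, r, t, r + t)
```

The factor $k_2/k_1$ in T is essential: it accounts for the different velocities on the two sides of the step, and without it the sum would not equal 1.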


b. CASE WHERE $E < V_0$

Since the expression for the wave function $\varphi_I(x)$ is the same as in §a,
relation (9) is still valid. However, in region II, the wave function is:

$$\varphi_{II}(x) = B'_2\,e^{-\rho_2 x} \tag{13}$$

[since, in equation (20) of complement H_I, $B_2 = 0$]. Using (7), we thus obtain:

$$J_{II} = 0 \tag{14}$$

The transmitted flux is zero, as is consistent with relation (24) of H_I.

How should we interpret the fact that, in region II, the probability current is 
zero while the probability of finding the particle in this region is not? Let us refer to 
the results obtained in §1 of complement J_I. We saw that part of the incident wave
packet enters the classically forbidden region II and then turns back before setting 
out again in the negative x direction (this incursion into region II being responsible 
for the delay upon reflection). In the steady state, we shall therefore have two 
probability currents in region II: a positive current corresponding to the entrance 
into this region of part of the incident wave packet; a negative current corresponding 
to the return towards region I of this part of the wave packet. These two currents 
are exactly equal, so the overall result is zero. 

In the case of a one-dimensional problem, the structure of the probability 
current of the evanescent wave is therefore masked by the fact that the two 
opposite currents balance. This is why we are going to consider a two-dimensional 
problem, for the case of oblique reflection, so as to obtain a non-zero current and 
interpret its structure. 


3. Probability current of incident and evanescent waves, 

in the case of reflection from a two-dimensional potential step 


We shall consider the following two-dimensional problem: a particle of
mass m, moving in the xOy plane, has a potential energy V(x, y) which is independent
of y and given by:

$$V(x, y) = \begin{cases} 0 & \text{if } x < 0 \\ V_0 & \text{if } x > 0 \end{cases} \tag{15}$$
The present case corresponds to the one studied in §2 of complement F_I:
the potential energy V(x, y) is the sum of a function $V_1(x)$ (potential energy of a
one-dimensional step) and a function $V_2(y)$, which is zero here. We can therefore
look for a solution of the eigenvalue equation of the Hamiltonian in the form of a
product:

$$\varphi(x, y) = \varphi_1(x)\,\varphi_2(y) \tag{16}$$

The functions $\varphi_1(x)$ and $\varphi_2(y)$ satisfy one-dimensional eigenvalue equations which
correspond respectively to $V_1(x)$ and $V_2(y)$ and to energies $E_1$ and $E_2$ such that:

$$E_1 + E_2 = E \quad \text{(total energy of the particle)} \tag{17}$$

We shall assume $E_1 < V_0$: the equation which gives $\varphi_1(x)$ therefore
corresponds to total reflection in a one-dimensional problem, and we can use
formulas (11) and (20) of complement H_I. As for the function $\varphi_2(y)$, it can be
obtained immediately since it corresponds to the case of a free particle ($V_2 = 0$):
it is a plane wave. We therefore have, in region I (x < 0):

$$\varphi_I(x, y) = A\,e^{i(k_x x + k_y y)} + A'\,e^{i(-k_x x + k_y y)} \tag{18}$$

with:

$$k_x = \sqrt{\frac{2mE_1}{\hbar^2}}\,, \qquad k_y = \sqrt{\frac{2mE_2}{\hbar^2}} \tag{19}$$

and, in region II (x > 0):

$$\varphi_{II}(x, y) = B\,e^{-\rho_x x}\,e^{ik_y y} \tag{20}$$

with:

$$\rho_x = \sqrt{\frac{2m(V_0 - E_1)}{\hbar^2}} \tag{21}$$

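Equations (22) and (23) of H_I give us the ratios A'/A and B/A. Introducing the
parameter $\theta$ defined by:

$$\tan\theta = \frac{\rho_x}{k_x} = \sqrt{\frac{V_0 - E_1}{E_1}}\,; \qquad 0 \le \theta \le \frac{\pi}{2} \tag{22}$$

we obtain:

$$\frac{A'}{A} = \frac{k_x - i\rho_x}{k_x + i\rho_x} = e^{-2i\theta} \tag{23}$$

and:

$$\frac{B}{A} = \frac{2k_x}{k_x + i\rho_x} = 2\cos\theta\; e^{-i\theta} \tag{24}$$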
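Let us apply relation (1), which defines the probability current. We obtain,
in region I:

$$(J_I)_x = \frac{\hbar k_x}{m}\left[\,|A|^2 - |A'|^2\,\right] = 0$$

$$(J_I)_y = \frac{\hbar k_y}{m}\left|A\,e^{ik_x x} + A'\,e^{-ik_x x}\right|^2 = \frac{\hbar k_y}{m}\,|A|^2\left[\,2 + 2\cos(2k_x x + 2\theta)\,\right] \tag{25}$$

and in region II:

$$(J_{II})_x = 0$$

$$(J_{II})_y = \frac{\hbar k_y}{m}\,|B|^2\,e^{-2\rho_x x} = \frac{4\hbar k_y}{m}\,|A|^2\cos^2\theta\; e^{-2\rho_x x} \tag{26}$$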

In region I, only the $(J_I)_y$ component of the probability current is non-zero;
this component is the sum of two terms:

- the term proportional to $2|A|^2$, which results from the sum of the currents
of the incident and reflected waves (cf. fig. 2);



FIGURE 2 

The sum of the probability currents associated with 
the incident and reflected waves yields a probability 
current parallel to Oy. 


- the term containing $\cos(2k_x x + 2\theta)$, which represents an interference
effect between the two waves; it is responsible for the oscillation of the probability
current with respect to x (cf. fig. 3).



FIGURE 3

Because of the interference between the incident and reflected waves, the probability current in
region I is an oscillatory function of x; in region II, it decreases exponentially (evanescent wave).
In region II, the probability current is again parallel to Oy. Its exponential
decay corresponds to the decay of the evanescent wave. This probability current
arises from the fact that the wave packets do enter the second region (cf. fig. 4)
and, before turning back, propagate in the Oy direction for a time of the order of the
reflection delay $\tau$ [cf. complement J_I, equation (10)]. This penetration is also related
to the lateral shift of the wave packet upon reflection (cf. fig. 4).
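The two closed forms for the current parallel to Oy can be compared with a direct evaluation from the wave functions. The sketch below is illustrative, with $\hbar = m = 1$ and $|A| = 1$ assumed by default and hypothetical function names. It uses $A'/A = e^{-2i\theta}$ and $B/A = 2\cos\theta\,e^{-i\theta}$, and the fact that for a wave function whose y dependence is $e^{ik_y y}$ the y component of the current is $(\hbar k_y/m)|\varphi|^2$.

```python
import numpy as np


def wave_numbers(e1, e2, v0, m=1.0, hbar=1.0):
    """Return k_x, k_y, rho_x and theta for E_1 < V_0."""
    kx = np.sqrt(2.0 * m * e1) / hbar
    ky = np.sqrt(2.0 * m * e2) / hbar
    rho = np.sqrt(2.0 * m * (v0 - e1)) / hbar
    return kx, ky, rho, np.arctan2(rho, kx)


def wave_function(x, y, e1, e2, v0, amp=1.0, m=1.0, hbar=1.0):
    """Stationary wave function in regions I (x < 0) and II (x >= 0)."""
    kx, ky, rho, theta = wave_numbers(e1, e2, v0, m, hbar)
    x = np.asarray(x, dtype=float)
    left = amp * (np.exp(1j * kx * x) + np.exp(-2j * theta) * np.exp(-1j * kx * x))
    # np.maximum avoids overflow of the exponential where the branch is unused.
    right = amp * 2.0 * np.cos(theta) * np.exp(-1j * theta) \
        * np.exp(-rho * np.maximum(x, 0.0))
    return np.where(x < 0.0, left, right) * np.exp(1j * ky * y)


def current_y(x, e1, e2, v0, amp=1.0, m=1.0, hbar=1.0):
    """Closed forms for J_y: oscillating for x < 0, exponentially decaying for x >= 0."""
    kx, ky, rho, theta = wave_numbers(e1, e2, v0, m, hbar)
    x = np.asarray(x, dtype=float)
    left = hbar * ky / m * amp ** 2 * (2.0 + 2.0 * np.cos(2.0 * kx * x + 2.0 * theta))
    right = 4.0 * hbar * ky / m * amp ** 2 * np.cos(theta) ** 2 \
        * np.exp(-2.0 * rho * np.maximum(x, 0.0))
    return np.where(x < 0.0, left, right)


if __name__ == "__main__":
    x = np.linspace(-3.0, 3.0, 13)
    kx, ky, rho, theta = wave_numbers(1.0, 0.5, 3.0)
    direct = ky * np.abs(wave_function(x, 0.7, 1.0, 0.5, 3.0)) ** 2
    print(np.max(np.abs(direct - current_y(x, 1.0, 0.5, 3.0))))
```

At x = 0 the two expressions agree, since $2 + 2\cos 2\theta = 4\cos^2\theta$: the current parallel to Oy is continuous across the step.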




FIGURE 4 

The penetration of the particle into region II leads to a lateral shift upon 
reflection. 
Complement C_III

ROOT-MEAN-SQUARE DEVIATIONS
OF TWO CONJUGATE OBSERVABLES

1. The uncertainty relation for P and Q

2. The "minimum" wave packet

Two conjugate observables P and Q are two observables whose commutator [Q, P]
is equal to $i\hbar$. We shall show in this complement that the root-mean-square
deviations (cf. §C-5 of chapter III) $\Delta P$ and $\Delta Q$, for any state vector of the system
under study, satisfy the relation:

$$\Delta P \cdot \Delta Q \ge \frac{\hbar}{2} \tag{1}$$

We shall then show that if the system is in a state where the product $\Delta P \cdot \Delta Q$
is exactly equal to $\hbar/2$, the wave function associated with this state in the $\{|q\rangle\}$
representation is a Gaussian wave packet (as is the wave function in the $\{|p\rangle\}$
representation).

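1. The uncertainty relation for P and Q

Consider the ket:

$$|\varphi\rangle = (Q + i\lambda P)\,|\psi\rangle \tag{2}$$

where $\lambda$ is an arbitrary real parameter. For all $\lambda$, the square of the norm $\langle\varphi|\varphi\rangle$
is positive or zero. This is written:

$$\begin{aligned}
\langle\varphi|\varphi\rangle &= \langle\psi|(Q - i\lambda P)(Q + i\lambda P)|\psi\rangle \\
&= \langle\psi|Q^2|\psi\rangle + \langle\psi|(i\lambda QP - i\lambda PQ)|\psi\rangle + \langle\psi|\lambda^2 P^2|\psi\rangle \\
&= \langle Q^2\rangle + i\lambda\langle[Q, P]\rangle + \lambda^2\langle P^2\rangle \\
&= \langle Q^2\rangle - \lambda\hbar + \lambda^2\langle P^2\rangle \ge 0
\end{aligned} \tag{3}$$

The discriminant of this expression, of second order in $\lambda$, is therefore negative or
zero:

$$\hbar^2 - 4\langle P^2\rangle\langle Q^2\rangle \le 0 \tag{4}$$

and we have:

$$\langle P^2\rangle\langle Q^2\rangle \ge \frac{\hbar^2}{4} \tag{5}$$

Assuming $|\psi\rangle$ to be given, let us now introduce the two observables Q'
and P' defined by:

$$P' = P - \langle P\rangle = P - \langle\psi|P|\psi\rangle$$

$$Q' = Q - \langle Q\rangle = Q - \langle\psi|Q|\psi\rangle \tag{6}$$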


P' and Q' are also conjugate observables, since we have:

$$[Q', P'] = [Q, P] = i\hbar \tag{7}$$

Result (5), obtained above for P and Q, is therefore also valid for P' and Q':

$$\langle P'^2\rangle\langle Q'^2\rangle \ge \frac{\hbar^2}{4} \tag{8}$$

In addition, referring to definition (C-23) (chap. III) of the root-mean-square
deviation and using (6), we see that:

$$\Delta P = \sqrt{\langle P'^2\rangle}$$

$$\Delta Q = \sqrt{\langle Q'^2\rangle} \tag{9}$$

Relation (8) can therefore also be written:

$$\Delta P \cdot \Delta Q \ge \frac{\hbar}{2} \tag{10}$$

Thus, if two observables are conjugate (as is the case when they correspond to a
classical position $x_i$ and its conjugate momentum $p_i$), there exists an exact lower
bound for the product $\Delta P \cdot \Delta Q$. We thus generalize the Heisenberg uncertainty
relation.
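As an illustration of the lower bound, one can evaluate the product of the deviations for the stationary states of the infinite well of complement A_III, taking the position X and momentum P of the particle as the conjugate pair. The sketch below assumes $\hbar = a = 1$ and uses hypothetical helper names; the closed form $\Delta X\,\Delta P = \hbar\sqrt{n^2\pi^2/12 - 1/2}$ used for comparison follows from the standard moments of $\sin^2(n\pi x/a)$.

```python
import numpy as np

HBAR = 1.0  # assumed units
A = 1.0     # assumed well width


def trapezoid(y, x):
    """Trapezoidal rule written out by hand."""
    return float(np.sum((y[1:] + y[:-1]) * np.diff(x)) / 2.0)


def well_uncertainties(n, a=A, hbar=HBAR, points=20001):
    """Return (Delta X, Delta P) for the n-th stationary state of the well."""
    x = np.linspace(0.0, a, points)
    phi = np.sqrt(2.0 / a) * np.sin(n * np.pi * x / a)
    dphi = np.sqrt(2.0 / a) * (n * np.pi / a) * np.cos(n * np.pi * x / a)
    mean_x = trapezoid(x * phi ** 2, x)
    mean_x2 = trapezoid(x ** 2 * phi ** 2, x)
    delta_x = np.sqrt(mean_x2 - mean_x ** 2)
    # <P> = 0 for these real wave functions, so Delta P = sqrt(<P^2>).
    delta_p = hbar * np.sqrt(trapezoid(dphi ** 2, x))
    return delta_x, delta_p


if __name__ == "__main__":
    for n in range(1, 6):
        dx, dp = well_uncertainties(n)
        print(n, dx * dp, HBAR * np.sqrt(n ** 2 * np.pi ** 2 / 12.0 - 0.5))
```

The product stays above $\hbar/2$ for every level and grows with n; even the ground state does not saturate the bound.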

comment:

This argument can easily be generalized to two arbitrary observables A
and B. One obtains:

$$\Delta A \cdot \Delta B \ge \frac{1}{2}\left|\langle[A, B]\rangle\right| \tag{11}$$


2. The "minimum" wave packet

When the minimum value of the product $\Delta P \cdot \Delta Q$ is attained:

$$\Delta P \cdot \Delta Q = \frac{\hbar}{2} \tag{12}$$

the state vector $|\psi\rangle$ is said to correspond to a minimum wave packet for the
observables P and Q.

According to the preceding argument, relation (12) requires that the square
of the norm of the ket:

$$|\varphi'\rangle = (Q' + i\lambda P')\,|\psi\rangle \tag{13}$$

be a second-order polynomial in $\lambda$ with a double root $\lambda_0$. When $\lambda = \lambda_0$, the ket $|\varphi'\rangle$
is therefore zero:

$$(Q' + i\lambda_0 P')\,|\psi\rangle = \left[\,Q - \langle Q\rangle + i\lambda_0\left(P - \langle P\rangle\right)\right]|\psi\rangle = 0 \tag{14}$$




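On the other hand, if $\Delta P \cdot \Delta Q > \hbar/2$, the polynomial which gives $\langle\varphi'|\varphi'\rangle$ can
never be equal to zero (it is positive for all $\lambda$).

Therefore, the necessary and sufficient condition for the product $\Delta P \cdot \Delta Q$ to
take on its minimum value $\hbar/2$ is that the kets $(Q - \langle Q\rangle)|\psi\rangle$ and $(P - \langle P\rangle)|\psi\rangle$
be proportional. The proportionality coefficient $-i\lambda_0$ can easily be calculated.
When $\Delta Q \cdot \Delta P = \hbar/2$, the equation:

$$\langle\varphi'|\varphi'\rangle = \lambda^2 (\Delta P)^2 - \lambda\hbar + (\Delta Q)^2 = 0 \tag{15}$$

has for its double root:

$$\lambda_0 = \frac{\hbar}{2(\Delta P)^2} = \frac{2(\Delta Q)^2}{\hbar} \tag{16}$$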

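Let us write relation (14) in the $\{|q\rangle\}$ representation (for simplicity, we
assume the eigenvalues $q$ of $Q$ to be non-degenerate). Using the fact (cf. complement
$E_{II}$) that in this representation, $P$ acts like $\dfrac{\hbar}{i}\dfrac{d}{dq}$, we obtain:

$$\left[q + \hbar\lambda_0\frac{d}{dq} - \langle Q\rangle - i\lambda_0\langle P\rangle\right]\psi(q) = 0 \tag{17}$$

where:

$$\psi(q) = \langle q|\psi\rangle \tag{18}$$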


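To integrate equation (17), it is convenient to introduce the function $\theta(q)$
defined by:

$$\psi(q) = e^{i\langle P\rangle q/\hbar}\;\theta(q - \langle Q\rangle) \tag{19}$$

Substituting (19) into (17), we thus obtain a simpler equation:

$$\left[q + \lambda_0\hbar\frac{d}{dq}\right]\theta(q) = 0 \tag{20}$$

whose solution is:

$$\theta(q) = C\,e^{-q^2/2\lambda_0\hbar} \tag{21}$$

(where $C$ is an arbitrary complex constant). Substituting (16) and (21) into (19),
we obtain:

$$\psi(q) = C\,e^{i\langle P\rangle q/\hbar}\,\exp\left\{-\left[\frac{q - \langle Q\rangle}{2\,\Delta Q}\right]^2\right\} \tag{22}$$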

This function can be normalized by setting:

$$C = \left[2\pi(\Delta Q)^2\right]^{-1/4} \tag{23}$$

We thus arrive at the following conclusion: when the product $\Delta P \cdot \Delta Q$ takes
on its minimum value $\hbar/2$, the wave function in the $\{|q\rangle\}$ representation is a
Gaussian wave packet, obtained from the Gaussian function $\theta(q)$ by transformation
(19) (which is equivalent to two changes of the origin, one on the $q$-axis and
one on the $p$-axis).
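
As a numerical cross-check of this conclusion, the following sketch evaluates $\Delta Q$ and $\Delta P$ for the wave function (22) with the constant (23), using simple Riemann sums on a grid. It assumes units in which $\hbar = 1$, lets $P$ act as $(\hbar/i)\,d/dq$, and uses illustrative function names (`gaussian_packet`, `rms_deviations`) that do not come from the text.

```python
import cmath
import math

HBAR = 1.0  # assumption: natural units in which hbar = 1


def gaussian_packet(q, q0, p0, delta_q):
    """Wave function (22) with the normalization constant (23)."""
    norm = (2.0 * math.pi * delta_q ** 2) ** -0.25
    phase = cmath.exp(1j * p0 * q / HBAR)
    envelope = math.exp(-((q - q0) / (2.0 * delta_q)) ** 2)
    return norm * phase * envelope


def packet_derivative(q, q0, p0, delta_q):
    """Analytic derivative d(psi)/dq of the packet above."""
    factor = 1j * p0 / HBAR - (q - q0) / (2.0 * delta_q ** 2)
    return factor * gaussian_packet(q, q0, p0, delta_q)


def rms_deviations(q0, p0, delta_q, points=20001, half_width=12.0):
    """Return (Delta Q, Delta P) computed from Riemann sums on a uniform grid."""
    step = 2.0 * half_width * delta_q / (points - 1)
    grid = [q0 - half_width * delta_q + i * step for i in range(points)]
    psi = [gaussian_packet(q, q0, p0, delta_q) for q in grid]
    dpsi = [packet_derivative(q, q0, p0, delta_q) for q in grid]
    norm = sum(abs(a) ** 2 for a in psi) * step
    mean_q = sum(q * abs(a) ** 2 for q, a in zip(grid, psi)) * step / norm
    mean_q2 = sum(q * q * abs(a) ** 2 for q, a in zip(grid, psi)) * step / norm
    # <P> = integral of psi^* (hbar / i) d(psi)/dq
    mean_p = (sum(a.conjugate() * (HBAR / 1j) * b
                  for a, b in zip(psi, dpsi)) * step / norm).real
    # <P^2> = hbar^2 * integral of |d(psi)/dq|^2 (integration by parts)
    mean_p2 = HBAR ** 2 * sum(abs(b) ** 2 for b in dpsi) * step / norm
    delta_q_num = math.sqrt(mean_q2 - mean_q ** 2)
    delta_p_num = math.sqrt(mean_p2 - mean_p ** 2)
    return delta_q_num, delta_p_num


dq, dp = rms_deviations(q0=0.3, p0=1.7, delta_q=0.8)
print(dq * dp, HBAR / 2)
```

The values chosen for `q0`, `p0` and `delta_q` are arbitrary; the product of the two deviations should agree with $\hbar/2$ to within the discretization error of the grid.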

COMMENT:

This argument in the $\{|q\rangle\}$ representation can be repeated in the $\{|p\rangle\}$
representation. One then finds that the wave function $\bar\psi(p)$ defined by:

$$\bar\psi(p) = \langle p|\psi\rangle = \frac{1}{\sqrt{2\pi\hbar}}\int_{-\infty}^{+\infty} dq\; e^{-ipq/\hbar}\,\psi(q) \tag{24}$$

is also a Gaussian function, given by:

$$\bar\psi(p) = \left[2\pi(\Delta P)^2\right]^{-1/4} e^{-i\langle Q\rangle p/\hbar}\,\exp\left\{-\left[\frac{p - \langle P\rangle}{2\,\Delta P}\right]^2\right\} \tag{25}$$

up to a phase factor $\exp(i\langle Q\rangle\langle P\rangle/\hbar)$.


MEASUREMENTS BEARING ON ONLY ONE PART
OF A PHYSICAL SYSTEM


1. Calculation of the physical predictions 

2. The physical meaning of a tensor product state 

3. The physical meaning of a state which is not a tensor product 


The concept of a tensor product, introduced in §F of chapter II, enabled us to see 
how to construct, starting with the state spaces of two subsystems, that of the global 
system obtained by considering them together. We intend to pursue this study here, 
using the postulates of chapter III to see what results can be obtained, when the 
state of the global system is known, from measurements bearing on only one 
subsystem. 


1. Calculation of the physical predictions 

Consider a system composed of two parts (1) and (2) (for example, a system
of two electrons). If $\mathcal{E}(1)$ and $\mathcal{E}(2)$ are the state spaces of parts (1) and (2), the
state space of the global system (1) + (2) is the tensor product $\mathcal{E}(1) \otimes \mathcal{E}(2)$. For
example, the state of a two-electron system is described by a wave function
of six variables, $\psi(x_1, y_1, z_1; x_2, y_2, z_2)$, associated with a ket of $\mathcal{E}_{\mathbf{r}}(1) \otimes \mathcal{E}_{\mathbf{r}}(2)$
(cf. chap. II, §F-4-b).

It is possible to imagine measurements which bear on only one of the two
parts [part (1), for example] of the global system. The observables $\tilde{A}(1)$ corresponding
to these measurements are defined in $\mathcal{E}(1) \otimes \mathcal{E}(2)$ by extending the observables
$A(1)$ acting only in $\mathcal{E}(1)$* (cf. chap. II, §F-2-b):

$$A(1) \;\Longrightarrow\; \tilde{A}(1) = A(1) \otimes \mathbb{I}(2) \tag{1}$$

where $\mathbb{I}(2)$ is the identity operator in $\mathcal{E}(2)$.

The spectrum of $\tilde{A}(1)$ in $\mathcal{E}(1) \otimes \mathcal{E}(2)$ is the same as that of $A(1)$ in $\mathcal{E}(1)$. On the
other hand, we have seen that all the eigenvalues of $\tilde{A}(1)$ are degenerate
in $\mathcal{E}(1) \otimes \mathcal{E}(2)$, even if none of the eigenvalues of $A(1)$ is degenerate in $\mathcal{E}(1)$
[on the condition, of course, that the dimension of $\mathcal{E}(2)$ be greater than 1]. When a
measurement is made on system (1) alone, the global system may therefore be in
several different states after the measurement, whatever the result (the state after the
measurement depends not only on the result but also on the state before the
measurement). From a physical point of view, this multiplicity of states corresponds
to the degrees of freedom of system (2), about which no information is sought
in the measurement.


* For the sake of clarity, we shall adopt throughout this complement different notations for $A(1)$
and its extension $\tilde{A}(1)$.
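
Let $P_n(1)$ be the projector, in $\mathcal{E}(1)$, onto the eigensubspace related to the
eigenvalue $a_n$ of $A(1)$:

$$P_n(1) = \sum_{i=1}^{g_n} |u_n^i(1)\rangle\langle u_n^i(1)| \tag{2}$$

where the kets $|u_n^i(1)\rangle$ are $g_n$ orthonormal eigenvectors associated with $a_n$.
Let $\tilde{P}_n(1)$ be the projector, in $\mathcal{E}(1) \otimes \mathcal{E}(2)$, onto the eigensubspace related to the
same eigenvalue $a_n$ of $\tilde{A}(1)$. $\tilde{P}_n(1)$ is obtained by extending $P_n(1)$ into $\mathcal{E}(1) \otimes \mathcal{E}(2)$:

$$\tilde{P}_n(1) = P_n(1) \otimes \mathbb{I}(2) \tag{3}$$

To write the identity operator $\mathbb{I}(2)$ of $\mathcal{E}(2)$, let us use the closure relation for an
arbitrary orthonormal basis $\{|v_k(2)\rangle\}$ of $\mathcal{E}(2)$:

$$\mathbb{I}(2) = \sum_k |v_k(2)\rangle\langle v_k(2)| \tag{4}$$

Substituting (4) into (3) and using (2), we obtain:

$$\tilde{P}_n(1) = \sum_{i=1}^{g_n}\sum_k |u_n^i(1)\,v_k(2)\rangle\langle u_n^i(1)\,v_k(2)| \tag{5}$$

Thus, knowing the state $|\psi\rangle$ of the global system (assumed to be normalized
to 1), we can calculate the probability $\mathcal{P}^{(1)}(a_n)$ of finding the result $a_n$ in a
measurement of $A(1)$ on part (1) of this system. Using general formula (B-14)
of chapter III, which here gives:

$$\mathcal{P}^{(1)}(a_n) = \langle\psi|\tilde{P}_n(1)|\psi\rangle \tag{6}$$

we find:

$$\mathcal{P}^{(1)}(a_n) = \sum_{i=1}^{g_n}\sum_k \big|\langle u_n^i(1)\,v_k(2)|\psi\rangle\big|^2 \tag{7}$$

Similarly, the state $|\psi'\rangle$ of the system after the measurement can be calculated;
according to formula (B-31) of chapter III, it is given by:

$$|\psi'\rangle = \frac{\tilde{P}_n(1)\,|\psi\rangle}{\sqrt{\langle\psi|\tilde{P}_n(1)|\psi\rangle}} \tag{8}$$

that is, using (5) and (7):

$$|\psi'\rangle = \frac{\displaystyle\sum_{i=1}^{g_n}\sum_k |u_n^i(1)\,v_k(2)\rangle\langle u_n^i(1)\,v_k(2)|\psi\rangle}{\sqrt{\displaystyle\sum_{i=1}^{g_n}\sum_k \big|\langle u_n^i(1)\,v_k(2)|\psi\rangle\big|^2}} \tag{9}$$
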

COMMENTS:

(i) The choice of an orthonormal basis, $\{|v_k(2)\rangle\}$, in $\mathcal{E}(2)$ is arbitrary. We see
from (3), (6) and (8) that the predictions concerning subsystem (1) do not
depend on this choice. Physically, it is clear that if no measurement is
performed on system (2), no state or set of states of this system can
play a preferential role.
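
The independence stated in comment (i) can be illustrated on a pair of two-level systems. The sketch below is a minimal example under stated assumptions: the eigenvalue $a_n$ is taken to be non-degenerate, the global state is stored as a table of coefficients on a fixed reference basis, and the names `probability_part1`, `PSI`, `STANDARD` and `ROTATED` are hypothetical choices for illustration.

```python
import cmath
import math


def probability_part1(psi, n, basis2):
    """Formula (7) for a non-degenerate eigenvalue a_n.

    psi[i][j] is the coefficient <u_i(1) e_j(2)|psi> on a fixed reference
    basis {|e_j(2)>}; basis2[k][j] is the component of |v_k(2)> on |e_j(2)>.
    """
    total = 0.0
    for v in basis2:
        # <u_n(1) v_k(2)|psi> = sum_j conj(v_k[j]) * psi[n][j]
        amplitude = sum(v[j].conjugate() * psi[n][j] for j in range(len(v)))
        total += abs(amplitude) ** 2
    return total


# Example global state (normalized, and not a tensor product).
s = 1.0 / math.sqrt(3.0)
PSI = [[s, 0.0], [1j * s, s]]

# Two different orthonormal bases of the state space of system (2).
STANDARD = [[1.0, 0.0], [0.0, 1.0]]
ANGLE = 0.7
ROTATED = [
    [math.cos(ANGLE), cmath.exp(0.3j) * math.sin(ANGLE)],
    [-cmath.exp(-0.3j) * math.sin(ANGLE), math.cos(ANGLE)],
]

for n in range(2):
    print(n, probability_part1(PSI, n, STANDARD), probability_part1(PSI, n, ROTATED))
```

Both bases should give the same probabilities for each result, since the sum over $k$ in (7) is just the squared norm of the component of $|\psi\rangle$ along $|u_n(1)\rangle$.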

(ii) If the state $|\psi\rangle$ before the measurement is a tensor product:

$$|\psi\rangle = |\varphi(1)\rangle \otimes |\chi(2)\rangle \tag{10}$$

[where $|\varphi(1)\rangle$ and $|\chi(2)\rangle$ are two normalized states of $\mathcal{E}(1)$ and $\mathcal{E}(2)$], it is
easy to see, using (3) and (8), that the state $|\psi'\rangle$ is also a tensor product:

$$|\psi'\rangle = |\varphi'(1)\rangle \otimes |\chi(2)\rangle \tag{11}$$

with:

$$|\varphi'(1)\rangle = \frac{P_n(1)\,|\varphi(1)\rangle}{\sqrt{\langle\varphi(1)|P_n(1)|\varphi(1)\rangle}} \tag{12}$$

The state of system (1) has therefore changed, but not that of system (2).

(iii) If the eigenvalue $a_n$ of $A(1)$ is non-degenerate in $\mathcal{E}(1)$ (or, more generally,
if $A(1)$ actually represents a complete set of commuting observables of $\mathcal{E}(1)$),
the index $i$ is no longer necessary in formula (2) and those which follow.
The state of the system after a measurement yielding the result $a_n$ can always
be put in the form of a product of two vectors. This can be seen by writing
relation (9) in the form:

$$|\psi'\rangle = |u_n(1)\rangle \otimes |\chi'(2)\rangle \tag{13}$$

where the normalized vector $|\chi'(2)\rangle$ of $\mathcal{E}(2)$ is given by:

$$|\chi'(2)\rangle = \frac{\displaystyle\sum_k |v_k(2)\rangle\langle u_n(1)\,v_k(2)|\psi\rangle}{\sqrt{\displaystyle\sum_k \big|\langle u_n(1)\,v_k(2)|\psi\rangle\big|^2}} \tag{14}$$

Therefore, whatever the state $|\psi\rangle$ of the global system before the measurement,
the state of the system after a measurement bearing on part (1) alone
is always a tensor product when this measurement is complete with respect
to part (1) [although partial as regards the global system (1) + (2)].


2. The physical meaning of a tensor product state

To see what a product state represents physically, let us apply the results of
the preceding paragraph to the particular case where the initial state of the global
system is of the form (10). We immediately obtain, using (6) and (3):

$$\mathcal{P}^{(1)}(a_n) = \langle\varphi(1)\,\chi(2)|\,P_n(1) \otimes \mathbb{I}(2)\,|\varphi(1)\,\chi(2)\rangle \tag{15}$$

The very definition of the tensor product $P_n(1) \otimes \mathbb{I}(2)$ and the fact that $|\chi(2)\rangle$ is
normalized then allow us to write:

$$\begin{aligned}
\mathcal{P}^{(1)}(a_n) &= \langle\varphi(1)|P_n(1)|\varphi(1)\rangle\,\langle\chi(2)|\mathbb{I}(2)|\chi(2)\rangle \\
&= \langle\varphi(1)|P_n(1)|\varphi(1)\rangle
\end{aligned} \tag{16}$$

$\mathcal{P}^{(1)}(a_n)$ does not depend on $|\chi(2)\rangle$, but only on $|\varphi(1)\rangle$. When the state of the
global system has the form (10), all physical predictions relating to only one
of the two systems do not, therefore, depend on the state of the other one and are
expressed entirely in terms of $|\varphi(1)\rangle$ [or of $|\chi(2)\rangle$], depending on whether it is
system (1) alone [or system (2) alone] that is being observed.

A product state $|\varphi(1)\rangle \otimes |\chi(2)\rangle$ can therefore be considered to represent the
simple juxtaposition of two systems, one in the state $|\varphi(1)\rangle$ and the other in
the state $|\chi(2)\rangle$. In such a state, the two systems are also said to be uncorrelated
(more precisely, the results of the two types of measurements, bearing either on one
system or on the other, correspond to independent random variables). Such a situation
is realized when the two systems have been separately prepared in the states $|\varphi(1)\rangle$
and $|\chi(2)\rangle$ and then united without interacting.
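
The statement about independent random variables can be made concrete with a short sketch. It assumes two two-level systems, measurements in one fixed pair of bases $\{|u_i(1)\rangle\}$ and $\{|v_j(2)\rangle\}$, and hypothetical helper names; it only tests factorization of the joint distribution for that one pair of measurements, which is a necessary consequence of the product form (10) rather than a full test of it.

```python
import math


def tensor_product(phi, chi):
    """Coefficients psi[i][j] = phi[i] * chi[j] of |phi(1)> (x) |chi(2)>."""
    return [[a * b for b in chi] for a in phi]


def joint_probabilities(psi):
    """P(i, j) = |<u_i(1) v_j(2)|psi>|^2 for a normalized global state."""
    return [[abs(amplitude) ** 2 for amplitude in row] for row in psi]


def marginals(joint):
    """Probabilities of the results on system (1) alone and system (2) alone."""
    p1 = [sum(row) for row in joint]
    p2 = [sum(joint[i][j] for i in range(len(joint)))
          for j in range(len(joint[0]))]
    return p1, p2


def outcomes_independent(psi, tol=1e-12):
    """True if P(i, j) = P1(i) * P2(j) for every pair of results."""
    joint = joint_probabilities(psi)
    p1, p2 = marginals(joint)
    return all(abs(joint[i][j] - p1[i] * p2[j]) < tol
               for i in range(len(p1)) for j in range(len(p2)))


root = 1.0 / math.sqrt(2.0)
product_state = tensor_product([0.6, 0.8j], [root, -root])
correlated_state = [[root, 0.0], [0.0, root]]  # not a tensor product

print(outcomes_independent(product_state))
print(outcomes_independent(correlated_state))
```

The second state is an example that cannot be written in the form (10); for it the joint probabilities do not factorize.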


3. The physical meaning of a state which is not a tensor product 


Now consider the case in which the state of the global system is not a product
state, that is, where $|\psi\rangle$ cannot be written in the form $|\varphi(1)\rangle \otimes |\chi(2)\rangle$. The
predictions of measurement results bearing on only one of the two systems can
then no longer be expressed in terms of a ket $|\varphi(1)\rangle$ [or $|\chi(2)\rangle$] in which
system (1) [or (2)] would be found. In this case, general formulas (6) and (7) must
be used to find the probabilities of the various possible results. We assume here
without proof that such a situation generally reflects the existence of correlations
between systems (1) and (2). The results of measurements bearing on either
system (1) or system (2) correspond to random variables which are not independent
and can therefore be correlated. It can be shown, for example, that an interaction
between the two systems transforms an initial state which is a product into one which
is no longer a product: any interaction between two systems therefore introduces,
in general, correlations between them.

When the state of the global system is not a product $|\varphi(1)\rangle \otimes |\chi(2)\rangle$, how
can each partial system (1) or (2) be characterized, since the ket $|\varphi(1)\rangle$ or $|\chi(2)\rangle$
can no longer be associated with it? This question is very important since, in
general, every physical system has interacted with others in the past (even if it is
isolated at the instant when it is being studied). The state of the global system
{system (1) + systems (2) with which it has interacted in the past} is therefore not
in general a product state, and it is not possible to associate a state vector $|\varphi(1)\rangle$
with system (1) alone. To resolve these difficulties, one must describe system (1),
not by a state vector, but by an operator, called the density operator. Some
indication of the corresponding formalism, fundamental to statistical quantum
mechanics, is given in complement $E_{III}$ (§5-b).

However, system (1) can always be described by a state vector when a
complete set of measurements has been performed on it. We have seen that
whatever the state of the global system (1) + (2) before the measurement,
a complete measurement on system (1) places the global system in a product
state [cf. formulas (13) and (14)]. The vector associated with (1) is the unique
eigenvector (to within a multiplicative factor) associated with the results of the
complete set of measurements done on it. This set of measurements has therefore
erased all correlations resulting from previous interactions between the two systems.
If, at the moment of measurement, system (2) is already far away and no longer
interacting with system (1), it can then be completely forgotten.

COMMENT:

It is easy to deduce from (14) that, when the state $|\psi\rangle$ before the
measurement is not a product state, the state vector $|\chi'(2)\rangle$ associated with
system (2) after the measurement depends on the result of the complete
set of measurements made on system (1) [recall that this is not the case
when $|\psi\rangle$ is a product state; cf. comment (ii) of §1]. This result may at first
seem surprising: the state of system (2) after a set of measurements performed
on system (1) does depend on the result of these measurements, even if
system (2), at the moment of measurement, is already very far away from
system (1) and no longer interacting with it. To this "paradox", studied in
detail by certain physicists, are attached the names of Einstein, Podolsky
and Rosen.
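
The dependence described in this comment can be seen directly from formula (14). The sketch below assumes two two-level systems and a complete measurement on system (1) in the basis $\{|u_n(1)\rangle\}$; the function name `state_of_part2_after` and the example states are illustrative choices, not taken from the text.

```python
import math


def state_of_part2_after(psi, n):
    """Components of |chi'(2)> on {|v_k(2)>} after the result a_n, formula (14).

    psi[n][k] is the coefficient <u_n(1) v_k(2)|psi> of the global state.
    """
    row = psi[n]
    norm = math.sqrt(sum(abs(amplitude) ** 2 for amplitude in row))
    if norm == 0.0:
        raise ValueError("the result a_n has zero probability in this state")
    return [amplitude / norm for amplitude in row]


root = 1.0 / math.sqrt(2.0)

# A state that is not a tensor product: (|u_0 v_1> - |u_1 v_0>) / sqrt(2).
correlated = [[0.0, root], [-root, 0.0]]
print(state_of_part2_after(correlated, 0))
print(state_of_part2_after(correlated, 1))

# A product state |phi(1)> (x) |chi(2)> with phi = (0.6, 0.8), chi = (1, i)/sqrt(2).
phi, chi = [0.6, 0.8], [root, 1j * root]
product = [[a * b for b in chi] for a in phi]
print(state_of_part2_after(product, 0))
print(state_of_part2_after(product, 1))
```

For the correlated state the two results leave system (2) in two different (here orthogonal) states, whereas for the product state both results leave it in $|\chi(2)\rangle$.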


References and suggestions for further reading : 

The Einstein-Podolsky-Rosen paradox : see the subsection “ Hidden variables and 
paradoxes” of section 5 of the bibliography; Bohm (5.1), §§22.15 to 22.19; 
d’Espagnat (5.3), chap. 7. 

Photons produced in the decay of positronium : Feynman III (1.2), § 18.3; Dicke 
and Wittke (1.14), chap. 7. 


THE DENSITY OPERATOR


1. Outline of the problem 

2. The concept of a statistical mixture of states 

3. The pure case. Introduction of the density operator 

a. Description by a state vector 

b. Description by a density operator 

c. Properties of the density operator in a pure case 

4. A statistical mixture of states (non-pure case) 

a. Definition of the density operator 

b. General properties of the density operator 

c. Populations; coherences 

5. Applications of the density operator 

a. System in thermodynamic equilibrium 

b. Separate description of part of a physical system. Concept of a partial trace 


1. Outline of the problem 

Until now, we have considered systems whose state is perfectly well known. 
We have shown how to study their time evolution and how to predict the results 
of various measurements performed on them. To determine the state of a system 
at a given instant, it suffices to perform on the system a set of measurements 
corresponding to a C.S.C.O. For example, in the experiment studied in §A-3 of 
chapter I, the polarization state of the photons is perfectly well known when 
the light beam has traversed the polarizer. 

However, in practice, the state of the system is often not perfectly determined.
This is true, for example, of the polarization state of photons coming from
a source of natural (unpolarized) light, and also for the atoms of a beam emitted by
a furnace at temperature $T$, where the atoms' kinetic energy is known only statistically.
The problem posed by the quantum description of such systems is the following:
how can we incorporate into the formalism the incomplete information we possess
about the state of the system, so that our predictions make maximum use of this
partial information? To do this, we shall introduce here a very useful mathematical
tool, the density operator, which facilitates the simultaneous application of the
postulates of quantum mechanics and the results of probability calculations.


2. The concept of a statistical mixture of states 

When one has incomplete information about a system, one typically appeals
to the concept of probability. For example, we know that a photon emitted by
a source of natural light can have any polarization state with equal probability.
Similarly, a system in thermodynamic equilibrium at a temperature $T$ has a
probability proportional to $e^{-E_n/kT}$ of being in a state of energy $E_n$.

More generally, the incomplete information one has about the system usually
presents itself, in quantum mechanics, in the following way: the state of this
system may be either the state $|\psi_1\rangle$ with a probability $p_1$ or the state $|\psi_2\rangle$ with
a probability $p_2$, etc. Obviously:

$$p_1 + p_2 + \dots = \sum_k p_k = 1 \tag{1}$$

We then say that we are dealing with a statistical mixture of states $|\psi_1\rangle$, $|\psi_2\rangle$, ...
with probabilities $p_1$, $p_2$, ...

Now let us see what happens to the predictions concerning the results of
measurements performed on this system. If the state of the system were $|\psi_k\rangle$,
we could use the postulates stated in chapter III to determine the probability of
obtaining one or another measurement result. Since such a possibility (the
state $|\psi_k\rangle$) has a probability of $p_k$, it is clear that the results obtained must be
weighted by the $p_k$ and then summed over the various values of $k$, that is, over
all the states of the statistical mixture.

COMMENTS:

(i) The various states $|\psi_1\rangle$, $|\psi_2\rangle$, ... are not necessarily orthogonal. However,
they can always be chosen normalized; in this complement, we shall assume
that this is the case.

(ii) It must be noted that in the present case, the probabilities intervene on 
two different levels: 

— first, in the initial information about the system (until now, we have 
not introduced probabilities at this stage: we considered the state vector to 
be perfectly well known, in which case all the probabilities p k are zero, 
except one, which is equal to 1); 

— then, when the postulates concerning the measurement are applied 
(leading to probabilistic predictions, even if the initial state of the system 
is perfectly well known). 

There are thus two totally different reasons necessitating the introduction 
of probabilities on these two levels: the incomplete nature of the initial 
information about the state of the system (such situations are also envisaged 
in classical statistical mechanics), and the (specifically quantum mechanical) 
uncertainty related to the measurement process. 

(iii) A system described by a statistical mixture of states (with the probability $p_k$
of the state vector being $|\psi_k\rangle$) must not be confused with a system whose
state $|\psi\rangle$ is a linear superposition of states*:

$$|\psi\rangle = \sum_k c_k\,|\psi_k\rangle \tag{2}$$

It is often said in quantum mechanics, when the state vector is the ket $|\psi\rangle$
given in (2), that "the system has a probability $|c_k|^2$ of being in the state $|\psi_k\rangle$".
If we want to be precise, this must be understood to mean that if we perform
a set of measurements corresponding to a C.S.C.O. which has $|\psi_k\rangle$ as an
eigenvector, the probability of finding the set of eigenvalues associated
with $|\psi_k\rangle$ is $|c_k|^2$. But we have stressed, in §E-1 of chapter III, the fact that
a system in the state $|\psi\rangle$ given in (2) is not simply equivalent to a system having
the probability $|c_1|^2$ of being in the state $|\psi_1\rangle$, $|c_2|^2$ of being in the
state $|\psi_2\rangle$, etc. In fact, for a linear combination of $|\psi_k\rangle$, there exist, in
general, interference effects between these states (due to cross terms of the
type $c_k c_{k'}^*$, obtained when the modulus of the probability amplitudes is
squared) which are very important in quantum mechanics.

We therefore see that it is impossible, in general, to describe a statistical
mixture by an "average state vector" which would be a superposition
of the states $|\psi_k\rangle$. As we indicated earlier, when we take a weighted sum of
probabilities, we can never obtain interference terms between the various
states $|\psi_k\rangle$ of a statistical mixture.


* We assume, in this comment (iii), that the states $|\psi_k\rangle$ are orthonormal. This hypothesis is not
essential but it simplifies the discussion.
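
The difference between a superposition and a statistical mixture shows up as soon as one measures in a basis other than $\{|\psi_k\rangle\}$. The sketch below is an illustrative example, assuming a two-dimensional state space, orthonormal $|\psi_k\rangle$, and hypothetical function names; it compares the probability of finding the state $(|\psi_1\rangle + |\psi_2\rangle)/\sqrt{2}$ in the two situations.

```python
import math


def overlap_probability(target, state):
    """|<target|state>|^2 for two normalized kets given by their components."""
    amplitude = sum(t.conjugate() * s for t, s in zip(target, state))
    return abs(amplitude) ** 2


def superposition_probability(target, coefficients, states):
    """Probability for |psi> = sum_k c_k |psi_k>: amplitudes add, then square."""
    size = len(target)
    psi = [sum(c * st[i] for c, st in zip(coefficients, states))
           for i in range(size)]
    return overlap_probability(target, psi)


def mixture_probability(target, weights, states):
    """Probability for the mixture: probabilities are weighted by p_k and summed."""
    return sum(p * overlap_probability(target, st)
               for p, st in zip(weights, states))


root = 1.0 / math.sqrt(2.0)
states = [[1.0, 0.0], [0.0, 1.0]]   # |psi_1>, |psi_2>
target = [root, root]                # (|psi_1> + |psi_2>) / sqrt(2)

print(superposition_probability(target, [root, root], states))
print(mixture_probability(target, [0.5, 0.5], states))
```

With $p_k = |c_k|^2 = 1/2$ in both cases, the two numbers differ: the cross terms $c_k c_{k'}^*$ contribute only to the superposition.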


3. The pure case. Introduction of the density operator 

To study the behavior of a statistical mixture of states, we have envisaged
one method: calculation of the physical predictions corresponding to a possible
state $|\psi_k\rangle$; weighting the results so obtained by the probability $p_k$ associated with
this state and summation over $k$. Although correct in principle, this method often
leads to clumsy calculations. We have indicated [comment (iii) of §2] that it is impossible
to associate an "average state vector" with the system. Actually, it is an "average
operator" and not an "average vector" which permits a simple description of the
statistical mixture of states: the density operator.

Before studying this general case, we shall begin by again examining the 
simple case where the state of the system is perfectly known (all the probabilities p k 
are zero, except one). The system is then said to be in a pure state. We shall show 
that characterizing the system by its state vector is completely equivalent to 
characterizing it by a certain operator acting in the state space, the density operator. 
The usefulness of this operator will become apparent in §4, where we shall show that 
nearly all the formulas involving this operator, and derived for the pure case, remain 
valid for the description of a statistical mixture of states. 

a. DESCRIPTION BY A STATE VECTOR 

Consider a system whose state vector at the instant $t$ is:

$$|\psi(t)\rangle = \sum_n c_n(t)\,|u_n\rangle \tag{3}$$

where the $\{|u_n\rangle\}$ form an orthonormal basis of the state space, assumed to be
discrete (extension to the case of a continuous basis presents no difficulties). The
coefficients $c_n(t)$ satisfy the relation:

$$\sum_n |c_n(t)|^2 = 1 \tag{4}$$

which expresses the fact that $|\psi(t)\rangle$ is normalized.

If $A$ is an observable, with matrix elements:

$$\langle u_n|A|u_p\rangle = A_{np} \tag{5}$$

the mean value of $A$ at the instant $t$ is:

$$\langle A\rangle(t) = \langle\psi(t)|A|\psi(t)\rangle = \sum_{n,p} c_n^*(t)\,c_p(t)\,A_{np} \tag{6}$$

Finally, the evolution of $|\psi(t)\rangle$ is described by the Schrödinger equation:

$$i\hbar\frac{d}{dt}|\psi(t)\rangle = H(t)\,|\psi(t)\rangle \tag{7}$$

where $H(t)$ is the Hamiltonian of the system.

b. DESCRIPTION BY A DENSITY OPERATOR 

Relation (6) shows that the coefficients $c_n(t)$ enter into the mean values
through quadratic expressions of the type $c_n^*(t)\,c_p(t)$. These are simply the matrix
elements of the operator $|\psi(t)\rangle\langle\psi(t)|$, the projector onto the ket $|\psi(t)\rangle$ (cf. chap. II,
§B-3-b), as can be seen from (3):

$$\langle u_p|\psi(t)\rangle\langle\psi(t)|u_n\rangle = c_n^*(t)\,c_p(t) \tag{8}$$

It is therefore natural to introduce the density operator $\rho(t)$, defined by:

$$\rho(t) = |\psi(t)\rangle\langle\psi(t)| \tag{9}$$

The density operator is represented in the $\{|u_n\rangle\}$ basis by a matrix called the
density matrix whose elements are:

$$\rho_{pn}(t) = \langle u_p|\rho(t)|u_n\rangle = c_n^*(t)\,c_p(t) \tag{10}$$

We are going to show that the specification of $\rho(t)$ suffices to characterize
the quantum state of the system; that is, it enables us to obtain all the physical
predictions that can be calculated from $|\psi(t)\rangle$. To do this, let us write formulas (4),
(6) and (7) in terms of the operator $\rho(t)$. According to (10), relation (4) indicates
that the sum of the diagonal elements of the density matrix is equal to 1:

$$\sum_n |c_n(t)|^2 = \sum_n \rho_{nn}(t) = \mathrm{Tr}\,\rho(t) = 1 \tag{11}$$

In addition, using (5) and (10), formula (6) becomes:

$$\begin{aligned}
\langle A\rangle(t) &= \sum_{n,p} \langle u_p|\rho(t)|u_n\rangle\langle u_n|A|u_p\rangle \\
&= \sum_p \langle u_p|\rho(t)\,A|u_p\rangle \\
&= \mathrm{Tr}\,\{\rho(t)\,A\}
\end{aligned} \tag{12}$$
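
Finally, the time evolution of the operator $\rho(t)$ can be deduced from the Schrödinger
equation (7):

$$\begin{aligned}
\frac{d}{dt}\rho(t) &= \left(\frac{d}{dt}|\psi(t)\rangle\right)\langle\psi(t)| + |\psi(t)\rangle\left(\frac{d}{dt}\langle\psi(t)|\right) \\
&= \frac{1}{i\hbar}\,H(t)\,|\psi(t)\rangle\langle\psi(t)| + \frac{1}{(-i\hbar)}\,|\psi(t)\rangle\langle\psi(t)|\,H(t) \\
&= \frac{1}{i\hbar}\,[H(t), \rho(t)]
\end{aligned} \tag{13}$$

Therefore, in terms of the density operator, conservation of probability
is expressed by:

$$\mathrm{Tr}\,\rho(t) = 1 \tag{14}$$

The mean value of an observable $A$ is calculated using the formula:

$$\langle A\rangle(t) = \mathrm{Tr}\,\{A\,\rho(t)\} = \mathrm{Tr}\,\{\rho(t)\,A\} \tag{15}$$

and the time evolution obeys the equation:

$$i\hbar\frac{d}{dt}\rho(t) = [H(t), \rho(t)] \tag{16}$$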


For completeness, we must also indicate how to calculate from $\rho(t)$ the probabilities
$\mathcal{P}(a_n)$ of the various results $a_n$ which can be obtained in the measurement
of an observable $A$ at time $t$. Actually, formula (15) can be used to do this.
We know [see equation (B-14) of chapter III] that $\mathcal{P}(a_n)$ can be written as a mean
value, that of the projector $P_n$ onto the eigensubspace associated with $a_n$:

$$\mathcal{P}(a_n) = \langle\psi(t)|P_n|\psi(t)\rangle \tag{17}$$

Using (15), we therefore obtain:

$$\mathcal{P}(a_n) = \mathrm{Tr}\,\{P_n\,\rho(t)\} \tag{18}$$


c. PROPERTIES OF THE DENSITY OPERATOR IN A PURE CASE 

In a pure case, a system can be described just as well by a density operator 
as by a state vector. However, the density operator presents a certain number of 
advantages. 

First of all, we see from (9) that two state vectors $|\psi(t)\rangle$ and $e^{i\theta}|\psi(t)\rangle$
(where $\theta$ is a real number), which describe the same physical state, correspond to the
same density operator. Using this operator therefore eliminates the drawbacks
related to the existence of an arbitrary global phase factor for the state vector.
Moreover, we see from (14), (15) and (18) that the formulas using the density
operator are linear with respect to it, while expressions (6) and (17) are quadratic
with respect to $|\psi(t)\rangle$. This is an important property which will be useful
subsequently.

Finally, let us mention some properties of $\rho(t)$, which can be deduced
directly from its definition (9):

$$\rho^\dagger(t) = \rho(t) \tag{19}$$

(the density operator is Hermitian)

$$\rho^2(t) = \rho(t) \tag{20}$$

$$\mathrm{Tr}\,\rho^2(t) = 1 \tag{21}$$

These last two relations, which follow from the fact that $\rho(t)$ is a projector, are
true only in a pure case. We shall see later that they are not valid for a statistical
mixture of states.
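
The pure-case formulas (10) to (12) and (19) to (21) can be checked on a small example. The sketch below assumes a two-dimensional state space and an arbitrary Hermitian matrix standing in for the observable $A$; all function names are illustrative and not part of the text.

```python
import cmath


def density_matrix(psi):
    """rho_pn = c_p c_n^*, formula (10), from the coefficients of a normalized ket."""
    return [[c_p * c_n.conjugate() for c_n in psi] for c_p in psi]


def matmul(a, b):
    size = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(size)) for j in range(size)]
            for i in range(size)]


def trace(m):
    return sum(m[i][i] for i in range(len(m)))


def mean_value_from_ket(op, psi):
    """<psi|A|psi> = sum_{n,p} c_n^* c_p A_np, formula (6)."""
    size = len(psi)
    return sum(psi[n].conjugate() * op[n][p] * psi[p]
               for n in range(size) for p in range(size))


def mean_value_from_rho(op, rho):
    """Tr{rho A}, formula (12)."""
    return trace(matmul(rho, op))


def max_difference(a, b):
    """Largest modulus of the element-wise difference of two square matrices."""
    size = len(a)
    return max(abs(a[i][j] - b[i][j]) for i in range(size) for j in range(size))


psi = [0.6, 0.8 * cmath.exp(0.5j)]              # normalized example ket
observable = [[1.0, 2 - 1j], [2 + 1j, -3.0]]    # arbitrary Hermitian matrix
rho = density_matrix(psi)

print(trace(rho))                                # formula (11)
print(mean_value_from_rho(observable, rho), mean_value_from_ket(observable, psi))
print(max_difference(matmul(rho, rho), rho))     # formula (20)
shifted = [cmath.exp(1.1j) * c for c in psi]     # same state, other global phase
print(max_difference(density_matrix(shifted), rho))
```

The last line illustrates that multiplying the ket by a global phase factor leaves the density matrix unchanged.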


4. A statistical mixture of states (non-pure case) 

a. DEFINITION OF THE DENSITY OPERATOR 

Let us now return to the general case described in §1, and consider a system
for which (at a given instant) the various probabilities $p_1, p_2, \dots, p_k, \dots$ are arbitrary,
on the condition that they satisfy the relations:

$$\begin{cases}
0 \le p_1, p_2, \dots, p_k, \dots \le 1 \\
\sum_k p_k = 1
\end{cases} \tag{22}$$

Under these conditions, how does one calculate the probability $\mathcal{P}(a_n)$ that a
measurement of the observable $A$ will yield the result $a_n$?

Let:

$$\mathcal{P}_k(a_n) = \langle\psi_k|P_n|\psi_k\rangle \tag{23}$$

be the probability of finding $a_n$ if the state vector were $|\psi_k\rangle$. To obtain the desired
probability $\mathcal{P}(a_n)$, one must, as we have already indicated, weight $\mathcal{P}_k(a_n)$ by $p_k$ and
then sum over $k$:

$$\mathcal{P}(a_n) = \sum_k p_k\,\mathcal{P}_k(a_n) \tag{24}$$

Now, from (18), we have:

$$\mathcal{P}_k(a_n) = \mathrm{Tr}\,\{\rho_k\,P_n\} \tag{25}$$

where:

$$\rho_k = |\psi_k\rangle\langle\psi_k| \tag{26}$$

is the density operator corresponding to the state $|\psi_k\rangle$. Substituting (25) into (24),
we have:

$$\begin{aligned}
\mathcal{P}(a_n) &= \sum_k p_k\,\mathrm{Tr}\,\{\rho_k\,P_n\} \\
&= \mathrm{Tr}\,\Big\{\sum_k p_k\,\rho_k\,P_n\Big\} \\
&= \mathrm{Tr}\,\{\rho\,P_n\}
\end{aligned} \tag{27}$$

where we have set:

$$\rho = \sum_k p_k\,\rho_k \tag{28}$$

We therefore see that the linearity of the formulas which use the density
operator enables us to express all physical predictions in terms of $\rho$, the average of
the density operators $\rho_k$; $\rho$ is, by definition, the density operator of the system.

b. GENERAL PROPERTIES OF THE DENSITY OPERATOR 

Since the coefficients $p_k$ are real, $\rho$ is obviously a Hermitian operator like
each of the $\rho_k$.

Let us calculate the trace of $\rho$; it is equal to:

$$\mathrm{Tr}\,\rho = \sum_k p_k\,\mathrm{Tr}\,\rho_k \tag{29}$$

Now, as we saw in §3-b, the trace of $\rho_k$ is always equal to 1; it follows that:

$$\mathrm{Tr}\,\rho = \sum_k p_k = 1 \tag{30}$$

Relation (14) is therefore valid in the general case.

We have already given, in (27), the expression which enables us to calculate
the probability $\mathcal{P}(a_n)$ in terms of $\rho$. Using this expression, we can easily generalize
formula (15) to statistical mixtures:

$$\begin{aligned}
\langle A\rangle &= \sum_n a_n\,\mathcal{P}(a_n) = \mathrm{Tr}\,\Big\{\rho\sum_n a_n\,P_n\Big\} \\
&= \mathrm{Tr}\,\{\rho\,A\}
\end{aligned} \tag{31}$$

[we have used formula (D-36-b) of chapter II].

Now let us calculate the time evolution of the density operator. To do this,
we shall assume that, unlike the state of the system, its Hamiltonian $H(t)$ is perfectly
well known. One can then easily show that if the system at the initial time $t_0$
has the probability $p_k$ of being in the state $|\psi_k\rangle$, then, at a subsequent time $t$, it has
the same probability $p_k$ of being in the state $|\psi_k(t)\rangle$ given by:

$$\begin{cases}
i\hbar\dfrac{d}{dt}|\psi_k(t)\rangle = H(t)\,|\psi_k(t)\rangle \\[2mm]
|\psi_k(t_0)\rangle = |\psi_k\rangle
\end{cases} \tag{32}$$

The density operator at the instant $t$ will then be:

$$\rho(t) = \sum_k p_k\,\rho_k(t) \tag{33}$$

with:

$$\rho_k(t) = |\psi_k(t)\rangle\langle\psi_k(t)| \tag{34}$$

According to (16), $\rho_k(t)$ obeys the evolution equation:

$$i\hbar\frac{d}{dt}\rho_k(t) = [H(t), \rho_k(t)] \tag{35}$$

The linearity of formulas (33) and (35) with respect to $\rho_k(t)$ implies that:

$$i\hbar\frac{d}{dt}\rho(t) = [H(t), \rho(t)] \tag{36}$$

We can therefore generalize to a statistical mixture of states all the equations
of §3, with the exception of (20) and (21). We see that since $\rho$ is no longer
a projector, we have, in general*:

$$\rho^2 \ne \rho \tag{37}$$

and, consequently:

$$\mathrm{Tr}\,\rho^2 \le 1 \tag{38}$$

Nevertheless, only one of the equations, (20) or (21), must be satisfied for us to be sure
that we are dealing with a pure state.

Finally, we see from definition (28) that, for any ket $|u\rangle$, we have:

$$\begin{aligned}
\langle u|\rho|u\rangle &= \sum_k p_k\,\langle u|\rho_k|u\rangle \\
&= \sum_k p_k\,|\langle u|\psi_k\rangle|^2
\end{aligned} \tag{39}$$

and consequently:

$$\langle u|\rho|u\rangle \ge 0 \tag{40}$$

$\rho$ is therefore a positive operator.
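
A small numerical example may help to fix relations (28), (30), (38) and (40). The sketch below assumes a two-dimensional state space and a mixture of two non-orthogonal states with equal weights; the function names are hypothetical.

```python
import math


def mixture_density_matrix(weights, states):
    """rho = sum_k p_k |psi_k><psi_k|, formula (28)."""
    size = len(states[0])
    rho = [[0j] * size for _ in range(size)]
    for p_k, psi in zip(weights, states):
        for i in range(size):
            for j in range(size):
                rho[i][j] += p_k * psi[i] * psi[j].conjugate()
    return rho


def trace(m):
    return sum(m[i][i] for i in range(len(m)))


def purity(rho):
    """Tr{rho^2}; equal to 1 in a pure case, see (21) and (38)."""
    size = len(rho)
    return sum(rho[i][j] * rho[j][i]
               for i in range(size) for j in range(size)).real


def diagonal_element(rho, u):
    """<u|rho|u> for an arbitrary ket |u>, formula (39)."""
    size = len(u)
    return sum(u[i].conjugate() * rho[i][j] * u[j]
               for i in range(size) for j in range(size)).real


root = 1.0 / math.sqrt(2.0)
states = [[1.0, 0.0], [root, root]]   # two normalized, non-orthogonal kets
rho_mixed = mixture_density_matrix([0.5, 0.5], states)
rho_pure = mixture_density_matrix([1.0], [states[1]])

print(trace(rho_mixed).real, purity(rho_mixed))
print(trace(rho_pure).real, purity(rho_pure))
print(diagonal_element(rho_mixed, [root, -root]))
```

In this example the trace is 1 in both cases, while $\mathrm{Tr}\,\rho^2$ drops below 1 only for the genuine mixture.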

c. POPULATIONS; COHERENCES 

What is the physical meaning of the matrix elements $\rho_{np}$ of $\rho$ in the $\{|u_n\rangle\}$
basis?

First, let us consider the diagonal element $\rho_{nn}$. According to (28), we have:

$$\rho_{nn} = \sum_k p_k\,[\rho_k]_{nn} \tag{41}$$

that is, using (26) and introducing the components:

$$c_n^{(k)} = \langle u_n|\psi_k\rangle \tag{42}$$

of $|\psi_k\rangle$ in the $\{|u_n\rangle\}$ basis:

$$\rho_{nn} = \sum_k p_k\,\big|c_n^{(k)}\big|^2 \tag{43}$$

$|c_n^{(k)}|^2$ is a positive real number, whose physical interpretation is the following:
if the state of the system is $|\psi_k\rangle$, it is the probability of finding, in a measurement,
this system in the state $|u_n\rangle$. According to (41), if we take into account the
indeterminacy of the state before the measurement, $\rho_{nn}$ represents the average
probability of finding the system in the state $|u_n\rangle$. For this reason, $\rho_{nn}$ is called the
population of the state $|u_n\rangle$: if the same measurement is carried out $N$ times
under the same initial conditions, where $N$ is a large number, $N\rho_{nn}$ systems will be
found in the state $|u_n\rangle$. It is evident from (43) that $\rho_{nn}$ is a positive real number,
equal to zero only if all the $|c_n^{(k)}|^2$ are zero.

* Assume, for example, that the states $|\psi_k\rangle$ are orthogonal. In an orthonormal basis including
the $|\psi_k\rangle$, $\rho$ is diagonal and its elements are the $p_k$. To obtain $\rho^2$, we simply replace $p_k$ by $p_k^2$. Relations (37)
and (38) then follow from the fact that the $p_k$ are always less than 1 (except in the particular case where
only one of them is non-zero: the pure case).

A calculation analogous to the preceding one gives the following expression
for the non-diagonal element $\rho_{np}$:

$$\rho_{np} = \sum_k p_k\,c_n^{(k)}\,c_p^{(k)*} \tag{44}$$

$c_n^{(k)} c_p^{(k)*}$ is a cross term, of the same type as those studied in §E-1 of chapter III.
It expresses the interference effects between the states $|u_n\rangle$ and $|u_p\rangle$ which can
appear when the state $|\psi_k\rangle$ is a coherent linear superposition of these states.
According to (44), $\rho_{np}$ is the average of these cross terms, taken over all the
possible states of the statistical mixture. In contrast to the populations, $\rho_{np}$ can
be zero even if none of the products $c_n^{(k)} c_p^{(k)*}$ is: while $\rho_{nn}$ is a sum of real
positive (or zero) numbers, $\rho_{np}$ is a sum of complex numbers. If $\rho_{np}$ is zero, this
means that the average (44) has cancelled out any interference effects between $|u_n\rangle$
and $|u_p\rangle$. On the other hand, if $\rho_{np}$ is different from zero, a certain coherence
subsists between these states. This is why the non-diagonal elements of $\rho$ are often
called coherences.


COMMENTS:

(i) The distinction between populations and coherences obviously depends on
the basis $\{|u_n\rangle\}$ chosen in the state space. Since $\rho$ is Hermitian, it is always
possible to find an orthonormal basis $\{|\chi_l\rangle\}$ where $\rho$ is diagonal. $\rho$ can
then be written:

$$\rho = \sum_l \pi_l\,|\chi_l\rangle\langle\chi_l| \tag{45}$$

Since $\rho$ is positive and $\mathrm{Tr}\,\rho = 1$, we have:

$$\begin{cases}
0 \le \pi_l \le 1 \\
\sum_l \pi_l = 1
\end{cases} \tag{46}$$

$\rho$ can thus be considered to describe a statistical mixture of the states $|\chi_l\rangle$ with
the probabilities $\pi_l$ (there are no coherences between the states $|\chi_l\rangle$).

(ii) If the kets $|u_n\rangle$ are eigenvectors of the Hamiltonian $H$, which is assumed to
be time-independent:

$$H|u_n\rangle = E_n|u_n\rangle \tag{47}$$

we obtain directly from (36):

$$i\hbar\frac{d}{dt}\rho_{nn}(t) = 0, \qquad i\hbar\frac{d}{dt}\rho_{np}(t) = (E_n - E_p)\,\rho_{np}(t) \tag{48}$$

that is:

$$\rho_{nn}(t) = \text{constant}, \qquad \rho_{np}(t) = e^{\frac{i}{\hbar}(E_p - E_n)t}\,\rho_{np}(0) \tag{49}$$

The populations are constant, and the coherences oscillate at the Bohr frequencies
of the system.

(iii) Using (40), one can prove the inequality:

$$\rho_{nn}\,\rho_{pp} \ge |\rho_{np}|^2 \tag{50}$$

It follows, for example, that $\rho$ can have coherences only between states whose
populations are not zero.
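
The behaviour described in comments (ii) and (iii) can be checked numerically. The following is a minimal sketch in Python, not part of the original text: the two-level spectrum, the statistical mixture and the choice of units with hbar = 1 are assumptions made only for illustration. It builds the density matrix of a mixture in the eigenbasis of H, lets each state of the mixture evolve, and compares the result with the constant populations and oscillating coherences stated above.

```python
import cmath

HBAR = 1.0  # assumption: units where hbar = 1


def density_matrix(mixture):
    """rho_np = sum_k p_k c_n^(k) c_p^(k)* for a list of (p_k, [c_n^(k)])."""
    dim = len(mixture[0][1])
    return [[sum(p * c[n] * c[q].conjugate() for p, c in mixture)
             for q in range(dim)] for n in range(dim)]


def evolve(mixture, energies, t):
    """In the eigenbasis of H, each c_n^(k) acquires the phase exp(-i E_n t / hbar)."""
    return [(p, [cn * cmath.exp(-1j * E * t / HBAR) for cn, E in zip(c, energies)])
            for p, c in mixture]


# Hypothetical two-level example (values chosen only for illustration).
energies = [0.0, 2.0]
s = 2 ** -0.5
mixture = [(0.7, [s, s]), (0.3, [1.0, 0.0])]

t = 0.4
rho0 = density_matrix(mixture)
rho_t = density_matrix(evolve(mixture, energies, t))

# Coherence predicted by comment (ii): oscillation at the Bohr frequency (E_1 - E_0)/h.
predicted = cmath.exp(1j * (energies[1] - energies[0]) * t / HBAR) * rho0[0][1]
```

In this sketch the populations `rho_t[0][0]` and `rho_t[1][1]` stay equal to their initial values, while the coherence `rho_t[0][1]` only changes by a phase factor, so its modulus remains bounded by the inequality of comment (iii).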


5. Applications of the density operator 

a. SYSTEM IN THERMODYNAMIC EQUILIBRIUM 

The first example we shall consider is borrowed from quantum statistical
mechanics. Consider a system in thermodynamic equilibrium with a reservoir at the
absolute temperature $T$. It can be shown that its density operator is then:

$$\rho = Z^{-1}\, e^{-H/kT} \tag{51}$$

where $H$ is the Hamiltonian operator of the system, $k$ is the Boltzmann constant,
and $Z$ is a normalization coefficient chosen so as to make the trace of $\rho$ equal to 1:

$$Z = \mathrm{Tr}\,\{ e^{-H/kT} \} \tag{52}$$

($Z$ is called the "partition function").

In the $\{|u_n\rangle\}$ basis of eigenvectors of $H$, we have (cf. complement $B_{II}$,
§4-a):

$$\rho_{nn} = Z^{-1}\langle u_n | e^{-H/kT} | u_n \rangle = Z^{-1}\, e^{-E_n/kT} \tag{53}$$

and:

$$\rho_{np} = Z^{-1}\langle u_n | e^{-H/kT} | u_p \rangle = Z^{-1}\, e^{-E_p/kT} \langle u_n | u_p \rangle = 0 \tag{54}$$

At thermodynamic equilibrium, the populations of the stationary states are
exponentially decreasing functions of the energy (the lower the temperature $T$, the
more rapid the decrease), and the coherences between stationary states are zero.
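
As an illustration of (52) and (53), here is a minimal Python sketch that computes the partition function and the equilibrium populations for a discrete spectrum. The three-level spectrum and the two values of kT are hypothetical (energies and kT in the same arbitrary units); the function name `thermal_populations` is introduced here for the example only.

```python
import math


def thermal_populations(energies, kT):
    """Return (Z, [rho_nn]) with rho_nn = exp(-E_n / kT) / Z, as in (52)-(53)."""
    # Shift by the lowest energy for numerical stability; the shift cancels in rho_nn.
    e0 = min(energies)
    weights = [math.exp(-(E - e0) / kT) for E in energies]
    z_shifted = sum(weights)
    populations = [w / z_shifted for w in weights]
    Z = z_shifted * math.exp(-e0 / kT)
    return Z, populations


# Hypothetical three-level spectrum.
energies = [0.0, 1.0, 2.5]
Z_cold, pop_cold = thermal_populations(energies, kT=0.5)
Z_hot, pop_hot = thermal_populations(energies, kT=5.0)
```

The populations decrease with energy in both cases, and the decrease is steeper for the lower temperature: the ground state holds a larger share of the population in `pop_cold` than in `pop_hot`.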

b. SEPARATE DESCRIPTION OF PART OF A PHYSICAL SYSTEM.
CONCEPT OF A PARTIAL TRACE

We are now going to return to the problem mentioned in §3 of complement $D_{III}$. Let us
consider two different systems (1) and (2) and the global system (1) + (2), whose state space
is the tensor product:

$$\mathcal{E} = \mathcal{E}(1) \otimes \mathcal{E}(2) \tag{55}$$

Let $\{|u_n(1)\rangle\}$ be a basis of $\mathcal{E}(1)$ and $\{|v_p(2)\rangle\}$ a basis of $\mathcal{E}(2)$; the kets $|u_n(1)\rangle|v_p(2)\rangle$
form a basis of $\mathcal{E}$.

The density operator $\rho$ of the global system is an operator which acts in $\mathcal{E}$. We saw
in chapter II (cf. §F-2-b) how to extend into $\mathcal{E}$ an operator which acts only in $\mathcal{E}(1)$ [or $\mathcal{E}(2)$].
We are going to show here how to perform the inverse operation: we shall construct
from $\rho$ an operator $\rho(1)$ [or $\rho(2)$] acting only in $\mathcal{E}(1)$ [or $\mathcal{E}(2)$] which will enable us to make
all the physical predictions about measurements bearing only on system (1) or system (2).
This operation will be called a partial trace with respect to (2) [or (1)].

Let us introduce the operator $\rho(1)$ whose matrix elements are:

$$\langle u_n(1)|\rho(1)|u_{n'}(1)\rangle = \sum_p \big(\langle u_n(1)|\langle v_p(2)|\big)\,\rho\,\big(|u_{n'}(1)\rangle|v_p(2)\rangle\big) \tag{56}$$

By definition, $\rho(1)$ is obtained from $\rho$ by performing a partial trace on (2):

$$\rho(1) = \mathrm{Tr}_2\,\rho \tag{57}$$

Similarly, the operator:

$$\rho(2) = \mathrm{Tr}_1\,\rho \tag{58}$$

has matrix elements:

$$\langle v_p(2)|\rho(2)|v_{p'}(2)\rangle = \sum_n \big(\langle u_n(1)|\langle v_p(2)|\big)\,\rho\,\big(|u_n(1)\rangle|v_{p'}(2)\rangle\big) \tag{59}$$

It is clear why these operations are called partial traces. We know that the (total) trace of $\rho$ is:

$$\mathrm{Tr}\,\rho = \sum_n \sum_p \big(\langle u_n(1)|\langle v_p(2)|\big)\,\rho\,\big(|u_n(1)\rangle|v_p(2)\rangle\big) \tag{60}$$

The difference between (60) and (56) [or (59)] is the following: for the partial traces, the indices $n$
and $n'$ (or $p$ and $p'$) are not required to be equal and the summation is performed only over $p$
(or $n$). We have, moreover:

$$\mathrm{Tr}\,\rho = \mathrm{Tr}_1(\mathrm{Tr}_2\,\rho) = \mathrm{Tr}_2(\mathrm{Tr}_1\,\rho) \tag{61}$$

$\rho(1)$ and $\rho(2)$ are therefore, like $\rho$, operators whose trace is equal to 1. It can be verified from
their definitions that they are Hermitian and, in general, that they satisfy all the properties of
a density operator (cf. §4-b).

Now let $A(1)$ be an observable acting in $\mathcal{E}(1)$ and $\tilde{A}(1) = A(1) \otimes \mathbb{1}(2)$ its extension in $\mathcal{E}$.
We obtain, using (31), the definition of the trace, and the closure relation on the $\{|u_n(1)\rangle|v_p(2)\rangle\}$
basis:

$$\begin{aligned}
\langle \tilde{A}(1) \rangle &= \mathrm{Tr}\,\{\rho\,\tilde{A}(1)\} \\
&= \sum_{n,p}\sum_{n',p'} \big(\langle u_n(1)|\langle v_p(2)|\big)\,\rho\,\big(|u_{n'}(1)\rangle|v_{p'}(2)\rangle\big)
 \times \big(\langle u_{n'}(1)|\langle v_{p'}(2)|\big)\,A(1)\otimes\mathbb{1}(2)\,\big(|u_n(1)\rangle|v_p(2)\rangle\big) \\
&= \sum_{n,p,n',p'} \big(\langle u_n(1)|\langle v_p(2)|\big)\,\rho\,\big(|u_{n'}(1)\rangle|v_{p'}(2)\rangle\big)
 \times \langle u_{n'}(1)|A(1)|u_n(1)\rangle\,\langle v_{p'}(2)|v_p(2)\rangle
\end{aligned} \tag{62}$$

Now:

$$\langle v_{p'}(2)|v_p(2)\rangle = \delta_{pp'} \tag{63}$$

We can therefore write (62) in the form:

$$\langle \tilde{A}(1) \rangle = \sum_{n,n'} \Big[\sum_p \langle u_n(1)\,v_p(2)|\,\rho\,|u_{n'}(1)\,v_p(2)\rangle\Big]\,\langle u_{n'}(1)|A(1)|u_n(1)\rangle \tag{64}$$

Inside the brackets on the right-hand side of (64), we recognize the matrix element of $\rho(1)$
defined in (56). We therefore have:

$$\begin{aligned}
\langle \tilde{A}(1) \rangle &= \sum_{n,n'} \langle u_n(1)|\rho(1)|u_{n'}(1)\rangle\,\langle u_{n'}(1)|A(1)|u_n(1)\rangle \\
&= \sum_n \langle u_n(1)|\rho(1)\,A(1)|u_n(1)\rangle \\
&= \mathrm{Tr}\,\{\rho(1)\,A(1)\}
\end{aligned} \tag{65}$$

Let us compare this result with (31). We see that the partial trace $\rho(1)$ enables us to
calculate all the mean values $\langle\tilde{A}(1)\rangle$ as if the system (1) were isolated and had $\rho(1)$ for
a density operator. Making the same comment as for formula (17), we see that $\rho(1)$ also enables
us to obtain the probabilities of all the results of measurements bearing on system (1) alone.
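
The partial trace (56) and the equality (65) lend themselves to a direct numerical check. The following Python sketch is an illustration under stated assumptions: both subsystems are two-level systems, the global ket and the observable `A1` are hypothetical, and the composite basis ket |u_n(1)>|v_p(2)> is stored at index n*d2 + p (a storage convention chosen here, not taken from the text).

```python
def partial_trace_2(rho, d1, d2):
    """rho(1) = Tr_2 rho, formula (56); composite index of |u_n(1)>|v_p(2)> is n*d2 + p."""
    return [[sum(rho[n * d2 + p][m * d2 + p] for p in range(d2))
             for m in range(d1)] for n in range(d1)]


def kron(a, b):
    """Tensor product of two square matrices given as nested lists."""
    da, db = len(a), len(b)
    return [[a[i // db][j // db] * b[i % db][j % db]
             for j in range(da * db)] for i in range(da * db)]


def trace_of_product(a, b):
    """Tr{a b} for square matrices of equal size."""
    n = len(a)
    return sum(a[i][j] * b[j][i] for i in range(n) for j in range(n))


# Hypothetical normalized global ket with real amplitudes (not a product state).
d1 = d2 = 2
psi = [0.6, 0.0, 0.48, 0.64]
rho = [[a * b for b in psi] for a in psi]       # rho = |psi><psi|

A1 = [[1.0, 0.5], [0.5, -1.0]]                  # hypothetical observable of system (1)
identity2 = [[1.0, 0.0], [0.0, 1.0]]
A_ext = kron(A1, identity2)                     # extension A(1) (x) 1(2)

rho1 = partial_trace_2(rho, d1, d2)
mean_global = trace_of_product(rho, A_ext)      # Tr{rho A~(1)}
mean_reduced = trace_of_product(rho1, A1)       # Tr{rho(1) A(1)}
```

The two mean values agree to rounding error, which is the content of (65): the 2 x 2 matrix `rho1` carries everything needed for predictions about system (1) alone.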

COMMENTS : 

(i) We saw in complement $D_{III}$ that it is impossible to assign a state vector to system (1)
[or (2)] when the state of the global system (1) + (2) is not a product state. We now
see that the density operator is a much simpler tool than the state vector. In all
cases (whether the global system is in a product state or not, whether it corresponds to
a pure case or to a statistical mixture), one can always, thanks to the partial trace
operation, assign a density operator to subsystem (1) [or (2)]. This permits us to
calculate all the physical predictions about this subsystem.

(ii) Even if $\rho$ describes a pure state ($\mathrm{Tr}\,\rho^2 = 1$), this is not in general true of the density
operators $\rho(1)$ and $\rho(2)$ obtained from $\rho$ by a partial trace. It can be verified from (56)
[or (59)] that $\mathrm{Tr}\,\{\rho^2(1)\}$ [or $\mathrm{Tr}\,\{\rho^2(2)\}$] is not generally equal to 1. This is another
way of saying that it is not in general possible to assign a state vector to (1) [or (2)], except,
of course, if the global system is in a product state.

(iii) If the global system is in a product state:

$$|\psi\rangle = |\varphi(1)\rangle\,|\chi(2)\rangle \tag{66}$$

we can verify directly that the corresponding density operator is written:

$$\rho = \sigma(1) \otimes \tau(2) \tag{67}$$

with:

$$\sigma(1) = |\varphi(1)\rangle\langle\varphi(1)|, \qquad \tau(2) = |\chi(2)\rangle\langle\chi(2)| \tag{68}$$

More generally, we can envisage states of the global system for which the density
operator $\rho$ can be factored as in (67) [$\sigma(1)$ and $\tau(2)$ can correspond to statistical mixtures
as well as to pure cases]. The partial trace operation then yields:

$$\mathrm{Tr}_2\,\{\sigma(1)\otimes\tau(2)\} = \sigma(1), \qquad \mathrm{Tr}_1\,\{\sigma(1)\otimes\tau(2)\} = \tau(2) \tag{69}$$

An expression such as (67) therefore represents the simple juxtaposition of a system (1),
described by the density operator $\sigma(1)$, and a system (2), described by the density
operator $\tau(2)$.

(iv) Starting with an arbitrary density operator $\rho$ [which cannot be factored as in (67)], let
us calculate $\rho(1) = \mathrm{Tr}_2\,\rho$ and $\rho(2) = \mathrm{Tr}_1\,\rho$. Then let us form the product:

$$\rho' = \rho(1) \otimes \rho(2) \tag{70}$$

Unlike the case envisaged in comment (iii), $\rho'$ is in general different from $\rho$. When the
density operator cannot be factored as in (67), there is therefore a certain "correlation"
between systems (1) and (2), which is no longer contained in the operator $\rho'$ of formula (70).

(v) If the evolution of the global system is described by equation (36), it is in general
impossible to find a Hamiltonian operator relating to system (1) alone which would
enable us to write an analogous equation for $\rho(1)$. While the definition, at any time,
of $\rho(1)$ in terms of $\rho$ is simple, the evolution of $\rho(1)$ is much more difficult to describe.
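
Comments (ii) to (iv) can be illustrated with two global pure states of a pair of two-level systems. In the Python sketch below, both kets are hypothetical examples; the helper names and the index convention n*d2 + p for |u_n(1)>|v_p(2)> are choices made for this illustration. For each state, the sketch computes Tr{rho^2}, Tr{rho^2(1)} and the largest matrix element of rho(1) (x) rho(2) - rho.

```python
def outer(psi):
    """rho = |psi><psi| for a ket with real or complex amplitudes."""
    return [[a * complex(b).conjugate() for b in psi] for a in psi]


def partial_trace(rho, d1, d2, keep):
    """Tr_2 rho if keep == 1, Tr_1 rho if keep == 2 (composite index n*d2 + p)."""
    if keep == 1:
        return [[sum(rho[n * d2 + p][m * d2 + p] for p in range(d2))
                 for m in range(d1)] for n in range(d1)]
    return [[sum(rho[n * d2 + p][n * d2 + q] for n in range(d1))
             for q in range(d2)] for p in range(d2)]


def kron(a, b):
    """Tensor product of two square matrices given as nested lists."""
    da, db = len(a), len(b)
    return [[a[i // db][j // db] * b[i % db][j % db]
             for j in range(da * db)] for i in range(da * db)]


def purity(rho):
    """Tr{rho^2}; equal to 1 only for a pure case."""
    n = len(rho)
    return sum(rho[i][j] * rho[j][i] for i in range(n) for j in range(n)).real


def max_difference(a, b):
    return max(abs(x - y) for ra, rb in zip(a, b) for x, y in zip(ra, rb))


s = 2 ** -0.5
entangled = [s, 0.0, 0.0, s]                      # (|0>|0> + |1>|1>)/sqrt(2)
product = [s * 0.6, s * 0.8, s * 0.6, s * 0.8]    # (|0> + |1>)/sqrt(2) (x) (0.6|0> + 0.8|1>)

results = {}
for name, psi in (("entangled", entangled), ("product", product)):
    rho = outer(psi)
    rho1 = partial_trace(rho, 2, 2, keep=1)
    rho2 = partial_trace(rho, 2, 2, keep=2)
    results[name] = (purity(rho), purity(rho1), max_difference(kron(rho1, rho2), rho))
```

For the product ket, the reduced operator is still pure and rho(1) (x) rho(2) reproduces rho. For the entangled ket, the global state is pure but Tr{rho^2(1)} falls to 1/2, and the product (70) differs from rho: the correlation between the two systems has been lost.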


References and suggestions for further reading: 

Articles by Fano (2.31) and Ter Haar (2.32). Using the density operator to study 
relaxation phenomena : Abragam (14.1), chap. VIII; Slichter (14.2), chap. 5; Sargent, 
Scully and Lamb (15.5), chap. VII. 

Complement $F_{III}$

THE EVOLUTION OPERATOR


1. General properties

2. Case of conservative systems


In §D-1-b of chapter III, we saw that the transformation of $|\psi(t_0)\rangle$ (the state vector
at the initial instant $t_0$) into $|\psi(t)\rangle$ (the state vector at an arbitrary instant) is
linear. There therefore exists a linear operator $U(t, t_0)$ such that:

$$|\psi(t)\rangle = U(t, t_0)\,|\psi(t_0)\rangle \tag{1}$$

We intend to study here the principal properties of $U(t, t_0)$, which is, by
definition, the evolution operator of the system.


1. General properties 

Since the ket $|\psi(t_0)\rangle$ is arbitrary, it is clear from (1) that:

$$U(t_0, t_0) = \mathbb{1} \tag{2}$$

Also, substituting (1) into the Schrödinger equation, we obtain:

$$i\hbar\frac{\partial}{\partial t}U(t, t_0)\,|\psi(t_0)\rangle = H(t)\,U(t, t_0)\,|\psi(t_0)\rangle \tag{3}$$

from which, for the same reason as above:

$$i\hbar\frac{\partial}{\partial t}U(t, t_0) = H(t)\,U(t, t_0) \tag{4}$$

The first-order differential equation (4) completely defines $U(t, t_0)$, taking the
initial condition (2) into account. Note, moreover, that (2) and (4) can be condensed
into a single integral equation:

$$U(t, t_0) = \mathbb{1} - \frac{i}{\hbar}\int_{t_0}^{t} H(t')\,U(t', t_0)\,dt' \tag{5}$$

Now let us consider the parameter $t_0$, which appears in $U(t, t_0)$, as a
variable $t'$, just like $t$. We then write (1) in the form:

$$|\psi(t)\rangle = U(t, t')\,|\psi(t')\rangle \tag{6}$$

But $|\psi(t')\rangle$ can itself be obtained from a formula of the same type:

$$|\psi(t')\rangle = U(t', t'')\,|\psi(t'')\rangle \tag{7}$$
Substitute (7) into (6):

$$|\psi(t)\rangle = U(t, t')\,U(t', t'')\,|\psi(t'')\rangle \tag{8}$$

Since, moreover, $|\psi(t)\rangle = U(t, t'')\,|\psi(t'')\rangle$, we deduce ($|\psi(t'')\rangle$ being arbitrary):

$$U(t, t'') = U(t, t')\,U(t', t'') \tag{9}$$

It is easy to generalize this procedure and to obtain:

$$U(t_n, t_1) = U(t_n, t_{n-1}) \cdots U(t_3, t_2)\,U(t_2, t_1) \tag{10}$$

where $t_1, t_2, \ldots, t_n$ are arbitrary. If we assume that $t_1 < t_2 < t_3 < \ldots < t_n$,
formula (10) is simple to interpret: to go from $t_1$ to $t_n$, the system progresses
from $t_1$ to $t_2$, then from $t_2$ to $t_3$, ..., then, finally, from $t_{n-1}$ to $t_n$.

Set $t'' = t$ in (9); taking (2) into consideration, we obtain:

$$\mathbb{1} = U(t, t')\,U(t', t) \tag{11}$$

or, interchanging the roles of $t$ and $t'$:

$$\mathbb{1} = U(t', t)\,U(t, t') \tag{12}$$

We therefore have:

$$U(t', t) = U^{-1}(t, t') \tag{13}$$

Now let us calculate the evolution operator between two instants separated
by $dt$. To do this, write the Schrödinger equation in the form:

$$d|\psi(t)\rangle = |\psi(t + dt)\rangle - |\psi(t)\rangle = -\frac{i}{\hbar}H(t)\,|\psi(t)\rangle\,dt \tag{14}$$

that is:

$$|\psi(t + dt)\rangle = \Big[\mathbb{1} - \frac{i}{\hbar}H(t)\,dt\Big]\,|\psi(t)\rangle \tag{15}$$

We then obtain, using the very definition of $U(t + dt, t)$:

$$U(t + dt, t) = \mathbb{1} - \frac{i}{\hbar}H(t)\,dt \tag{16}$$

$U(t + dt, t)$ is called the infinitesimal evolution operator. Since $H(t)$ is Hermitian,
$U(t + dt, t)$ is unitary (cf. complement $C_{II}$, §3). It follows that $U(t, t')$ is also
unitary since the interval $[t, t']$ can be divided into a very large number of
infinitesimal intervals. Formula (10) then shows that $U(t, t')$ is a product of
unitary operators; it is therefore a unitary operator. One can consequently
write (13) in the form:

$$U^{\dagger}(t, t') = U^{-1}(t, t') = U(t', t) \tag{17}$$

It is not surprising that the transformation $U(t, t')$ is unitary, that is, that it
conserves the norm of vectors on which it acts. We saw in chapter III (cf. §D-1-c)
that the norm of the state vector does not change over time.


2. Case of conservative systems 


When the operator $H$ does not depend on time, equation (4) can easily be
integrated; taking the initial condition (2) into account, we obtain:

$$U(t, t_0) = e^{-iH(t - t_0)/\hbar} \tag{18}$$

One can verify directly from this formula all the properties of the evolution
operator cited in §1.
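
The verification mentioned above can also be carried out numerically on a small example. The Python sketch below assumes a two-level system whose time-independent Hamiltonian has the hypothetical matrix H = hx*sigma_x + hz*sigma_z, and units where hbar = 1. Since H squared is then proportional to the identity, the exponential (18) has a closed form, and the initial condition, the composition law, unitarity and the inverse relation of §1 can each be tested.

```python
import math

HBAR = 1.0  # assumption: units where hbar = 1


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)] for i in range(2)]


def dagger(a):
    return [[complex(a[j][i]).conjugate() for j in range(2)] for i in range(2)]


def evolution_operator(hx, hz, t, t0):
    """U(t, t0) = exp(-i H (t - t0) / hbar) for H = hx*sigma_x + hz*sigma_z.

    Uses H^2 = (hx^2 + hz^2) 1, so exp(-i H s) = cos(w s) 1 - i sin(w s) H / w.
    """
    w = math.hypot(hx, hz)
    s = (t - t0) / HBAR
    c, k = math.cos(w * s), math.sin(w * s) / w
    return [[c - 1j * k * hz, -1j * k * hx],
            [-1j * k * hx, c + 1j * k * hz]]


def max_difference(a, b):
    return max(abs(a[i][j] - b[i][j]) for i in range(2) for j in range(2))


IDENTITY = [[1, 0], [0, 1]]
hx, hz = 0.7, 1.3                 # hypothetical matrix elements of H
t0, t1, t2 = 0.0, 0.8, 2.1        # arbitrary instants

U20 = evolution_operator(hx, hz, t2, t0)
U21 = evolution_operator(hx, hz, t2, t1)
U10 = evolution_operator(hx, hz, t1, t0)
U02 = evolution_operator(hx, hz, t0, t2)

err_initial = max_difference(evolution_operator(hx, hz, t0, t0), IDENTITY)  # U(t0, t0) = 1
err_composition = max_difference(U20, matmul(U21, U10))   # U(t2,t0) = U(t2,t1) U(t1,t0)
err_unitary = max_difference(matmul(dagger(U20), U20), IDENTITY)
err_inverse = max_difference(dagger(U20), U02)            # adjoint of U(t2,t0) is U(t0,t2)
```

All four error measures vanish to rounding error for this example.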

It is very simple to go from formula (D-52) to (D-54) of chapter III using (18).
It suffices to apply the operator $U(t, t_0)$ to both sides of (D-52), noting that, since
$|\varphi_{n,\tau}\rangle$ is an eigenvector of $H$ with the eigenvalue $E_n$:

$$U(t, t_0)\,|\varphi_{n,\tau}\rangle = e^{-iH(t - t_0)/\hbar}\,|\varphi_{n,\tau}\rangle = e^{-iE_n(t - t_0)/\hbar}\,|\varphi_{n,\tau}\rangle \tag{19}$$


COMMENTS:

(i) When $H$ is time-dependent, one might be tempted to believe, by analogy
with formula (18), that the evolution operator is equal to the operator $V(t, t_0)$
defined by:

$$V(t, t_0) = e^{-\frac{i}{\hbar}\int_{t_0}^{t} H(t')\,dt'} \tag{20}$$

Actually, this is not true, since the derivative of an operator of the form $e^{F(t)}$
is not in general equal to $F'(t)\,e^{F(t)}$ (cf. complement $B_{II}$, §5-c):

$$i\hbar\frac{\partial}{\partial t}V(t, t_0) \neq H(t)\,V(t, t_0) \tag{21}$$


(ii) Let us again consider the experiments described in §E-1-b of chapter III.
As we have already indicated [comment (ii) of §E-1-b-β], it is not necessary
to assume that the measurements of the various observables $A$, $B$ and $C$ are
made very close together in time. When the system has had the time to
evolve between two successive measurements, the variations of the state
vector can easily be taken into account by using the evolution operator.
If $t_0$, $t_1$ and $t_2$ designate respectively the instants at which the measurements
of $A$, $B$ and $C$ are performed, we then replace (E-15) by:

$$\mathcal{P}_a(c) = |\langle v_c|U(t_2, t_0)|u_a\rangle|^2 \tag{22}$$

and (E-17) by:

$$\mathcal{P}_a(b, c) = |\langle v_c|U(t_2, t_1)|w_b\rangle|^2\,|\langle w_b|U(t_1, t_0)|u_a\rangle|^2 \tag{23}$$

We then have, using (9):

$$\langle v_c|U(t_2, t_0)|u_a\rangle = \langle v_c|U(t_2, t_1)\,U(t_1, t_0)|u_a\rangle = \sum_b \langle v_c|U(t_2, t_1)|w_b\rangle\,\langle w_b|U(t_1, t_0)|u_a\rangle \tag{24}$$

Substituting (24) into (22), we see, as in (E-21), that $\mathcal{P}_a(c)$ is not equal
to $\sum_b \mathcal{P}_a(b, c)$.
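
Comment (i) can be made concrete with a Hamiltonian that is piecewise constant in time. The Python sketch below is an illustration under explicit assumptions: a two-level system, units where hbar = 1, and a hypothetical H equal to sigma_z on a first interval and to sigma_x on a second one. The true evolution operator follows from the composition rule (9), and it is compared with the exponential of the time integral of H, as in (20). A second case, where H keeps the same direction on both intervals, is included for contrast.

```python
import math

HBAR = 1.0  # assumption: units where hbar = 1


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)] for i in range(2)]


def exp_minus_i(hx, hz, s):
    """exp(-i (hx*sigma_x + hz*sigma_z) s), using (hx*sigma_x + hz*sigma_z)^2 = (hx^2 + hz^2) 1."""
    w = math.hypot(hx, hz)
    c, k = math.cos(w * s), math.sin(w * s) / w
    return [[c - 1j * k * hz, -1j * k * hx],
            [-1j * k * hx, c + 1j * k * hz]]


def max_difference(a, b):
    return max(abs(a[i][j] - b[i][j]) for i in range(2) for j in range(2))


# Hypothetical piecewise-constant Hamiltonian:
#   H = sigma_z for t0 <= t < t1, then H = sigma_x for t1 <= t <= t2.
t0, t1, t2 = 0.0, 1.0, 2.0

# True evolution operator from the composition rule: U(t2, t0) = U(t2, t1) U(t1, t0).
U = matmul(exp_minus_i(1.0, 0.0, (t2 - t1) / HBAR),
           exp_minus_i(0.0, 1.0, (t1 - t0) / HBAR))

# Candidate V: exponential of the integral of H, i.e. of sigma_z*(t1 - t0) + sigma_x*(t2 - t1).
V = exp_minus_i((t2 - t1) / HBAR, (t1 - t0) / HBAR, 1.0)
discrepancy = max_difference(U, V)

# Contrast: H = sigma_z on both intervals, so H(t) commutes with itself at all times.
U_same = matmul(exp_minus_i(0.0, 1.0, (t2 - t1) / HBAR),
                exp_minus_i(0.0, 1.0, (t1 - t0) / HBAR))
V_same = exp_minus_i(0.0, (t2 - t0) / HBAR, 1.0)
discrepancy_same = max_difference(U_same, V_same)
```

In this example `discrepancy` is of order one, while `discrepancy_same` vanishes: the exponential of the integral reproduces the evolution operator only when the values of H at different times commute.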


References and suggestions for further reading: 

The evolution operator is of fundamental importance in collision theory (see
references for chapter VIII) and time-dependent perturbation theory (see references for
chapter XIII).

Complement $G_{III}$

THE SCHRÖDINGER AND HEISENBERG PICTURES


In the formalism developed in chapter III, it is the time-independent operators
which correspond to the observables of the system (cf. chap. III, §D-1-d). For
example, the position, momentum and kinetic energy operators of a particle do
not depend on time. The evolution of the system is entirely contained in that of the
state vector $|\psi(t)\rangle$ [here written $|\psi_S(t)\rangle$, for reasons which will be evident later]
and is obtained from the Schrödinger equation. This is why this approach is
called the Schrödinger picture.

Nevertheless, we know that all the predictions of quantum mechanics
(probabilities, mean values) are expressed in terms of scalar products of a bra and a ket
or matrix elements of operators. Now, as we saw in complement $C_{II}$, these
quantities are invariant when the same unitary transformation is performed on the
kets and on the operators. This transformation can be chosen so as to make the
transform of the ket $|\psi_S(t)\rangle$ a time-independent ket. Of course, the transforms of
the observables cited above then depend on time. We thus obtain the Heisenberg
picture.

To avoid confusion, in this complement, we shall systematically assign an
index $S$ to the kets and operators of the Schrödinger picture and an index $H$ to those
of the Heisenberg picture. Since the latter picture is used only in this complement,
the index $S$ can be considered to be implicit in all the other complements and
chapters.

The state vector $|\psi_S(t)\rangle$ at the instant $t$ is expressed in terms of $|\psi_S(t_0)\rangle$ by
the relation:

$$|\psi_S(t)\rangle = U(t, t_0)\,|\psi_S(t_0)\rangle \tag{1}$$

where $U(t, t_0)$ is the evolution operator (cf. complement $F_{III}$). Since this operator
is unitary, it is sufficient to perform the unitary transformation associated with
the operator $U^{\dagger}(t, t_0)$ to obtain a constant transformed vector $|\psi_H\rangle$:

$$|\psi_H\rangle = U^{\dagger}(t, t_0)\,|\psi_S(t)\rangle = U^{\dagger}(t, t_0)\,U(t, t_0)\,|\psi_S(t_0)\rangle = |\psi_S(t_0)\rangle \tag{2}$$

In the Heisenberg picture, the state vector, which is constant, is therefore equal
to $|\psi_S(t)\rangle$ at time $t_0$.

The transform $A_H(t)$ of an operator $A_S(t)$ is given by (complement $C_{II}$, §2):

$$A_H(t) = U^{\dagger}(t, t_0)\,A_S(t)\,U(t, t_0) \tag{3}$$

As we have already seen, $A_H(t)$ generally depends on time, even if $A_S$ does not.

Nevertheless, there exists an interesting special case in which, if $A_S$ is
time-independent, the same is true of $A_H$: the case in which the system is conservative
($H_S$ does not depend on time) and $A_S$ commutes with $H_S$ ($A_S$ is then a constant of
the motion; cf. chap. III, §D-2-c). In this case, we have:

$$U(t, t_0) = e^{-iH_S(t - t_0)/\hbar} \tag{4}$$
If the operator $A_S$ commutes with $H_S$, it also commutes with $U(t, t_0)$ (cf.
complement $B_{II}$, §4-c), so that:

$$A_H(t) = U^{\dagger}(t, t_0)\,U(t, t_0)\,A_S = A_S \tag{5}$$

The operators $A_S$ and $A_H$ are therefore simply equal in this case (in particular,
$H_S = H_H$, and the indices $S$ and $H$ are, in reality, unnecessary for the Hamiltonian).
Since they are time-independent, we see that they indeed correspond to a constant
of the motion.

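When $A_S(t)$ is arbitrary, let us calculate the evolution of the operator $A_H(t)$.
Using relation (4) of complement $F_{III}$, as well as its adjoint, we obtain:

$$\frac{d}{dt}A_H(t) = -\frac{1}{i\hbar}\,U^{\dagger}(t, t_0)\,H_S(t)\,A_S(t)\,U(t, t_0) + U^{\dagger}(t, t_0)\,\frac{dA_S(t)}{dt}\,U(t, t_0) + \frac{1}{i\hbar}\,U^{\dagger}(t, t_0)\,A_S(t)\,H_S(t)\,U(t, t_0) \tag{6}$$

In the first and last terms of this expression, let us insert between $A_S$ and $H_S$ the
product $U(t, t_0)\,U^{\dagger}(t, t_0)$, which is equal to the identity operator [formula (17)
of complement $F_{III}$]:

$$\frac{d}{dt}A_H(t) = -\frac{1}{i\hbar}\,U^{\dagger}(t, t_0)\,H_S(t)\,U(t, t_0)\,U^{\dagger}(t, t_0)\,A_S(t)\,U(t, t_0) + U^{\dagger}(t, t_0)\,\frac{dA_S(t)}{dt}\,U(t, t_0) + \frac{1}{i\hbar}\,U^{\dagger}(t, t_0)\,A_S(t)\,U(t, t_0)\,U^{\dagger}(t, t_0)\,H_S(t)\,U(t, t_0) \tag{7}$$

According to definition (3), we finally obtain:

$$i\hbar\frac{d}{dt}A_H(t) = [A_H(t), H_H(t)] + i\hbar\Big(\frac{dA_S(t)}{dt}\Big)_H \tag{8}$$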
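
The evolution equation just obtained can be tested numerically in a simple case. The Python sketch below assumes a conservative two-level system with a hypothetical Hamiltonian H_S = HX*sigma_x + HZ*sigma_z, a hypothetical time-independent observable A_S, and units where hbar = 1. The time derivative of A_H(t) is estimated by a central finite difference and compared with the commutator [A_H(t), H_H(t)]; the term in dA_S/dt vanishes here because A_S does not depend on time.

```python
import math

HBAR = 1.0            # assumption: units where hbar = 1
HX, HZ = 0.7, 1.3     # hypothetical matrix elements of a time-independent H_S

H_S = [[HZ, HX], [HX, -HZ]]
A_S = [[0.3, 1.0], [1.0, -0.2]]   # hypothetical time-independent observable


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)] for i in range(2)]


def dagger(a):
    return [[complex(a[j][i]).conjugate() for j in range(2)] for i in range(2)]


def evolution_operator(t, t0=0.0):
    """U(t, t0) = exp(-i H_S (t - t0) / hbar), in closed form since H_S^2 is proportional to 1."""
    w = math.hypot(HX, HZ)
    s = (t - t0) / HBAR
    c, k = math.cos(w * s), math.sin(w * s) / w
    return [[c - 1j * k * HZ, -1j * k * HX],
            [-1j * k * HX, c + 1j * k * HZ]]


def heisenberg(op, t):
    """Heisenberg transform U^dagger(t, t0) op U(t, t0) of a Schrodinger-picture operator."""
    u = evolution_operator(t)
    return matmul(dagger(u), matmul(op, u))


t, dt = 0.9, 1e-5
A_plus, A_minus = heisenberg(A_S, t + dt), heisenberg(A_S, t - dt)
lhs = [[1j * HBAR * (A_plus[i][j] - A_minus[i][j]) / (2 * dt) for j in range(2)]
       for i in range(2)]                                    # i hbar dA_H/dt (finite difference)

A_H, H_H = heisenberg(A_S, t), heisenberg(H_S, t)
AH, HA = matmul(A_H, H_H), matmul(H_H, A_H)
rhs = [[AH[i][j] - HA[i][j] for j in range(2)] for i in range(2)]   # [A_H(t), H_H(t)]

residual = max(abs(lhs[i][j] - rhs[i][j]) for i in range(2) for j in range(2))
hamiltonian_shift = max(abs(H_H[i][j] - H_S[i][j]) for i in range(2) for j in range(2))
```

The residual is limited only by the finite-difference step, and `hamiltonian_shift` vanishes, in agreement with the remark that H_H = H_S for a conservative system.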


COMMENTS : 

(i) Historically, the first picture was developed by Schrödinger, leading him
to the equation which bears his name, and the second one, by Heisenberg, who
calculated the evolution of matrices representing the various operators $A_H(t)$
(hence the name "matrix mechanics"). It was not until later that the
equivalence of the two approaches was proved.

(ii) Using (8), one immediately obtains equation (D-27) of chapter III as we shall
now show. In the Heisenberg picture, the evolution of the mean value

$$\langle A \rangle(t) = \langle \psi_S(t)|A_S(t)|\psi_S(t)\rangle$$

can be calculated, since:

$$\langle A \rangle(t) = \langle \psi_H|A_H(t)|\psi_H\rangle \tag{9}$$

On the right-hand side of (9), only $A_H(t)$ depends on time, so (D-27) can be
obtained directly by differentiation. Note, nevertheless, that equation (8)
is more general than (D-27) since, instead of expressing the equality of
two mean values (that is, two matrix elements of operators), it expresses
the equality of two operators.

(iii) When the system under consideration is composed of a particle of mass $m$
under the influence of a potential, equation (8) becomes very simple. We then
have (confining ourselves to one dimension):

$$H_S(t) = \frac{P_S^2}{2m} + V(X_S, t) \tag{10}$$

and therefore [cf. formula (35) of complement $C_{II}$]:

$$H_H(t) = \frac{P_H^2}{2m} + V(X_H, t) \tag{11}$$

Substituting (11) into (8) and using the fact that $[X_H, P_H] = [X_S, P_S] = i\hbar$,
we obtain, by an argument analogous to that of §D-1-d of chapter III:

$$\frac{d}{dt}X_H(t) = \frac{1}{m}P_H(t), \qquad \frac{d}{dt}P_H(t) = -\frac{\partial V}{\partial X}(X_H, t) \tag{12}$$

These equations generalize the Ehrenfest theorem [cf. chap. III, relations (D-34)
and (D-35)]. They are similar to those which give the evolution of the classical
quantities $x$ and $p$ [cf. chap. III, relations (D-36-a) and (D-36-b)]. An
advantage of the Heisenberg picture is therefore that it leads to equations which
are formally similar to those of classical mechanics.


References and suggestions for further reading: 

The interaction picture: Messiah (1.17), chap. VIII, §14; Schiff (1.18), §24; 
Merzbacher (1.16), chap. 18, §7. 

Complement $H_{III}$

GAUGE INVARIANCE


1. Outline of the problem: scalar and vector potentials associated with an electromagnetic field; concept
of a gauge

2. Gauge invariance in classical mechanics

a. Newton's equations

b. The Hamiltonian formalism

α. The dynamical variables of the system and their evolution
β. "True physical quantities" and "non-physical quantities"

3. Gauge invariance in quantum mechanics

a. Quantization rules

b. Unitary transformation of the state vector; form invariance of the Schrödinger equation

α. The unitary operator $T_\chi(t)$
β. Time evolution of the state vector

c. Invariance of physical predictions under a gauge transformation

α. Behavior of observables
β. Probability of the various possible results of a measurement bearing on a true physical quantity
γ. Probability density and current


1. Outline of the problem: scalar and vector potentials associated 
with an electromagnetic field ; concept of a gauge 


Consider an electromagnetic field, characterized by the values $\mathbf{E}(\mathbf{r}; t)$ of the
electric field and $\mathbf{B}(\mathbf{r}; t)$ of the magnetic field at every instant and at all points in
space: $\mathbf{E}(\mathbf{r}; t)$ and $\mathbf{B}(\mathbf{r}; t)$ are not independent since they satisfy Maxwell's
equations. Instead of specifying these two vector fields, it is possible to introduce a
scalar potential $U(\mathbf{r}; t)$ and a vector potential $\mathbf{A}(\mathbf{r}; t)$ such that:

$$\mathbf{E}(\mathbf{r}; t) = -\nabla U(\mathbf{r}; t) - \frac{\partial}{\partial t}\mathbf{A}(\mathbf{r}; t), \qquad \mathbf{B}(\mathbf{r}; t) = \nabla \times \mathbf{A}(\mathbf{r}; t) \tag{1}$$

It can be shown from Maxwell's equations (cf. appendix III, §4-b-α) that there
always exist functions $U(\mathbf{r}; t)$ and $\mathbf{A}(\mathbf{r}; t)$ which allow $\mathbf{E}(\mathbf{r}; t)$ and $\mathbf{B}(\mathbf{r}; t)$ to be
expressed in the form (1). All electromagnetic fields can therefore be described by
scalar and vector potentials. However, when $\mathbf{E}(\mathbf{r}; t)$ and $\mathbf{B}(\mathbf{r}; t)$ are given, $U(\mathbf{r}; t)$
and $\mathbf{A}(\mathbf{r}; t)$ are not uniquely determined. It can easily be verified that if we have
a set of possible values for $U(\mathbf{r}; t)$ and $\mathbf{A}(\mathbf{r}; t)$, we obtain other potentials $U'(\mathbf{r}; t)$
and $\mathbf{A}'(\mathbf{r}; t)$ which describe the same electromagnetic field by the transformation:

$$U'(\mathbf{r}; t) = U(\mathbf{r}; t) - \frac{\partial}{\partial t}\chi(\mathbf{r}; t), \qquad \mathbf{A}'(\mathbf{r}; t) = \mathbf{A}(\mathbf{r}; t) + \nabla\chi(\mathbf{r}; t) \tag{2}$$

where $\chi(\mathbf{r}; t)$ is an arbitrary function of $\mathbf{r}$ and $t$. This can be seen by replacing $U(\mathbf{r}; t)$
by $U'(\mathbf{r}; t)$ and $\mathbf{A}(\mathbf{r}; t)$ by $\mathbf{A}'(\mathbf{r}; t)$ in (1) and verifying that $\mathbf{E}(\mathbf{r}; t)$ and $\mathbf{B}(\mathbf{r}; t)$ remain
unchanged. Moreover, it can be shown that relations (2) give all the possible scalar
and vector potentials associated with a given electromagnetic field.
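
The invariance of the fields under the transformation (2) can be checked numerically. The Python sketch below is only an illustration: the potentials `U` and `A`, the gauge function chi(r; t) = x y t + t^2 sin z, and the evaluation point are all hypothetical, and the fields are obtained from the potentials by central finite differences rather than analytically.

```python
import math

H_STEP = 1e-4  # finite-difference step (an assumption of this sketch)


def shifted(r, i, h):
    s = list(r)
    s[i] += h
    return s


def fields(U, A, r, t):
    """E = -grad U - dA/dt and B = curl A, estimated by central differences."""
    grad_U = [(U(shifted(r, i, H_STEP), t) - U(shifted(r, i, -H_STEP), t)) / (2 * H_STEP)
              for i in range(3)]
    dA_dt = [(a - b) / (2 * H_STEP) for a, b in zip(A(r, t + H_STEP), A(r, t - H_STEP))]
    # jac[i][j] = dA_j / dx_i
    jac = [[(a - b) / (2 * H_STEP)
            for a, b in zip(A(shifted(r, i, H_STEP), t), A(shifted(r, i, -H_STEP), t))]
           for i in range(3)]
    E = [-grad_U[i] - dA_dt[i] for i in range(3)]
    B = [jac[1][2] - jac[2][1], jac[2][0] - jac[0][2], jac[0][1] - jac[1][0]]
    return E, B


# Hypothetical potentials in a first gauge.
def U(r, t):
    return r[0] ** 2 * t - r[1] * r[2]


def A(r, t):
    return [-0.5 * r[1] * t, 0.5 * r[0], math.sin(r[2]) * t]


# Hypothetical gauge function chi = x*y*t + t^2*sin(z), with its derivatives written by hand.
def dchi_dt(r, t):
    return r[0] * r[1] + 2 * t * math.sin(r[2])


def grad_chi(r, t):
    return [r[1] * t, r[0] * t, t ** 2 * math.cos(r[2])]


# Transformed potentials, following relations (2).
def U_prime(r, t):
    return U(r, t) - dchi_dt(r, t)


def A_prime(r, t):
    return [a + g for a, g in zip(A(r, t), grad_chi(r, t))]


r, t = [0.3, -0.7, 1.1], 0.5
E, B = fields(U, A, r, t)
E_prime, B_prime = fields(U_prime, A_prime, r, t)

max_dE = max(abs(a - b) for a, b in zip(E, E_prime))
max_dB = max(abs(a - b) for a, b in zip(B, B_prime))
max_dA = max(abs(a - b) for a, b in zip(A(r, t), A_prime(r, t)))
```

The potentials differ appreciably at the chosen point (`max_dA` is of order one), while `max_dE` and `max_dB` are zero to within the finite-difference error.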

When a particular set of potentials has been chosen to describe an
electromagnetic field, a choice of gauge is said to have been made. As we have just said,
an infinite number of different gauges can be used for the same field, characterized
by $\mathbf{E}(\mathbf{r}; t)$ and $\mathbf{B}(\mathbf{r}; t)$. When one changes from one to another, one is said to
perform a gauge transformation.

It often happens in physics that the equations of motion of a system involve,
not the fields $\mathbf{E}(\mathbf{r}; t)$ and $\mathbf{B}(\mathbf{r}; t)$, but the potentials $U(\mathbf{r}; t)$ and $\mathbf{A}(\mathbf{r}; t)$. We saw an
example of this in §B-5-b of chapter III, when we wrote the Schrödinger equation
for a particle of charge $q$ in an electromagnetic field [cf. relation (B-48) of that
chapter]. The following question can then be posed: do the physical results predicted
by the theory depend only on the values of the fields $\mathbf{E}(\mathbf{r}; t)$ and $\mathbf{B}(\mathbf{r}; t)$ at all points
in space, or do they also depend on the gauge used to write the equations? In the
latter case, it would obviously be necessary, in order for the theory to make sense,
to specify in which gauge the equations are valid.

The aim of this complement is to answer this question. We shall see that in
classical mechanics (§2), as in quantum mechanics (§3), physical results are not
modified when a gauge transformation is performed. The scalar and vector
potentials can then be seen to be calculation tools; actually, all that counts are
the values of the electric and magnetic fields at all points in space. We shall express
this result by saying that classical and quantum mechanics possess the property
of gauge invariance.


2. Gauge invariance in classical mechanics 

a. NEWTON'S EQUATIONS 

In classical mechanics, the motion of a particle* of charge $q$ and mass $m$
placed in an electromagnetic field can be calculated from the force $\mathbf{f}$ exerted on it.
This force is given by Lorentz' law:

$$\mathbf{f} = q\,[\mathbf{E}(\mathbf{r}; t) + \mathbf{v} \times \mathbf{B}(\mathbf{r}; t)] \tag{3}$$

where $\mathbf{v}$ is the velocity of the particle. To obtain the equations of motion which
allow one to calculate the position $\mathbf{r}(t)$ of the particle at any instant $t$, one substitutes
this equation into the fundamental dynamical equation (Newton's law):

$$m\frac{d^2}{dt^2}\mathbf{r}(t) = \mathbf{f} \tag{4}$$

In this approach, only the values of the electric and magnetic fields enter into
the calculation; therefore, the problem of gauge invariance does not arise.


* For simplicity, we shall assume in this complement that the system under study is composed
of a single particle. Generalization to a more complex system formed by several particles placed in an
electromagnetic field presents no difficulties.

b. THE HAMILTONIAN FORMALISM


Instead of adopting the point of view of the preceding section, one can use
other equations of motion, the Hamilton-Jacobi equations. It is not difficult to
show (cf. appendix III) that the latter equations are completely equivalent to
Newton's equations. However, since we used the Hamiltonian formalism in
chapter III to quantize a physical system, it is useful to study how a gauge
transformation appears in this formalism. Although the scalar and vector potentials
do not enter into Newton's equations, they are indispensable for writing those of
Hamilton. The property of gauge invariance is therefore less obvious for this second
point of view.

α. The dynamical variables of the system and their evolution

To determine the motion of a particle subject to the Lorentz force written
in (3), one can use the Lagrangian*:

$$\mathcal{L}(\mathbf{r}, \mathbf{v}; t) = \frac{1}{2}m\mathbf{v}^2 - q\,[U(\mathbf{r}; t) - \mathbf{v}\cdot\mathbf{A}(\mathbf{r}; t)] \tag{5}$$

This expression permits the calculation of the momentum $\mathbf{p}$, which is written:

$$\mathbf{p} = \nabla_{\mathbf{v}}\,\mathcal{L}(\mathbf{r}, \mathbf{v}; t) = m\mathbf{v} + q\mathbf{A}(\mathbf{r}; t) \tag{6}$$

It is then possible to introduce the classical Hamiltonian:

$$\mathcal{H}(\mathbf{r}, \mathbf{p}; t) = \frac{1}{2m}[\mathbf{p} - q\mathbf{A}(\mathbf{r}; t)]^2 + qU(\mathbf{r}; t) \tag{7}$$

In the Hamiltonian formalism, the state of the particle at a given time is defined by
its position $\mathbf{r}$ and its momentum $\mathbf{p}$, which we shall call the fundamental dynamical
variables, and no longer by its position and its velocity, as in §a above (and as
in the Lagrange point of view). The momentum $\mathbf{p}$ (conjugate momentum of the
position $\mathbf{r}$) must not be confused with the mechanical momentum $\boldsymbol{\pi}$:

$$\boldsymbol{\pi} = m\mathbf{v} \tag{8}$$

They are indeed different since, according to (6):

$$\boldsymbol{\pi} = \mathbf{p} - q\mathbf{A}(\mathbf{r}; t) \tag{9}$$

This relation allows us to calculate the mechanical momentum (and therefore the
velocity) whenever the values of $\mathbf{r}$ and $\mathbf{p}$ are known. Similarly, all the other
quantities associated with the particle (kinetic energy, angular momentum, etc.)
are expressed in the Hamiltonian formalism as functions of the fundamental
dynamical variables $\mathbf{r}$ and $\mathbf{p}$ (and, if necessary, of time).

The evolution of the system is governed by Hamilton's equations:

$$\frac{d}{dt}\mathbf{r}(t) = \nabla_{\mathbf{p}}\,\mathcal{H}[\mathbf{r}(t), \mathbf{p}(t); t], \qquad \frac{d}{dt}\mathbf{p}(t) = -\nabla_{\mathbf{r}}\,\mathcal{H}[\mathbf{r}(t), \mathbf{p}(t); t] \tag{10}$$

where $\mathcal{H}$ is the function of $\mathbf{r}$ and $\mathbf{p}$ written in (7). These equations give the values,
for all times, of the fundamental dynamical variables if they are known at the initial
instant.


* We state without proof a certain number of results of analytical mechanics which are
established in appendix III.

To write equations (10), it is necessary to choose a gauge $\mathcal{J}$, that is, a pair
of potentials $\{U(\mathbf{r}; t), \mathbf{A}(\mathbf{r}; t)\}$ describing the electromagnetic field. What happens
if, instead of this gauge $\mathcal{J}$, we choose another one $\mathcal{J}'$, characterized by different
potentials $U'(\mathbf{r}; t)$ and $\mathbf{A}'(\mathbf{r}; t)$, but describing the same fields $\mathbf{E}(\mathbf{r}; t)$ and $\mathbf{B}(\mathbf{r}; t)$?
We shall label with a prime the values of the dynamical variables associated with
the motion of the particle when the gauge chosen is $\mathcal{J}'$. As we pointed out in §a,
Newton's equations indicate that the position $\mathbf{r}$ and the velocity $\mathbf{v}$ take on, at
every instant, values independent of the gauge. Consequently, we have:

$$\mathbf{r}'(t) = \mathbf{r}(t) \tag{11-a}$$

$$\boldsymbol{\pi}'(t) = \boldsymbol{\pi}(t) \tag{11-b}$$

Now, from (9):

$$\boldsymbol{\pi}(t) = \mathbf{p}(t) - q\mathbf{A}[\mathbf{r}(t); t], \qquad \boldsymbol{\pi}'(t) = \mathbf{p}'(t) - q\mathbf{A}'[\mathbf{r}'(t); t] \tag{12}$$

Therefore, the values $\mathbf{p}(t)$ and $\mathbf{p}'(t)$ of the momentum in the gauges $\mathcal{J}$ and $\mathcal{J}'$ are
different; they must satisfy:

$$\mathbf{p}'(t) - q\mathbf{A}'[\mathbf{r}'(t); t] = \mathbf{p}(t) - q\mathbf{A}[\mathbf{r}(t); t] \tag{13}$$

If $\chi(\mathbf{r}; t)$ is the function appearing in formulas (2) which govern the gauge
transformation from $\mathcal{J}$ to $\mathcal{J}'$, the values of the fundamental dynamical variables are
transformed according to the formulas:

$$\mathbf{r}'(t) = \mathbf{r}(t) \tag{14-a}$$

$$\mathbf{p}'(t) = \mathbf{p}(t) + q\nabla\chi[\mathbf{r}(t); t] \tag{14-b}$$

In the Hamiltonian formalism, the value at each instant of the dynamical variables
describing a given motion depends on the gauge chosen. Moreover, such a result is not
surprising since, in (7) and (10), the scalar and vector potentials appear explicitly
in the equations of motion for the position and momentum.
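
The transformation law (14) can be observed on a trajectory computed in two gauges. The Python sketch below makes several assumptions that do not come from the text: a particle moving in the xy plane in a uniform magnetic field along z with U = 0, the values of `Q`, `M` and `B`, the two vector potentials (0, Bx) and (-By/2, Bx/2), which are related by chi = -Bxy/2, and a fourth-order Runge-Kutta integration of Hamilton's equations (10).

```python
Q, M, B = 1.0, 1.0, 1.0   # hypothetical charge, mass and uniform field along z; U = 0


# Each gauge returns (A, dA) at the point (x, y), with dA[i][j] = dA_j / dx_i.
def gauge_1(x, y):
    return (0.0, B * x), ((0.0, B), (0.0, 0.0))


def gauge_2(x, y):
    # A' = A + grad(chi) with chi = -B*x*y/2, as in relations (2)
    return (-0.5 * B * y, 0.5 * B * x), ((0.0, 0.5 * B), (-0.5 * B, 0.0))


def derivative(state, gauge):
    """Hamilton's equations (10) for H = [p - qA(r)]^2 / 2m."""
    x, y, px, py = state
    A, dA = gauge(x, y)
    pix, piy = px - Q * A[0], py - Q * A[1]           # mechanical momentum, relation (9)
    return (pix / M, piy / M,
            Q / M * (pix * dA[0][0] + piy * dA[0][1]),
            Q / M * (pix * dA[1][0] + piy * dA[1][1]))


def rk4(state, gauge, dt, steps):
    for _ in range(steps):
        k1 = derivative(state, gauge)
        k2 = derivative(tuple(s + 0.5 * dt * k for s, k in zip(state, k1)), gauge)
        k3 = derivative(tuple(s + 0.5 * dt * k for s, k in zip(state, k2)), gauge)
        k4 = derivative(tuple(s + dt * k for s, k in zip(state, k3)), gauge)
        state = tuple(s + dt / 6 * (a + 2 * b + 2 * c + d)
                      for s, a, b, c, d in zip(state, k1, k2, k3, k4))
    return state


def initial_state(r0, v0, gauge):
    A, _ = gauge(*r0)
    return (r0[0], r0[1], M * v0[0] + Q * A[0], M * v0[1] + Q * A[1])   # p = m v + q A


r0, v0 = (1.0, 0.5), (0.3, -0.2)        # same initial position and velocity in both gauges
final_1 = rk4(initial_state(r0, v0, gauge_1), gauge_1, 1e-3, 2000)
final_2 = rk4(initial_state(r0, v0, gauge_2), gauge_2, 1e-3, 2000)
```

In this sketch the final positions in `final_1` and `final_2` coincide, whereas the final momenta differ by q times the gradient of chi evaluated at the final position, that is, by Q*(-B*y/2, -B*x/2).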

β. "True physical quantities" and "non-physical quantities"

(i) DEFINITIONS

We have just seen, in relations (14) for example, that it is possible to
distinguish between two types of quantities associated with the particle: those which,
like $\mathbf{r}$ or $\boldsymbol{\pi}$, have identical values at all times in two different gauges, and those
which, like $\mathbf{p}$, have values which depend on the arbitrarily chosen gauge. We are
thus led to the following general definition:

- A true physical quantity associated with the system under consideration
is a quantity whose value at any time does not depend (for a given motion of
the system) on the gauge used to describe the electromagnetic field.

- A non-physical quantity, on the other hand, is a quantity whose value
is modified by a gauge transformation; thus, like the scalar and vector potentials,
it is seen to be a calculation tool, rather than an actually observable quantity.


The problem then posed is the following: in the Hamiltonian formalism,
all the quantities associated with the system appear in the form of functions of
the fundamental dynamical variables $\mathbf{r}$ and $\mathbf{p}$; how can we know whether such
a function corresponds to a true physical quantity or not?


(ii) CHARACTERISTIC RELATION OF TRUE PHYSICAL QUANTITIES 

Let us first assume that a quantity associated with the particle is described, 
in the gauge /, by a function of r and p (which may depend on time) which 
we shall denote by .^(r, p; t). If to this quantity corresponds, in another 
gauge the same function J^r', p'; /), the quantity is clearly not truly physical 
[except in the special case where the function 3* depends only on r and not on p; 
see equations (14)]. Since the values of the momentum are different in the two 
gauges # and ^the same is obviously true for the values of the function &. 

To obtain the true physical quantities associated with the system, we must 
therefore consider functions ^/(r, p; t) whose form depends on the gauge chosen 
(this is why we label these functions with an index /). We have already seen 
an example of such a function: the mechanical momentum it is a function of r 
and p through the intermediate of the vector potential A [c/. (9)]. In this case, 
the function is different in the two gauges f and that is, it is of the form 
n/(r, p; t). The definition given in (i) thus implies that the function */(r, p; 0 
describes a true physical quantity on the condition that: 

[i-(0> P<0; f] = %/ [r '(r), P'(t); t] (15) 

where r(t) and p(r) are the values taken on by the position and momentum 
in the gauge /, and r'(/) and p'(t) are their values in the gauge If we 
substitute relations (14) into (15), we obtain: 

[r(t)» P(0: f ] = 3> [ r ( f )’ P(0 + 9 Vx(r(t); t); t] (16) 

where this relation must be satisfied at every instant t and for all possible 
motions of the system. Since, when t is fixed, the values of the position and the 
momentum can be chosen independently, both sides of (16) must in fact be 
the same function of r and p, which is written : 

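$$ \mathcal{G}_{\mathcal{J}}[\mathbf{r}, \mathbf{p}; t] = \mathcal{G}_{\mathcal{J}'}[\mathbf{r}, \mathbf{p} + q \boldsymbol{\nabla}\chi(\mathbf{r}; t); t] \tag{17} $$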

This relation is characteristic of the functions $\mathcal{G}_{\mathcal{J}}[\mathbf{r}, \mathbf{p}; t]$ associated with 
true physical quantities. Therefore, if one considers the function $\mathcal{G}_{\mathcal{J}'}[\mathbf{r}, \mathbf{p}; t]$ 
for the gauge $\mathcal{J}'$, and if one replaces $\mathbf{p}$ by $\mathbf{p} + q \boldsymbol{\nabla}\chi(\mathbf{r}; t)$ [where $\chi(\mathbf{r}; t)$ defines, 
according to (2), the gauge transformation from $\mathcal{J}$ to $\mathcal{J}'$], one obtains a new 
function of $\mathbf{r}$ and $\mathbf{p}$ which must be identical to $\mathcal{G}_{\mathcal{J}}[\mathbf{r}, \mathbf{p}; t]$. If this is not the case, 
the function considered corresponds to a quantity which is not truly physical. 


(iii) EXAMPLES 

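Let us give some examples of functions $\mathcal{G}_{\mathcal{J}}[\mathbf{r}, \mathbf{p}; t]$ which describe true 
physical quantities. We have already encountered two: those corresponding to 
the position and to the mechanical momentum; the first is simply equal to $\mathbf{r}$ 
and the second to: 

$$ \boldsymbol{\pi}_{\mathcal{J}}(\mathbf{r}, \mathbf{p}; t) = \mathbf{p} - q\mathbf{A}(\mathbf{r}; t) \tag{18} $$

Since relations (11) express the fact that $\mathbf{r}$ and $\boldsymbol{\pi}$ are true physical quantities, 
we know a priori that relation (17) is satisfied by the corresponding functions. 

However, let us verify this directly in order to familiarize ourselves with the use of this 
relation. As regards $\mathbf{r}$, we are dealing with a function which does not depend on $\mathbf{p}$ and whose 
form does not depend on the gauge*; this immediately implies (17). As regards $\boldsymbol{\pi}$, relation (18) 
yields: 

$$ \boldsymbol{\pi}_{\mathcal{J}'}(\mathbf{r}, \mathbf{p}; t) = \mathbf{p} - q\mathbf{A}'(\mathbf{r}; t) \tag{19} $$

Replace in this function $\mathbf{p}$ by $\mathbf{p} + q \boldsymbol{\nabla}\chi(\mathbf{r}; t)$; we obtain the function: 

$$ \mathbf{p} + q \boldsymbol{\nabla}\chi(\mathbf{r}; t) - q\mathbf{A}'(\mathbf{r}; t) = \mathbf{p} - q\mathbf{A}(\mathbf{r}; t) \tag{20} $$

which is none other than $\boldsymbol{\pi}_{\mathcal{J}}(\mathbf{r}, \mathbf{p}; t)$; relation (17) is therefore satisfied. 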


Other true physical quantities are the kinetic energy: 

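$$ \gamma_{\mathcal{J}}(\mathbf{r}, \mathbf{p}; t) = \frac{1}{2m} [\mathbf{p} - q\mathbf{A}(\mathbf{r}; t)]^2 \tag{21} $$

and the moment, with respect to the origin, of the mechanical momentum: 

$$ \boldsymbol{\lambda}_{\mathcal{J}}(\mathbf{r}, \mathbf{p}; t) = \mathbf{r} \times [\mathbf{p} - q\mathbf{A}(\mathbf{r}; t)] \tag{22} $$

In general, we see that whenever a function of $\mathbf{r}$ and $\mathbf{p}$ has the form: 

$$ \mathcal{G}_{\mathcal{J}}(\mathbf{r}, \mathbf{p}; t) = F[\mathbf{r}, \mathbf{p} - q\mathbf{A}(\mathbf{r}; t)] \tag{23} $$

(where $F$ is a function whose form is independent of the gauge $\mathcal{J}$ chosen), 
we obtain a true physical quantity**. This result makes sense since (23) really 
expresses the fact that the values taken on by the quantity considered are 
obtained from those of $\mathbf{r}$ and $\boldsymbol{\pi}$, which we know to be gauge-invariant. 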

Let us also give some examples of functions describing quantities which 
are not true physical quantities. In addition to the momentum $\mathbf{p}$, we can cite 
the function: 

$$ \mathcal{C}(\mathbf{p}) = \frac{\mathbf{p}^2}{2m} \tag{24} $$

which must not be confused with the kinetic energy written in (21), and, in 
general, any function of $\mathbf{p}$ alone (and, possibly, of the time). Similarly, the 
angular momentum: 

$$ \boldsymbol{\mathcal{L}}(\mathbf{r}, \mathbf{p}) = \mathbf{r} \times \mathbf{p} \tag{25} $$

cannot be considered to be a true physical quantity. Finally, let us cite the 
classical Hamiltonian, which, according to (7), is the sum of the kinetic energy 
$\gamma_{\mathcal{J}}(\mathbf{r}, \mathbf{p}; t)$, which is a true physical quantity, and the potential energy $qU$. Now, 
the latter [which should rigorously be written in the form of a gauge-dependent 
function $U_{\mathcal{J}}(\mathbf{r}; t)$] is not a true physical quantity since, at every point in space, its 
value changes when the gauge is changed. 


* It is not difficult to verify that, in general, any function $\mathcal{G}(\mathbf{r}, t)$ which depends only on $\mathbf{r}$ (and, 
possibly, on the time), and whose form is the same in any gauge $\mathcal{J}$ chosen, describes a true physical 
quantity. 

** One could also construct functions associated with true physical quantities in which the 
potentials are involved in a more complex way than in (23) (for example, the scalar product of the particle 
velocity and the electric field at the position of the particle). 
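The criterion (17) lends itself to a quick numerical check. The following is a minimal one-dimensional sketch, not part of the original text: the vector potential `A`, the gauge function `chi`, the sample points and the unit values of `q` and `m` are all illustrative assumptions, and relation (2) is taken in the form A' = A + ∇χ, consistent with (20). It tests whether a pair of functions (one per gauge) satisfies (17) at a few phase-space points.

```python
import math

q, m = 1.0, 1.0  # illustrative charge and mass (arbitrary units, assumed)

def A(x):
    """Vector potential in the gauge J (toy choice)."""
    return 0.3 * x**2

def chi(x):
    """Gauge function chi(x) (toy choice)."""
    return math.sin(x)

def dchi(x):
    """Gradient of chi."""
    return math.cos(x)

def A_prime(x):
    """Vector potential in the gauge J', assuming A' = A + grad(chi)."""
    return A(x) + dchi(x)

def kinetic(x, p, vec_pot):
    """Kinetic energy (21), built on the mechanical momentum p - q A."""
    return (p - q * vec_pot(x)) ** 2 / (2 * m)

def satisfies_17(G_J, G_Jp, samples):
    """Check G_J(x, p) == G_J'(x, p + q grad(chi)) at every sample point."""
    return all(
        math.isclose(G_J(x, p), G_Jp(x, p + q * dchi(x)), abs_tol=1e-12)
        for x, p in samples
    )

samples = [(-1.2, 0.4), (0.0, -2.0), (0.7, 1.5), (2.3, 0.1)]

# Kinetic energy: its form changes with the gauge, and (17) holds.
kinetic_ok = satisfies_17(
    lambda x, p: kinetic(x, p, A),
    lambda x, p: kinetic(x, p, A_prime),
    samples,
)

# p^2/2m, as in (24): same form in both gauges, and (17) fails.
p2_ok = satisfies_17(
    lambda x, p: p * p / (2 * m),
    lambda x, p: p * p / (2 * m),
    samples,
)

print("kinetic energy satisfies (17):", kinetic_ok)
print("p^2/2m satisfies (17):        ", p2_ok)
```

Under these assumptions the kinetic energy passes the test while the function of p alone does not, which is the distinction drawn between (21) and (24).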


3. Gauge invariance in quantum mechanics 

In chapter III, we introduced the postulates of quantum mechanics by 
starting from the Hamiltonian formulation of classical mechanics. We are thus 
led to ask if the problem of gauge invariance, easily resolved in classical 
mechanics because of the existence of Newton’s equations, is more complex in 
the framework of quantum mechanics. The following question then arises: 
are the postulates stated in chapter III valid for any arbitrarily chosen gauge $\mathcal{J}$, 
or only for a particular gauge? 

In answering this question, we shall be guided by the results obtained in the 
preceding paragraph. Following the same type of reasoning, we shall see that 
there exists a close analogy between the consequences of a gauge transformation 
in the classical Hamiltonian formalism and in the quantum mechanical formalism. 
We shall thus establish the gauge invariance of quantum mechanics. 

To do this, we shall begin (§a) by examining the results obtained when 
the quantization rules are applied in the same way in two different gauges. 
We shall then see (§b) that, as in classical mechanics, where the values of 
the dynamical variables generally change when the gauge is changed, a given 
physical system must be characterized by a mathematical state vector $|\psi\rangle$ which 
depends on the gauge. The transition from the state vector corresponding to one 
gauge $\mathcal{J}$ to that of another gauge $\mathcal{J}'$ is effected by a unitary transformation. 
The form of the Schrödinger equation, however, always remains the same (as do 
Hamilton’s equations in classical mechanics). Finally, we shall examine the behavior, 
under a gauge transformation, of the observables associated with the system (§c). 
We shall then see that the simultaneous modification of the state vector and the 
observables is such that the physical content of quantum mechanics does not 
depend on the gauge chosen. Moreover, we shall demonstrate this by showing that 
the values of the probability density and the probability current are gauge-invariant. 

a. QUANTIZATION RULES 

The state space of a (spinless) particle is always $\mathcal{E}_{\mathbf{r}}$. However, we are 
clearly led by the results of §2 above to expect that the operator associated with 
a given quantity may be different in two different gauges. We shall therefore 
label these operators with an index $\mathcal{J}$. 




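The quantization rules associate, with the position $\mathbf{r}$ and the momentum $\mathbf{p}$ 
of the particle, operators $\mathbf{R}$ and $\mathbf{P}$ acting in $\mathcal{E}_{\mathbf{r}}$ such that: 

$$ [X, P_x] = [Y, P_y] = [Z, P_z] = i\hbar \tag{26} $$

(where all the other commutators between components of $\mathbf{R}$ and $\mathbf{P}$ are zero). 
In the $\{ |\mathbf{r}\rangle \}$ representation, the operator $\mathbf{R}$ acts like a multiplication by $\mathbf{r}$, 
and $\mathbf{P}$ like the differential operator $\frac{\hbar}{i}\boldsymbol{\nabla}$. These rules are the same in all gauges. 
We can therefore write: 

$$ \mathbf{R}_{\mathcal{J}'} = \mathbf{R}_{\mathcal{J}} \tag{27-a} $$

$$ \mathbf{P}_{\mathcal{J}'} = \mathbf{P}_{\mathcal{J}} \tag{27-b} $$

In fact, these equations enable us to omit the index $\mathcal{J}$ for the observables $\mathbf{R}$ 
and $\mathbf{P}$, and we shall henceforth do so. 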

The quantization of all other quantities associated with the particle is derived 
from this: in a given gauge, one takes the function of r and p which gives 
the classical quantity considered and (after having symmetrized, if necessary) 
replaces r by the operator R and p by P. We thus obtain the operator 
which, in the gauge chosen, describes this quantity. Consider some examples : 

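- The angular momentum operator, obtained from $\mathbf{r} \times \mathbf{p}$, is the same in 
all gauges: 

$$ \mathbf{L}_{\mathcal{J}'} = \mathbf{L}_{\mathcal{J}} \tag{28} $$

- The operator associated with the mechanical momentum, on the other 
hand, depends on the gauge chosen. In the gauge $\mathcal{J}$, it is given by: 

$$ \boldsymbol{\Pi}_{\mathcal{J}} = \mathbf{P} - q\mathbf{A}(\mathbf{R}; t) \tag{29} $$

If the gauge is changed, it becomes: 

$$ \boldsymbol{\Pi}_{\mathcal{J}'} = \mathbf{P} - q\mathbf{A}'(\mathbf{R}; t) \tag{30} $$

whose action in $\mathcal{E}_{\mathbf{r}}$ is different from that of $\boldsymbol{\Pi}_{\mathcal{J}}$: 

$$ \boldsymbol{\Pi}_{\mathcal{J}'} = \boldsymbol{\Pi}_{\mathcal{J}} - q \boldsymbol{\nabla}\chi(\mathbf{R}; t) \tag{31} $$

- Similarly, the operator*: 

$$ \boldsymbol{\Lambda}_{\mathcal{J}} = \mathbf{R} \times \boldsymbol{\Pi}_{\mathcal{J}} = \mathbf{R} \times [\mathbf{P} - q\mathbf{A}(\mathbf{R}; t)] \tag{32} $$

which describes the moment of the mechanical momentum, explicitly involves 
the vector potential chosen. 

- Finally, the Hamiltonian operator is obtained from formula (7): 

$$ H_{\mathcal{J}} = \frac{1}{2m} [\mathbf{P} - q\mathbf{A}(\mathbf{R}; t)]^2 + qU(\mathbf{R}; t) \tag{33} $$

It is obvious that in another gauge, it becomes a different operator, since: 

$$ H_{\mathcal{J}'} = \frac{1}{2m} [\mathbf{P} - q\mathbf{A}'(\mathbf{R}; t)]^2 + qU'(\mathbf{R}; t) \neq H_{\mathcal{J}} \tag{34} $$


* It can be verified, by using the commutation relations of $\mathbf{R}$ and $\boldsymbol{\Pi}_{\mathcal{J}}$, that it is not necessary to 
symmetrize expression (32). 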


b. UNITARY TRANSFORMATION OF THE STATE VECTOR; 

FORM INVARIANCE OF THE SCHRODINGER EQUATION 

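α. The unitary operator $T_\chi(t)$ 

In classical mechanics, we denoted by $\{ \mathbf{r}(t), \mathbf{p}(t) \}$ and $\{ \mathbf{r}'(t), \mathbf{p}'(t) \}$ the 
values of the fundamental dynamical variables characterizing the state of the 
particle in two different gauges $\mathcal{J}$ and $\mathcal{J}'$. In quantum mechanics, we shall 
therefore denote by $|\psi(t)\rangle$ and $|\psi'(t)\rangle$ the state vectors relative to these two gauges, 
and the analogue of relations (14) is thus given by relations between mean 
values: 

$$ \langle \psi'(t) | \mathbf{R}_{\mathcal{J}'} | \psi'(t) \rangle = \langle \psi(t) | \mathbf{R}_{\mathcal{J}} | \psi(t) \rangle \tag{35-a} $$

$$ \langle \psi'(t) | \mathbf{P}_{\mathcal{J}'} | \psi'(t) \rangle = \langle \psi(t) | \mathbf{P}_{\mathcal{J}} + q \boldsymbol{\nabla}\chi(\mathbf{R}; t) | \psi(t) \rangle \tag{35-b} $$

Using (27), we see immediately that this is possible only if $|\psi(t)\rangle$ and $|\psi'(t)\rangle$ 
are two different kets. We shall therefore seek a unitary transformation $T_\chi(t)$ which 
enables us to go from $|\psi(t)\rangle$ to $|\psi'(t)\rangle$: 

$$ |\psi'(t)\rangle = T_\chi(t) |\psi(t)\rangle \tag{36-a} $$

$$ T_\chi^\dagger(t) T_\chi(t) = T_\chi(t) T_\chi^\dagger(t) = \mathbb{1} \tag{36-b} $$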

Taking (27) into account, we see that equations (35) are satisfied for any $|\psi(t)\rangle$ on condition 
that: 

$$ T_\chi^\dagger(t) \, \mathbf{R} \, T_\chi(t) = \mathbf{R} \tag{37-a} $$

$$ T_\chi^\dagger(t) \, \mathbf{P} \, T_\chi(t) = \mathbf{P} + q \boldsymbol{\nabla}\chi(\mathbf{R}; t) \tag{37-b} $$

Multiplying (37-a) on the left by $T_\chi(t)$, we obtain: 

$$ \mathbf{R} \, T_\chi(t) = T_\chi(t) \, \mathbf{R} \tag{38} $$

The desired unitary operator commutes with the three components of $\mathbf{R}$; it can therefore be 
written in the form: 

$$ T_\chi(t) = e^{iF(\mathbf{R}; t)} \tag{39} $$

where $F(\mathbf{R}; t)$ is a Hermitian operator. Relation (48) of complement $B_{II}$ then allows us to write: 

$$ [\mathbf{P}, T_\chi(t)] = \hbar \boldsymbol{\nabla}\{F(\mathbf{R}; t)\} \, T_\chi(t) \tag{40} $$

If we multiply this equation on the left by $T_\chi^\dagger(t)$ and substitute it into (37-b), we easily obtain 
the relation: 

$$ \hbar \boldsymbol{\nabla}\{F(\mathbf{R}; t)\} = q \boldsymbol{\nabla}\chi(\mathbf{R}; t) \tag{41} $$

which is satisfied when: 

$$ F(\mathbf{R}; t) = F_0(t) + \frac{q}{\hbar} \chi(\mathbf{R}; t) \tag{42} $$

Omitting the coefficient $F_0(t)$, which corresponds, for the state vector $|\psi(t)\rangle$, to a global phase 
factor of no physical consequence, we obtain the operator $T_\chi(t)$: 

$$ T_\chi(t) = e^{i \frac{q}{\hbar} \chi(\mathbf{R}; t)} \tag{43} $$

If, in (36-a), $T_\chi(t)$ is this operator, relations (35) are automatically satisfied. 
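Relation (37-b) can be checked numerically on a wave function. The sketch below is an illustration under stated assumptions rather than anything taken from the text: it works in one dimension, sets ħ = q = 1, picks an arbitrary Gaussian wave packet `psi` and gauge function `chi`, and replaces the derivative by a central finite difference. In the position representation it compares the action of T†PT with that of P + q∇χ.

```python
import cmath
import math

hbar, q = 1.0, 1.0  # units chosen for convenience (assumption)

def psi(x):
    """Arbitrary smooth test wave function (Gaussian packet, toy choice)."""
    return cmath.exp(-x * x / 2 + 0.5j * x)

def chi(x):
    """Gauge function (toy choice)."""
    return math.sin(x)

def dchi(x):
    """Gradient of chi."""
    return math.cos(x)

def T(x):
    """T_chi in the position representation: multiplication by exp(i q chi / hbar)."""
    return cmath.exp(1j * q * chi(x) / hbar)

def deriv(f, x, h=1e-5):
    """Central finite-difference approximation of df/dx."""
    return (f(x + h) - f(x - h)) / (2 * h)

def lhs(x):
    """(T^dagger P T psi)(x), with P = (hbar/i) d/dx."""
    return T(x).conjugate() * (hbar / 1j) * deriv(lambda y: T(y) * psi(y), x)

def rhs(x):
    """((P + q grad(chi)) psi)(x)."""
    return (hbar / 1j) * deriv(psi, x) + q * dchi(x) * psi(x)

points = [-1.0, -0.3, 0.0, 0.8, 1.7]
max_err = max(abs(lhs(x) - rhs(x)) for x in points)
print("largest |lhs - rhs| over the sample points:", max_err)
```

The two sides agree up to the finite-difference error, as expected from (43).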
COMMENTS: 

(i) In the $\{ |\mathbf{r}\rangle \}$ representation, relations (36-a) and (43) imply that the 
wave functions $\psi(\mathbf{r}, t) = \langle \mathbf{r} | \psi(t) \rangle$ and $\psi'(\mathbf{r}, t) = \langle \mathbf{r} | \psi'(t) \rangle$ are related 
by: 

$$ \psi'(\mathbf{r}, t) = e^{i \frac{q}{\hbar} \chi(\mathbf{r}; t)} \, \psi(\mathbf{r}, t) \tag{44} $$

For the wave function, the gauge transformation corresponds to a phase 
change which varies from one point to another, and is not, therefore, a 
global phase factor. The gauge invariance of physical predictions obtained by 
using the wave functions $\psi$ or $\psi'$ is therefore not obvious a priori. 

(ii) If the system under study is composed of several particles having positions 
$\mathbf{r}_1, \mathbf{r}_2, ...$ and charges $q_1, q_2, ...$, (43) must be replaced by: 

$$ T_\chi(t) = T_\chi^{(1)}(t) \, T_\chi^{(2)}(t) \cdots = e^{\frac{i}{\hbar} [q_1 \chi(\mathbf{R}_1; t) + q_2 \chi(\mathbf{R}_2; t) + \cdots]} \tag{45} $$

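β. Time evolution of the state vector 

Now let us show that if the evolution of the ket $|\psi(t)\rangle$ obeys, in the 
gauge $\mathcal{J}$, the Schrödinger equation: 

$$ i\hbar \frac{d}{dt} |\psi(t)\rangle = H_{\mathcal{J}}(t) |\psi(t)\rangle \tag{46} $$

the state vector $|\psi'(t)\rangle$ given by (36) satisfies an equation of the same form 
in the gauge $\mathcal{J}'$: 

$$ i\hbar \frac{d}{dt} |\psi'(t)\rangle = H_{\mathcal{J}'}(t) |\psi'(t)\rangle \tag{47} $$

where $H_{\mathcal{J}'}(t)$ is given by (34). 

To do this, let us calculate the left-hand side of (47); it is written: 

$$ i\hbar \frac{d}{dt} |\psi'(t)\rangle = i\hbar \frac{d}{dt} \{ T_\chi(t) |\psi(t)\rangle \} = i\hbar \left\{ \frac{d}{dt} T_\chi(t) \right\} |\psi(t)\rangle + i\hbar \, T_\chi(t) \frac{d}{dt} |\psi(t)\rangle \tag{48} $$

that is, according to (43) and (46)*: 

$$ i\hbar \frac{d}{dt} |\psi'(t)\rangle = -q \left\{ \frac{\partial}{\partial t} \chi(\mathbf{R}; t) \right\} T_\chi(t) |\psi(t)\rangle + T_\chi(t) H_{\mathcal{J}}(t) |\psi(t)\rangle = \left\{ -q \frac{\partial}{\partial t} \chi(\mathbf{R}; t) + \tilde{H}_{\mathcal{J}}(t) \right\} |\psi'(t)\rangle \tag{49} $$

where $\tilde{H}_{\mathcal{J}}(t)$ designates the transform of $H_{\mathcal{J}}(t)$ by the unitary operator $T_\chi(t)$: 

$$ \tilde{H}_{\mathcal{J}}(t) = T_\chi(t) \, H_{\mathcal{J}}(t) \, T_\chi^\dagger(t) \tag{50} $$

* The function $\chi$ depends on $\mathbf{R}$ and not on $\mathbf{P}$; consequently $\chi(\mathbf{R}; t)$ commutes with $\frac{\partial}{\partial t}\chi(\mathbf{R}; t)$. 
This is why $T_\chi(t)$ can be differentiated as if $\chi(\mathbf{R}; t)$ were an ordinary function of the time and not an 
operator (cf. complement $B_{II}$, comment of §5-c). 

Equation (47) will therefore be satisfied if: 

$$ H_{\mathcal{J}'}(t) = \tilde{H}_{\mathcal{J}}(t) - q \frac{\partial}{\partial t} \chi(\mathbf{R}; t) \tag{51} $$

Now $\tilde{H}_{\mathcal{J}}(t)$ is given by: 

$$ \tilde{H}_{\mathcal{J}}(t) = \frac{1}{2m} [\tilde{\mathbf{P}} - q\mathbf{A}(\tilde{\mathbf{R}}; t)]^2 + qU(\tilde{\mathbf{R}}; t) \tag{52} $$

where $\tilde{\mathbf{R}}$ and $\tilde{\mathbf{P}}$ designate the transforms of $\mathbf{R}$ and $\mathbf{P}$ by the unitary operator $T_\chi(t)$. 
According to (37): 

$$ \tilde{\mathbf{R}} = T_\chi(t) \, \mathbf{R} \, T_\chi^\dagger(t) = \mathbf{R} \tag{53-a} $$

$$ \tilde{\mathbf{P}} = T_\chi(t) \, \mathbf{P} \, T_\chi^\dagger(t) = \mathbf{P} - q \boldsymbol{\nabla}\chi(\mathbf{R}; t) \tag{53-b} $$

These relations, substituted into (52), indicate that: 

$$ \tilde{H}_{\mathcal{J}}(t) = \frac{1}{2m} [\mathbf{P} - q\mathbf{A}(\mathbf{R}; t) - q \boldsymbol{\nabla}\chi(\mathbf{R}; t)]^2 + qU(\mathbf{R}; t) \tag{54} $$

Using relations (2) to replace the potentials relative to the gauge $\mathcal{J}$ by those relative to $\mathcal{J}'$, 
we then obtain, using (34), relation (51). Therefore, the Schrödinger equation can be written 
in the same way in any gauge chosen. 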
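
The last step can be written out explicitly. The lines below are a worked sketch, assuming that relations (2), which are not reproduced in this section, read $\mathbf{A}' = \mathbf{A} + \boldsymbol{\nabla}\chi$ and $U' = U - \partial\chi/\partial t$ (the form consistent with (31) and (51)):

```latex
\begin{aligned}
\tilde{H}_{\mathcal{J}}(t)
  &= \frac{1}{2m}\left[\mathbf{P} - q\left(\mathbf{A}(\mathbf{R};t) + \boldsymbol{\nabla}\chi(\mathbf{R};t)\right)\right]^2
     + q\left(U(\mathbf{R};t) - \frac{\partial}{\partial t}\chi(\mathbf{R};t)\right)
     + q\,\frac{\partial}{\partial t}\chi(\mathbf{R};t) \\
  &= \frac{1}{2m}\left[\mathbf{P} - q\mathbf{A}'(\mathbf{R};t)\right]^2 + qU'(\mathbf{R};t)
     + q\,\frac{\partial}{\partial t}\chi(\mathbf{R};t) \\
  &= H_{\mathcal{J}'}(t) + q\,\frac{\partial}{\partial t}\chi(\mathbf{R};t)
\end{aligned}
```

The first line is (54) with $q\,\partial\chi/\partial t$ added and subtracted; the last line is relation (51) rearranged.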


c. INVARIANCE OF PHYSICAL PREDICTIONS 
UNDER A GAUGE TRANSFORMATION 

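α. Behavior of the observables 

Under the effect of the unitary transformation $T_\chi(t)$, any observable $K$ 
is transformed into $\tilde{K}$, with: 

$$ \tilde{K} = T_\chi(t) \, K \, T_\chi^\dagger(t) \tag{55} $$

We have already seen, in (53), that while $\tilde{\mathbf{R}}$ is simply equal to $\mathbf{R}$, $\tilde{\mathbf{P}}$ is not 
equal to $\mathbf{P}$. Similarly, $\tilde{\boldsymbol{\Pi}}_{\mathcal{J}}$ is different from $\boldsymbol{\Pi}_{\mathcal{J}}$ since: 

$$ \tilde{\boldsymbol{\Pi}}_{\mathcal{J}} = \tilde{\mathbf{P}} - q\mathbf{A}(\tilde{\mathbf{R}}; t) = \mathbf{P} - q \boldsymbol{\nabla}\chi(\mathbf{R}; t) - q\mathbf{A}(\mathbf{R}; t) = \boldsymbol{\Pi}_{\mathcal{J}} - q \boldsymbol{\nabla}\chi(\mathbf{R}; t) \tag{56} $$

Taking (27-a) and (31) into account, we see that relations (53-a) and (56) imply 
that the observables $\mathbf{R}$ and $\boldsymbol{\Pi}_{\mathcal{J}}$, associated with true physical quantities (position 
and mechanical momentum), are such that: 

$$ \tilde{\mathbf{R}}_{\mathcal{J}} = \mathbf{R}_{\mathcal{J}'} \, , \qquad \tilde{\boldsymbol{\Pi}}_{\mathcal{J}} = \boldsymbol{\Pi}_{\mathcal{J}'} \tag{57} $$

On the other hand, the momentum $\mathbf{P}$ (which is not a true physical quantity) 
does not satisfy an analogous relation, since, from (27-b) and (53-b): 

$$ \tilde{\mathbf{P}}_{\mathcal{J}} \neq \mathbf{P}_{\mathcal{J}'} \tag{58} $$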
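We shall see that this result is a general one: in quantum mechanics, for every true 
physical quantity, there is an operator $G_{\mathcal{J}}(t)$ which satisfies: 

$$ \tilde{G}_{\mathcal{J}}(t) = G_{\mathcal{J}'}(t) \tag{59} $$

This relation is the quantum mechanical analogue of the classical relation (16). 
It shows that, except for the special case of $\mathbf{R}$ or a function of $\mathbf{R}$ alone, the 
operator corresponding to a true physical quantity depends on the gauge. 
We have already seen examples of this in (29) and (32). 

To prove (59), one need only apply the quantization rules stated in chapter III to a 
function $\mathcal{G}_{\mathcal{J}}(\mathbf{r}, \mathbf{p}; t)$ and use relation (17), the characteristic relation for true physical classical 
quantities. We therefore replace $\mathbf{r}$ and $\mathbf{p}$ by the operators $\mathbf{R}$ and $\mathbf{P}$ and obtain (if necessary, after 
a symmetrization with respect to these operators) the operator $G_{\mathcal{J}}(t)$. If the form of the 
function $\mathcal{G}_{\mathcal{J}}$ depends on the gauge chosen, the operator $G_{\mathcal{J}}(t)$ also depends on $\mathcal{J}$. When the quantity 
associated with $\mathcal{G}_{\mathcal{J}}$ is a true physical quantity, we have, according to (17): 

$$ \mathcal{G}_{\mathcal{J}}[\mathbf{R}, \mathbf{P}; t] = \mathcal{G}_{\mathcal{J}'}[\mathbf{R}, \mathbf{P} + q \boldsymbol{\nabla}\chi(\mathbf{R}; t); t] \tag{60} $$

Applying the unitary transformation $T_\chi(t)$ to this relation, we obtain: 

$$ \tilde{\mathcal{G}}_{\mathcal{J}}[\mathbf{R}, \mathbf{P}; t] = \tilde{\mathcal{G}}_{\mathcal{J}'}[\mathbf{R}, \mathbf{P} + q \boldsymbol{\nabla}\chi(\mathbf{R}; t); t] = \mathcal{G}_{\mathcal{J}'}[\tilde{\mathbf{R}}, \tilde{\mathbf{P}} + q \boldsymbol{\nabla}\chi(\tilde{\mathbf{R}}; t); t] \tag{61} $$

That is, taking (53) into account: 

$$ \tilde{\mathcal{G}}_{\mathcal{J}}[\mathbf{R}, \mathbf{P}; t] = \mathcal{G}_{\mathcal{J}'}[\mathbf{R}, \mathbf{P}; t] \tag{62} $$

After symmetrizing, if necessary, both sides of this relation, we indeed obtain (59). 


Let us give some examples of true physical observables. In addition to $\mathbf{R}$ 
and $\boldsymbol{\Pi}_{\mathcal{J}}$, we can cite the moment $\boldsymbol{\Lambda}_{\mathcal{J}}$ of the mechanical momentum [cf. (32)], 
or the kinetic energy: 

$$ \Gamma_{\mathcal{J}} = \frac{\boldsymbol{\Pi}_{\mathcal{J}}^2}{2m} = \frac{1}{2m} [\mathbf{P} - q\mathbf{A}(\mathbf{R}; t)]^2 \tag{63} $$

On the other hand, $\mathbf{P}$ and $\mathbf{L}$ are not true physical quantities; neither is the 
Hamiltonian, since relation (51) implies in general that: 

$$ \tilde{H}_{\mathcal{J}}(t) \neq H_{\mathcal{J}'}(t) \tag{64} $$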


COMMENT : 

In classical mechanics, it is well known that the total energy of a 
particle moving in a time-independent electromagnetic field is a constant 
of the motion. It is indeed possible in this case to limit oneself to 
potentials which are also time-independent. We see from (51) that one then 
has: 

$$ \tilde{H}_{\mathcal{J}} = H_{\mathcal{J}'} \tag{65} $$

In this particular case, $H_{\mathcal{J}}$ is indeed a true physical observable which 
can therefore be interpreted to be the total energy of the particle. 

β. Probability of the various possible results of a measurement bearing on a true 
physical quantity 

Assume that at time $t$ we want to measure a true physical quantity. 
In the gauge $\mathcal{J}$, the state of the system is described at this instant by the ket* $|\psi\rangle$, 
and the physical quantity, by the observable $G_{\mathcal{J}}$. Let $|\varphi_n\rangle$ be an eigenvector of 
$G_{\mathcal{J}}$, with the eigenvalue $g_n$ (assumed, for simplicity, to be non-degenerate): 

$$ G_{\mathcal{J}} |\varphi_n\rangle = g_n |\varphi_n\rangle \tag{66} $$

As calculated in the gauge $\mathcal{J}$ from the postulates of quantum mechanics, 
the probability of obtaining $g_n$ in the measurement envisaged is equal to: 

$$ \mathcal{P}(g_n) = |\langle \varphi_n | \psi \rangle|^2 \tag{67} $$

What happens to this prediction when the gauge is changed? According to 
(59), the operator $G_{\mathcal{J}'}$ associated with the quantity under consideration in the new 
gauge $\mathcal{J}'$ has the ket: 

$$ |\varphi'_n\rangle = T_\chi |\varphi_n\rangle \tag{68} $$

as an eigenvector, with the same eigenvalue $g_n$ as in (66). That is: 

$$ G_{\mathcal{J}'} |\varphi'_n\rangle = T_\chi \, G_{\mathcal{J}} \, T_\chi^\dagger \, T_\chi |\varphi_n\rangle = T_\chi \, g_n |\varphi_n\rangle = g_n |\varphi'_n\rangle \tag{69} $$

In the gauge $\mathcal{J}'$, $g_n$ still appears, therefore, as a possible measurement result. 
Moreover, calculation of the corresponding probability yields the same value as in 
the gauge $\mathcal{J}$, since, according to (36-a) and (68): 

$$ \langle \varphi'_n | \psi' \rangle = \langle \varphi_n | T_\chi^\dagger T_\chi | \psi \rangle = \langle \varphi_n | \psi \rangle \tag{70} $$

We have thus verified that the postulates of quantum mechanics lead to 
gauge-invariant physical predictions : the possible results of any measurement and 
the associated probabilities are invariant under a gauge transformation. 
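
As a toy illustration of (70), the gauge transformation can be modelled on a four-site lattice, where the operator of (43) is a diagonal matrix of position-dependent phases. Everything numerical here is a hypothetical choice made for the sketch: the phase values, the state `psi` and the vector `phi` standing in for an eigenvector.

```python
import cmath
import math

# Position-dependent phases q*chi(x_k)/hbar at four lattice sites (hypothetical values).
phases = [0.0, 0.7, -1.3, 2.1]
T = [cmath.exp(1j * a) for a in phases]  # diagonal entries of the unitary T_chi

def inner(u, v):
    """Scalar product <u|v> for vectors given as lists of complex components."""
    return sum(a.conjugate() * b for a, b in zip(u, v))

def apply_T(v):
    """Act with the diagonal unitary T_chi on a vector."""
    return [t * c for t, c in zip(T, v)]

psi = [0.5, 0.5j, -0.5, 0.5]                           # normalized state in the gauge J
phi = [1 / math.sqrt(2), 0, 1j / math.sqrt(2), 0]      # stand-in for an eigenvector

prob_J = abs(inner(phi, psi)) ** 2                     # as in (67), gauge J
prob_Jp = abs(inner(apply_T(phi), apply_T(psi))) ** 2  # same quantity in the gauge J'

print("probability in gauge J :", prob_J)
print("probability in gauge J':", prob_Jp)
```

The two probabilities coincide because both the state and the eigenvector are multiplied by the same local phases, which cancel in the scalar product.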

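γ. Probability density and current 

Let us calculate, from formulas (D-9) and (D-20) of chapter III, the 
probability density $\rho(\mathbf{r}, t)$ and probability current $\mathbf{J}(\mathbf{r}, t)$ in two different gauges $\mathcal{J}$ 
and $\mathcal{J}'$. For the first gauge, we have: 

$$ \rho(\mathbf{r}, t) = |\psi(\mathbf{r}, t)|^2 \tag{71} $$

and: 

$$ \mathbf{J}(\mathbf{r}, t) = \frac{1}{m} \, \mathrm{Re} \left\{ \psi^*(\mathbf{r}, t) \left[ \frac{\hbar}{i} \boldsymbol{\nabla} - q\mathbf{A}(\mathbf{r}; t) \right] \psi(\mathbf{r}, t) \right\} \tag{72} $$

Relation (44) immediately shows that: 

$$ \rho'(\mathbf{r}, t) = \rho(\mathbf{r}, t) \tag{73} $$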

* We do not indicate the time dependence because all the quantities must be evaluated at the 
time t when we want to perform the measurement. 






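Moreover, it also implies that: 

$$ \mathbf{J}'(\mathbf{r}, t) = \frac{1}{m} \, \mathrm{Re} \left\{ e^{-i \frac{q}{\hbar} \chi(\mathbf{r}; t)} \psi^*(\mathbf{r}, t) \left[ \frac{\hbar}{i} \boldsymbol{\nabla} - q\mathbf{A}'(\mathbf{r}; t) \right] e^{i \frac{q}{\hbar} \chi(\mathbf{r}; t)} \psi(\mathbf{r}, t) \right\} $$

$$ = \frac{1}{m} \, \mathrm{Re} \left\{ \psi^*(\mathbf{r}, t) \left[ \frac{\hbar}{i} \boldsymbol{\nabla} - q\mathbf{A}'(\mathbf{r}; t) + q \boldsymbol{\nabla}\chi(\mathbf{r}; t) \right] \psi(\mathbf{r}, t) \right\} \tag{74} $$

that is, taking (2) into account: 

$$ \mathbf{J}'(\mathbf{r}, t) = \mathbf{J}(\mathbf{r}, t) \tag{75} $$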

The probability density and current are therefore invariant under a gauge 
transformation. This result could have been foreseen, moreover, from the conclusions 
of §β above, since [cf. relation (D-19) of chapter III] $\rho(\mathbf{r}, t)$ and $\mathbf{J}(\mathbf{r}, t)$ can be 
considered to be mean values of the operators $|\mathbf{r}\rangle\langle\mathbf{r}|$ and: 

$$ \mathbf{K}_{\mathcal{J}}(\mathbf{r}) = \frac{1}{2m} \left\{ |\mathbf{r}\rangle\langle\mathbf{r}| \, \boldsymbol{\Pi}_{\mathcal{J}} + \boldsymbol{\Pi}_{\mathcal{J}} \, |\mathbf{r}\rangle\langle\mathbf{r}| \right\} \tag{76} $$

It is not difficult to show that these two operators satisfy relation (59). They 
therefore describe true physical quantities whose mean values are gauge-invariant. 
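
The invariance expressed by (73) and (75) can also be checked numerically. The following one-dimensional sketch makes several assumptions that are not in the text: ħ = q = m = 1, a toy vector potential `A`, a toy gauge function `chi`, a Gaussian test wave function, a finite-difference derivative, and relation (2) taken as A' = A + ∇χ. It evaluates the density and the current of (72) in both gauges.

```python
import cmath
import math

hbar, q, m = 1.0, 1.0, 1.0  # convenient units (assumption)

def A(x):
    """Vector potential in the gauge J (toy choice)."""
    return 0.3 * x

def chi(x):
    """Gauge function (toy choice)."""
    return math.sin(x)

def A_prime(x):
    """Vector potential in the gauge J', assuming A' = A + d(chi)/dx."""
    return A(x) + math.cos(x)

def psi(x):
    """Wave function in the gauge J (arbitrary Gaussian packet)."""
    return cmath.exp(-x * x / 2 + 0.5j * x)

def psi_prime(x):
    """Wave function in the gauge J', from relation (44)."""
    return cmath.exp(1j * q * chi(x) / hbar) * psi(x)

def deriv(f, x, h=1e-5):
    """Central finite-difference approximation of df/dx."""
    return (f(x + h) - f(x - h)) / (2 * h)

def current(wave, vec_pot, x):
    """Probability current (72) in one dimension."""
    momentum_term = (hbar / 1j) * deriv(wave, x) - q * vec_pot(x) * wave(x)
    return (wave(x).conjugate() * momentum_term).real / m

points = [-1.0, 0.0, 0.5, 1.5]
max_current_diff = max(
    abs(current(psi, A, x) - current(psi_prime, A_prime, x)) for x in points
)
max_density_diff = max(
    abs(abs(psi(x)) ** 2 - abs(psi_prime(x)) ** 2) for x in points
)
print("largest current difference:", max_current_diff)
print("largest density difference:", max_density_diff)
```

Both differences vanish up to rounding and finite-difference error, although neither the wave function nor the vector potential is the same in the two gauges.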


References and suggestions for further reading: 

Messiah (1.17), chap. XXI, §§20 to 22; Sakurai (2.7), §8-1. 

Gauge invariance, extended to other domains, has recently roused considerable 
interest in particle physics; see, for example, the article by Abers and Lee (16.35). 


PROPAGATOR FOR THE SCHRÖDINGER EQUATION 


1. Introduction 

2. Existence and properties of a propagator K(2, 1) 

a. Existence of a propagator 

b. Physical interpretation of K(2, 1) 

c. Expression for K(2, 1) in terms of the eigenstates of H 

d. Equation satisfied by K(2, 1) 

3. Lagrangian formulation of quantum mechanics 

a. Concept of a space-time path 

b. Decomposition of K(2, 1) into a sum of partial amplitudes 

c. Feynman's postulates 

d. The classical limit and Hamilton's principle 


1. Introduction 

Consider a particle described by the wave function $\psi(\mathbf{r}, t)$. The Schrödinger 
equation enables us to calculate $\frac{\partial}{\partial t}\psi(\mathbf{r}, t)$, that is, the rate of variation of $\psi(\mathbf{r}, t)$ 
with respect to $t$. It therefore gives the time evolution of the wave function $\psi(\mathbf{r}, t)$, 
using a differential point of view. One might wonder if it is possible to adopt a more 
global (but equivalent) point of view that would allow us to determine directly the 
value $\psi(\mathbf{r}_0, t)$ taken on by the wave function at a given point $\mathbf{r}_0$ and a given time $t$ 
from knowledge of the whole wave function $\psi(\mathbf{r}, t')$ at a previous time $t'$ (which is 
not necessarily infinitesimally close). 

To consider this possibility, we can take our inspiration from another 
domain of physics, electromagnetism, where both points of view are possible. 
Maxwell’s equations (the differential point of view) give the rates of variation of 
the various components of the electric and magnetic fields. Huygens’ principle 
(the global point of view) permits the direct calculation, when a monochromatic 
field is known on a surface $\Sigma$, of the field at any point $M$: one sums the fields 
radiated at the point $M$ by fictional secondary sources $N_1, N_2, N_3, ...$ situated 
on the surface $\Sigma$ and whose amplitude and phase are determined by the value of 
the field at $N_1, N_2, N_3, ...$ (fig. 1). 


FIGURE 1 

In a diffraction experiment, Huygens’ principle permits 
the calculation of the electric field at the point $M$, as a 
sum of fields radiated by secondary sources $N_1, N_2, N_3, ...$ 
situated on a surface $\Sigma$. 

We intend to show in this complement that there exists an analogue in 
quantum mechanics of Huygens’ principle. More precisely, we can write, for $t_2 > t_1$: 

$$ \psi(\mathbf{r}_2, t_2) = \int d^3r_1 \, K(\mathbf{r}_2, t_2; \mathbf{r}_1, t_1) \, \psi(\mathbf{r}_1, t_1) \tag{1} $$

a formula whose physical interpretation is the following: the probability amplitude 
of finding the particle at $\mathbf{r}_2$ at the instant $t_2$ is obtained by summing all the 
amplitudes “radiated” by the “secondary sources” $(\mathbf{r}_1, t_1)$, $(\mathbf{r}'_1, t_1)$, ... situated in 
space-time on the surface $t = t_1$, each of these sources contributing to a degree 
proportional to $\psi(\mathbf{r}_1, t_1)$, $\psi(\mathbf{r}'_1, t_1)$, ... (fig. 2). We shall prove the preceding 
formula, calculate $K$, called the propagator for the Schrödinger equation, and 
study its properties. We shall then indicate very qualitatively how it is possible to 
present all of quantum mechanics in terms of $K$ (the Lagrangian formulation of 
quantum mechanics; Feynman’s point of view). 



FIGURE 2 

The probability amplitude $\psi(\mathbf{r}_2, t_2)$ can be obtained by 
summing the contributions of the various amplitudes 
$\psi(\mathbf{r}_1, t_1)$, $\psi(\mathbf{r}'_1, t_1)$, etc., corresponding to one previous 
instant $t_1$. With each of the arrows of the figure is associated 
a “propagator” $K(\mathbf{r}_2, t_2; \mathbf{r}_1, t_1)$, $K(\mathbf{r}_2, t_2; \mathbf{r}'_1, t_1)$, etc. 


2. Existence and properties of a propagator K(2, 1) 

a. EXISTENCE OF A PROPAGATOR 

The problem is to link directly the states of the system at two different 
times. This is possible if we use the evolution operator introduced in 
complement $F_{III}$, since we can write: 

$$ |\psi(t_2)\rangle = U(t_2, t_1) |\psi(t_1)\rangle \tag{2} $$

Given $|\psi(t_2)\rangle$, it is easy to find the wave function $\psi(\mathbf{r}_2, t_2)$: 

$$ \psi(\mathbf{r}_2, t_2) = \langle \mathbf{r}_2 | \psi(t_2) \rangle \tag{3} $$

Substituting (2) into (3) and inserting the closure relation: 

$$ \int d^3r_1 \, |\mathbf{r}_1\rangle \langle \mathbf{r}_1| = \mathbb{1} \tag{4} $$

between $U(t_2, t_1)$ and $|\psi(t_1)\rangle$, we obtain: 

$$ \psi(\mathbf{r}_2, t_2) = \int d^3r_1 \, \langle \mathbf{r}_2 | U(t_2, t_1) | \mathbf{r}_1 \rangle \langle \mathbf{r}_1 | \psi(t_1) \rangle = \int d^3r_1 \, \langle \mathbf{r}_2 | U(t_2, t_1) | \mathbf{r}_1 \rangle \, \psi(\mathbf{r}_1, t_1) \tag{5} $$

The result is thus a formula identical to (1), on the condition that we set: 

$$ \langle \mathbf{r}_2 | U(t_2, t_1) | \mathbf{r}_1 \rangle = K(\mathbf{r}_2, t_2; \mathbf{r}_1, t_1) $$

In fact, since we want to use formulas of the type of (1) only for $t_2 > t_1$, we 
can set $K = 0$ for $t_2 < t_1$. The exact definition of $K$ then becomes: 

$$ K(\mathbf{r}_2, t_2; \mathbf{r}_1, t_1) = \langle \mathbf{r}_2 | U(t_2, t_1) | \mathbf{r}_1 \rangle \, \theta(t_2 - t_1) \tag{6} $$

where $\theta(t_2 - t_1)$ is the “step function”: 

$$ \theta(t_2 - t_1) = 1 \ \text{ if } t_2 > t_1 \, , \qquad \theta(t_2 - t_1) = 0 \ \text{ if } t_2 < t_1 \tag{7} $$

The introduction of $\theta(t_2 - t_1)$ is of both physical and mathematical interest. 
From the physical point of view, it is a simple way of compelling the secondary 
sources situated on the surface $t = t_1$ of figure 2 to “radiate” only towards the 
future. For this reason, $K(\mathbf{r}_2, t_2; \mathbf{r}_1, t_1)$ as defined by (6) is called the retarded 
propagator. From the mathematical point of view, we shall see later that 
$K(\mathbf{r}_2, t_2; \mathbf{r}_1, t_1)$, because of the factor $\theta(t_2 - t_1)$, obeys a partial differential 
equation whose right-hand side is a delta function, that is, the equation which 
defines a Green’s function. 

COMMENTS : 

(i) Note, however, that equation (5) remains valid even if $t_2 < t_1$. It is 
possible, moreover, to introduce mathematically an “advanced” propagator 
which would be different from zero only for $t_2 < t_1$ and which would 
also obey the equation defining a Green’s function. Since the physical 
meaning of such an advanced propagator is not obvious at this stage, 
we shall not study it here. 

(ii) When no ambiguity is possible, we shall simply write $K(2, 1)$ for 
$K(\mathbf{r}_2, t_2; \mathbf{r}_1, t_1)$. 


b. PHYSICAL INTERPRETATION OF K(2, 1) 

This interpretation follows very simply from definition (6): $K(2, 1)$ represents 
the probability amplitude that the particle, starting from the point $\mathbf{r}_1$ at time $t_1$, 
will arrive at the point $\mathbf{r}_2$ at a later time $t_2$. If we take as the initial state at time $t_1$ 
a state localized at the point $\mathbf{r}_1$: 

$$ |\psi(t_1)\rangle = |\mathbf{r}_1\rangle \tag{8} $$

at time $t_2$, the state vector has become: 

$$ |\psi(t_2)\rangle = U(t_2, t_1) |\psi(t_1)\rangle = U(t_2, t_1) |\mathbf{r}_1\rangle \tag{9} $$

The probability amplitude of finding the particle at the point $\mathbf{r}_2$ at this time is then: 

$$ \langle \mathbf{r}_2 | \psi(t_2) \rangle = \langle \mathbf{r}_2 | U(t_2, t_1) | \mathbf{r}_1 \rangle \tag{10} $$

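c. EXPRESSION FOR K(2, 1) IN TERMS OF THE EIGENSTATES OF H 

Assume that the Hamiltonian $H$ does not depend explicitly on time, and 
call $|\varphi_n\rangle$ and $E_n$ its eigenstates and eigenvalues: 

$$ H |\varphi_n\rangle = E_n |\varphi_n\rangle \tag{11} $$

According to formula (18) of $F_{III}$, we have: 

$$ U(t_2, t_1) = e^{-iH(t_2 - t_1)/\hbar} \tag{12} $$

The closure relation: 

$$ \sum_n |\varphi_n\rangle \langle \varphi_n| = \mathbb{1} \tag{13} $$

enables us to write (12) in the form: 

$$ U(t_2, t_1) = e^{-iH(t_2 - t_1)/\hbar} \sum_n |\varphi_n\rangle \langle \varphi_n| \tag{14} $$

that is, taking (11) into account: 

$$ U(t_2, t_1) = \sum_n e^{-iE_n(t_2 - t_1)/\hbar} \, |\varphi_n\rangle \langle \varphi_n| \tag{15} $$

To calculate $K(2, 1)$, it then suffices to take the matrix element of both sides 
of (15) between $\langle \mathbf{r}_2|$ and $|\mathbf{r}_1\rangle$ and to multiply it by $\theta(t_2 - t_1)$. Since: 

$$ \langle \mathbf{r}_2 | \varphi_n \rangle = \varphi_n(\mathbf{r}_2) \tag{16} $$

$$ \langle \varphi_n | \mathbf{r}_1 \rangle = \varphi_n^*(\mathbf{r}_1) \tag{17} $$

there results: 

$$ K(\mathbf{r}_2, t_2; \mathbf{r}_1, t_1) = \theta(t_2 - t_1) \sum_n \varphi_n^*(\mathbf{r}_1) \, \varphi_n(\mathbf{r}_2) \, e^{-iE_n(t_2 - t_1)/\hbar} \tag{18} $$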
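
Formula (18) can be tried out on a system whose eigenstates are known in closed form. The sketch below is an illustration, not part of the text: it assumes a particle in a one-dimensional infinite well of width `L` (eigenfunctions √(2/L) sin(nπx/L), energies n²π²ħ²/2mL²), truncates the sum over n at `N_MODES`, and replaces the integral of formula (1) by a midpoint rule on `M` points. It then checks that propagating the ground state with this truncated K simply multiplies it by the phase factor exp(-iE₁t/ħ).

```python
import cmath
import math

hbar, m, L = 1.0, 1.0, 1.0   # convenient units and well width (assumptions)
N_MODES = 20                 # truncation of the sum over n in (18)
M = 200                      # number of midpoint-rule integration points

def phi(n, x):
    """Eigenfunction of the infinite well on [0, L] (real-valued)."""
    return math.sqrt(2 / L) * math.sin(n * math.pi * x / L)

def energy(n):
    """Energy of the n-th eigenstate of the infinite well."""
    return (n * math.pi * hbar / L) ** 2 / (2 * m)

def K(x2, t2, x1, t1):
    """Retarded propagator built from the truncated eigenstate sum (18)."""
    if t2 < t1:
        return 0j  # step function theta(t2 - t1)
    return sum(
        phi(n, x1) * phi(n, x2) * cmath.exp(-1j * energy(n) * (t2 - t1) / hbar)
        for n in range(1, N_MODES + 1)
    )

dx = L / M
grid = [(k + 0.5) * dx for k in range(M)]

def propagate(wave, x2, t2, t1=0.0):
    """Formula (1): integrate K(x2, t2; x1, t1) * wave(x1) over x1."""
    return sum(K(x2, t2, x1, t1) * wave(x1) for x1 in grid) * dx

x2, t2 = 0.3, 0.4
numeric = propagate(lambda x: phi(1, x), x2, t2)
exact = cmath.exp(-1j * energy(1) * t2 / hbar) * phi(1, x2)
error = abs(numeric - exact)
print("propagated ground state:", numeric)
print("expected value         :", exact)
```

Only the n = 1 term of the sum survives the integration, by orthogonality of the eigenfunctions, which is why the truncation does not affect this particular check.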


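d. EQUATION SATISFIED BY K(2, 1) 

$\varphi_n(\mathbf{r}_2) \, e^{-iE_n t_2/\hbar}$ is a solution of the Schrödinger equation. We deduce from 
this that, in the $\{ |\mathbf{r}\rangle \}$ representation: 

$$ \left[ i\hbar \frac{\partial}{\partial t_2} - H\left(\mathbf{r}_2, \frac{\hbar}{i} \boldsymbol{\nabla}_2\right) \right] \varphi_n(\mathbf{r}_2) \, e^{-iE_n t_2/\hbar} = 0 \tag{19} $$

(where $\boldsymbol{\nabla}_2$ is a condensed notation which designates the three operators 
$\frac{\partial}{\partial x_2}$, $\frac{\partial}{\partial y_2}$, $\frac{\partial}{\partial z_2}$). Let us then apply, to both sides of equation (18), the operator 
$i\hbar \frac{\partial}{\partial t_2} - H\left(\mathbf{r}_2, \frac{\hbar}{i} \boldsymbol{\nabla}_2\right)$, 
which acts only on the variables $\mathbf{r}_2$ and $t_2$. We know [cf. appendix II, relation (44)] 
that: 

$$ \frac{\partial}{\partial t_2} \theta(t_2 - t_1) = \delta(t_2 - t_1) \tag{20} $$

Consequently, using (19), we obtain: 

$$ \left[ i\hbar \frac{\partial}{\partial t_2} - H\left(\mathbf{r}_2, \frac{\hbar}{i} \boldsymbol{\nabla}_2\right) \right] K(\mathbf{r}_2, t_2; \mathbf{r}_1, t_1) = i\hbar \, \delta(t_2 - t_1) \sum_n \varphi_n^*(\mathbf{r}_1) \, \varphi_n(\mathbf{r}_2) \, e^{-iE_n(t_2 - t_1)/\hbar} \tag{21} $$

Because of the presence of $\delta(t_2 - t_1)$, we can replace $t_2 - t_1$ by zero in the sum 
over $n$ which appears on the right-hand side of (21). This makes the exponential 
equal to 1. We are thus left with the quantity $\sum_n \varphi_n(\mathbf{r}_2) \, \varphi_n^*(\mathbf{r}_1)$, which, according 
to (13), (16) and (17), is equal to $\delta(\mathbf{r}_2 - \mathbf{r}_1)$ [taking the matrix element of (13) 
between $\langle \mathbf{r}_2|$ and $|\mathbf{r}_1\rangle$]. Finally, $K$ satisfies the equation: 

$$ \left[ i\hbar \frac{\partial}{\partial t_2} - H\left(\mathbf{r}_2, \frac{\hbar}{i} \boldsymbol{\nabla}_2\right) \right] K(\mathbf{r}_2, t_2; \mathbf{r}_1, t_1) = i\hbar \, \delta(t_2 - t_1) \, \delta(\mathbf{r}_2 - \mathbf{r}_1) \tag{22} $$

The solutions of equation (22), whose right-hand side is proportional to a 
four-dimensional “delta function”, are called Green's functions. It can be shown that, 
to determine $K(2, 1)$ completely, it suffices to associate with (22) the boundary 
condition: 

$$ K(\mathbf{r}_2, t_2; \mathbf{r}_1, t_1) = 0 \ \text{ if } t_2 < t_1 \tag{23} $$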

Equations (22) and (23) have interesting implications, in particular with 
regard to perturbation theory, which we shall study in chapter XI. 
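
As a concrete illustration, which is a standard result quoted here rather than something derived in this complement, the free particle ($H = \mathbf{P}^2/2m$) has a retarded propagator of closed form:

```latex
K_0(\mathbf{r}_2, t_2; \mathbf{r}_1, t_1)
  = \theta(t_2 - t_1)
    \left[ \frac{m}{2\pi i \hbar\,(t_2 - t_1)} \right]^{3/2}
    \exp\!\left[ \frac{i m\,(\mathbf{r}_2 - \mathbf{r}_1)^2}{2\hbar\,(t_2 - t_1)} \right]
```

For $t_2 > t_1$ this function satisfies the free Schrödinger equation in the variables $(\mathbf{r}_2, t_2)$, it vanishes for $t_2 < t_1$ as required by (23), and it tends to $\delta(\mathbf{r}_2 - \mathbf{r}_1)$ when $t_2 \to t_1$ from above, which is what produces the right-hand side of (22).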


3. Lagrangian formulation of quantum mechanics 

a. CONCEPT OF A SPACE-TIME PATH 

Let us consider, in space-time, the two points $(\mathbf{r}_1, t_1)$ and $(\mathbf{r}_2, t_2)$ (cf. fig. 3; 
$t$ is plotted as the abscissa, and the ordinate axis represents the set of the three 
spatial axes). Choose $N$ intermediate times $t_{\alpha_i}$ ($i = 1, 2, ..., N$), evenly spaced 
between $t_1$ and $t_2$: 

$$ t_1 < t_{\alpha_1} < t_{\alpha_2} < \cdots < t_{\alpha_{N-1}} < t_{\alpha_N} < t_2 \tag{24} $$

and, for each of them, a position $\mathbf{r}_{\alpha_i}$ in space. We can thus construct, when $N$ 
approaches infinity, a function $\mathbf{r}(t)$ (which we shall assume to be continuous) such 
that: 

$$ \mathbf{r}(t_1) = \mathbf{r}_1 \tag{25-a} $$

$$ \mathbf{r}(t_2) = \mathbf{r}_2 \tag{25-b} $$

$\mathbf{r}(t)$ is said to define a space-time path between $(\mathbf{r}_1, t_1)$ and $(\mathbf{r}_2, t_2)$: such a path might 
be thought of as the trajectory of a physical point leaving the point $\mathbf{r}_1$ at time $t_1$ and 
arriving at $\mathbf{r}_2$ at time $t_2$. 



FIGURE 3 

Diagram associated with a “space-time path”: one picks $N$ intermediate 
times $t_{\alpha_i}$ ($i = 1, 2, ..., N$) evenly spaced between $t_1$ and $t_2$, 
and chooses for each of them a value of $\mathbf{r}$. 


b. DECOMPOSITION OF K(2, 1) INTO A SUM OF PARTIAL AMPLITUDES 

Let us first return to the case where the number $N$ of intermediate times 
is finite. Formula (10) of complement $F_{III}$ enables us to write: 

$$ U(t_2, t_1) = U(t_2, t_{\alpha_N}) \, U(t_{\alpha_N}, t_{\alpha_{N-1}}) \cdots U(t_{\alpha_2}, t_{\alpha_1}) \, U(t_{\alpha_1}, t_1) \tag{26} $$

Let us take the matrix elements of both sides of (26) between $\langle \mathbf{r}_2|$ and $|\mathbf{r}_1\rangle$ and 
insert the closure relation relative to the $\{ |\mathbf{r}\rangle \}$ representation for each 
intermediate time $t_{\alpha_i}$. According to (6) and (24), we thus obtain: 

$$ K(2, 1) = \int d^3r_{\alpha_N} \int d^3r_{\alpha_{N-1}} \cdots \int d^3r_{\alpha_1} \, K(2, \alpha_N) \, K(\alpha_N, \alpha_{N-1}) \cdots K(\alpha_2, \alpha_1) \, K(\alpha_1, 1) \tag{27} $$

Now consider the product: 

$$ K(2, \alpha_N) \, K(\alpha_N, \alpha_{N-1}) \cdots K(\alpha_2, \alpha_1) \, K(\alpha_1, 1) \tag{28} $$

Generalizing the argument of §2-b, we can interpret this term as being the 
probability amplitude for the particle, having left the point 1 $(\mathbf{r}_1, t_1)$, to arrive at 
the point 2 $(\mathbf{r}_2, t_2)$, having passed successively through all the points $\alpha_i$ $(\mathbf{r}_{\alpha_i}, t_{\alpha_i})$ of 
figure 3. Note that, in formula (27), one is summing over all possible positions $\mathbf{r}_{\alpha_i}$ 
at each time $t_{\alpha_i}$. 

We now let $N$ approach infinity*. A series of points $\alpha_i$ then defines a space-time 
path between 1 and 2, and the product (28) associated with it becomes the 
probability amplitude which the particle has of following this path. Of course, the 
number of integrations in formula (27) becomes infinite. It is understandable, 
however, that the summation over the set of possible positions at each time 
should reduce to a summation over the set of possible paths. $K(2, 1)$ is thus seen 
to be a sum (in fact, an integral) which corresponds to the coherent superposition 
of the amplitudes associated with all possible space-time paths starting from 1 
and ending at 2. 

* In this treatment, we make no attempt to be mathematically rigorous. 
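
The simplest case of (27), a single intermediate time, can be verified numerically. The sketch below reuses an illustrative model that is not part of the text: a one-dimensional infinite well of width `L`, with the propagator built from a truncated eigenstate sum as in (18) and the integral over the intermediate position replaced by a midpoint rule. The truncation `N_MODES`, the grid size `M` and all numerical values are assumptions of the sketch.

```python
import cmath
import math

hbar, m, L = 1.0, 1.0, 1.0   # convenient units and well width (assumptions)
N_MODES = 15                 # truncation of the eigenstate sum
M = 200                      # midpoint-rule points for the intermediate position

def phi(n, x):
    """Eigenfunction of the infinite well on [0, L]."""
    return math.sqrt(2 / L) * math.sin(n * math.pi * x / L)

def energy(n):
    """Energy of the n-th eigenstate."""
    return (n * math.pi * hbar / L) ** 2 / (2 * m)

def K(x2, t2, x1, t1):
    """Truncated eigenstate expansion of the retarded propagator."""
    if t2 < t1:
        return 0j
    return sum(
        phi(n, x1) * phi(n, x2) * cmath.exp(-1j * energy(n) * (t2 - t1) / hbar)
        for n in range(1, N_MODES + 1)
    )

dx = L / M
grid = [(k + 0.5) * dx for k in range(M)]

x1, t1 = 0.2, 0.0      # point 1
x2, t2 = 0.7, 0.5      # point 2
t_alpha = 0.3          # one intermediate time, t1 < t_alpha < t2

direct = K(x2, t2, x1, t1)
# Sum over all intermediate positions x_alpha, as in (27) with N = 1.
composed = sum(K(x2, t2, xa, t_alpha) * K(xa, t_alpha, x1, t1) for xa in grid) * dx

composition_error = abs(direct - composed)
print("K(2, 1) computed directly      :", direct)
print("K(2, 1) via intermediate points:", composed)
```

Each individual product K(2, α)K(α, 1) is an amplitude for passing through one particular intermediate point; only their sum over all positions reproduces K(2, 1).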

c. FEYNMAN'S POSTULATES 

The concepts of a propagator and a space-time path permit a new 
formulation of the postulate concerning the time evolution of physical systems. We shall 
outline here such a formulation for the case of a spinless particle. 

We define $K(2, 1)$ directly as being the probability amplitude for the particle, 
starting from $\mathbf{r}_1$ at time $t_1$, to arrive at $\mathbf{r}_2$ at time $t_2$. We then postulate that: 

(i) $K(2, 1)$ is the sum of an infinity of partial amplitudes, one for each of 
the space-time paths connecting $(\mathbf{r}_1, t_1)$ with $(\mathbf{r}_2, t_2)$. 

(ii) The partial amplitude $K_\Gamma(2, 1)$ associated with one of these paths $\Gamma$ is 
determined in the following manner: let $S_\Gamma$ be the classical action calculated along $\Gamma$, 
that is: 

$$ S_\Gamma = \int_{(\Gamma)} \mathcal{L}(\mathbf{r}, \mathbf{p}, t) \, dt \tag{29} $$

where $\mathcal{L}(\mathbf{r}, \mathbf{p}, t)$ is the Lagrangian of the particle (cf. appendix III). $K_\Gamma(2, 1)$ is then 
equal to: 

$$ K_\Gamma(2, 1) = N \, e^{\frac{i}{\hbar} S_\Gamma} \tag{30} $$

where $N$ is a normalization constant (which can, moreover, be determined explicitly). 

It can be shown that the Schrodinger equation follows as a consequence of 
these two postulates. Similarly, one can deduce the canonical commutation relation 
between the components of the observables R and P. The two preceding postulates 
therefore permit a formulation of quantum mechanics which is different from that 
of chapter III, but equivalent. 


d. THE CLASSICAL LIMIT AND HAMILTON'S PRINCIPLE 


The formulation which we have just evoked is particularly useful for 
discussing the relation between quantum and classical mechanics. 

Consider a situation in which the actions $S_\Gamma$ are much larger than $\hbar$. In this
case, the variation $\Delta S_\Gamma$ of the action between two different paths, even if its relative
value is small ($\Delta S_\Gamma / S_\Gamma \ll 1$), is usually much larger than $\hbar$. Consequently, the phase
of $K_\Gamma(2, 1)$ varies rapidly, and the contributions to the global amplitude $K(2, 1)$ of
most of the paths $\Gamma$ cancel out by interference. Let us assume, however, that there
exists a path $\Gamma_0$ for which the action is stationary (that is, does not vary, to first
order, when one goes from $\Gamma_0$ to another infinitesimally close path). The amplitude
$K_{\Gamma_0}(2, 1)$ then interferes constructively with those of the paths next to $\Gamma_0$, since,
this time, their phases remain practically equal. Consequently, when the actions $S_\Gamma$
are much larger than $\hbar$, one is in a "quasi-classical" situation: to obtain $K(2, 1)$,
one can ignore all the paths except $\Gamma_0$ and paths infinitely close to it; it can then
be said that, between points 1 and 2, the particle follows the trajectory $\Gamma_0$. Now this
is indeed the classical trajectory, defined by Hamilton's principle as being the path
along which the action is minimal. Feynman's postulates therefore include, at the
classical limit, Hamilton's principle of least action. They enable us, moreover, to
associate with it the following picture: it is the wave associated with the particle
which, "exploring" the various possible paths, picks the one for which the action
will be the smallest.
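The interference argument of this paragraph can be checked numerically. The sketch below is only an illustration under stated assumptions, not part of the formalism above: it takes a free particle (Lagrangian $m\dot{x}^2/2$) going from $x = 0$ at $t = 0$ to $x = 1$ at $t = 1$, restricts the sum over paths to a hypothetical one-parameter family $x(t) = t + a \sin \pi t$ (the classical path is $a = 0$), and uses the arbitrary values $m = 1$, $\hbar = 10^{-3}$, so that the action is much larger than $\hbar$. The function names `action` and `band_amplitude` are introduced here for the example.

```python
import numpy as np

# Assumed illustrative values: the action along the classical path is 0.5,
# i.e. much larger than hbar (quasi-classical situation).
m, hbar = 1.0, 1.0e-3

def action(a, n=2001):
    """Action of a free particle along the trial path x(t) = t + a*sin(pi*t)."""
    t = np.linspace(0.0, 1.0, n)
    velocity = 1.0 + a * np.pi * np.cos(np.pi * t)
    lagrangian = 0.5 * m * velocity**2
    dt = t[1] - t[0]
    # Trapezoidal rule for the time integral of the Lagrangian.
    return dt * (lagrangian.sum() - 0.5 * (lagrangian[0] + lagrangian[-1]))

def band_amplitude(a_min, a_max, n=200001):
    """Sum of exp(i S / hbar) over the paths with a_min <= a <= a_max."""
    a = np.linspace(a_min, a_max, n)
    # Closed form of action(a) for this family: S = (m/2) (1 + pi^2 a^2 / 2).
    s = 0.5 * m * (1.0 + 0.5 * (np.pi * a) ** 2)
    return np.mean(np.exp(1j * s / hbar)) * (a_max - a_min)

near = band_amplitude(-0.1, 0.1)   # paths close to the classical one (a = 0)
far = band_amplitude(0.5, 0.7)     # equally wide band of distant paths

print("S(0) =", action(0.0), " S(0.3) =", action(0.3))
print("|near| =", abs(near), " |far| =", abs(far))
```

In this restricted family the action is smallest for $a = 0$, and the band of paths around $a = 0$ contributes far more to the sum than an equally wide band of distant paths, whose phases oscillate rapidly and largely cancel.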

The Lagrangian formulation of quantum mechanics presents numerous 
other advantages, which we shall not examine in detail. Let us point out, for 
example, that it lends itself easily to relativistic generalization since one is already 
reasoning in space-time. Moreover, it can be applied to any classical system 
(not necessarily mechanical) governed by a variational principle (for example, a 
field). 

However, it has a certain number of disadvantages on the mathematical
level (summation over an infinite number of paths, the limit $N \to \infty$, ...).


References and suggestions for further reading: 

Feynman’s original article (2.38); Feynman and Hibbs (2.25); Bjorken and 
Drell (2.6), chaps. 6 and 7. 
1. Introduction

2. Definition of the lifetime 

3. Phenomenological description of the instability of a state 


1. Introduction


Consider a conservative system (a system whose Hamiltonian $H$ is time-independent).
Assume that at time $t = 0$ the state of the system is one of the
eigenstates $|\varphi_n\rangle$ of the Hamiltonian, of energy $E_n$:

$$|\psi(t = 0)\rangle = |\varphi_n\rangle \tag{1}$$

with:

$$H\,|\varphi_n\rangle = E_n\,|\varphi_n\rangle \tag{2}$$

In this case, the system remains indefinitely in the same state (a stationary
state; §D-2-b of chapter III).

We shall study the hydrogen atom in chapter VII by solving the eigenvalue
equation of its Hamiltonian, which is a time-independent operator. The states of
the hydrogen atom (that is, the possible values of its energy) which we shall find
are in very good agreement with the experimentally measured energies. However,
it is known that most of these states are actually unstable: if, at the instant $t = 0$,
the atom is in an excited state (an eigenstate $|\varphi_n\rangle$ corresponding to an energy $E_n$
greater than that of the ground state, which is the lowest energy state), it generally
"falls back" into this ground state by emitting one or several photons. The
state $|\varphi_n\rangle$ is not really, therefore, a stationary state in this case.

This problem arises from the fact that, in calculations of the type used
in chapter VII, the system under study (the hydrogen atom) is treated as if it were
totally isolated, while it is actually in constant interaction with the electromagnetic
field. Although the evolution of the global system "atom + electromagnetic field"
can be perfectly well described by a Hamiltonian, it is not rigorously possible to
define a Hamiltonian for the hydrogen atom alone [cf. comment (v) of §5-b of
complement $E_{III}$]. However, since the coupling between the atom and the field
happens to be weak (it can be shown that its "force" is characterized by the fine
structure constant $\alpha \simeq 1/137$, which we shall introduce in chapter VII), the
approximation consisting of completely neglecting the existence of the electromagnetic
field is very good, except, of course, if we are interested precisely in the instability
of the states.
COMMENTS:

(i) If a strictly conservative, isolated system, at the initial instant, is in a state
formed by a linear combination of several stationary states, it evolves over
time and does not always remain in the same state. But its Hamiltonian is
a constant of the motion, and consequently (cf. chapter III, §D-2-c), the
probability of finding one energy value or another is independent of time, as
is the mean value of the energy. On the other hand, in the case of an unstable
state, one state is transformed irreversibly into another, with a loss of energy
for the system: this energy is taken away by the photons emitted*.

(ii) The instability of the excited states of an atom is caused by the spontaneous
emission of photons; the ground state is stable, since there exists no lower
energy state. Recall, nevertheless, that atoms can also absorb light energy
and so ascend to a higher energy level.

We intend to indicate here how to take the instability of a state into account 
phenomenologically. The description will not be rigorous, as we shall continue 
to consider the system as if it were isolated. We shall try to incorporate this instability 
as simply as possible into the quantum description of the system. 

Complement $D_{XIII}$ presents a more rigorous treatment of this problem,
justifying the phenomenological approach used here.


2. Definition of the lifetime 


Experiments show that the instability of a state can often be characterized
by just one parameter $\tau$, having the dimensions of a time, which is called the lifetime
of the state. More precisely, if one prepares the system at time $t = 0$ in the
unstable state $|\varphi_n\rangle$, one observes that the probability $\mathscr{P}(t)$ of its still being excited
at a later time $t$ is equal to:

$$\mathscr{P}(t) = e^{-t/\tau} \tag{3}$$


This result can also be expressed in the following way. Consider a large
number $\mathscr{N}$ of identical independent systems, all prepared at time $t = 0$ in the
state $|\varphi_n\rangle$. At time $t$, there will remain $N(t) = \mathscr{N} e^{-t/\tau}$ in this state. Between
times $t$ and $t + dt$, a certain number $dn(t)$ of systems leave the unstable state:

$$dn(t) = N(t) - N(t + dt) = -\frac{dN(t)}{dt}\, dt = N(t)\, \frac{dt}{\tau} \tag{4}$$

For each of the $N(t)$ systems which are still in the state $|\varphi_n\rangle$ at time $t$, a probability
can therefore be defined:

$$d\varpi(t) = \frac{dn(t)}{N(t)} = \frac{dt}{\tau} \tag{5}$$

of their leaving this state during the time interval $dt$ following the instant $t$. We see
that $d\varpi(t)$ is independent of $t$: the system is said to have a probability per unit
time $1/\tau$ of leaving the unstable state.


* which, moreover, may also take away linear and angular momentum. 
Let us calculate the mean value of the time the system remains in the
unstable state. It is equal to:

$$\int_0^\infty t\, e^{-t/\tau}\, \frac{dt}{\tau} = \tau \tag{6}$$

$\tau$ is therefore the mean time the system spends in the state $|\varphi_n\rangle$; this is why
it is called the lifetime of this state.
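The link between a constant probability per unit time and the exponential law can be illustrated by a small simulation. This is a sketch under assumptions that are not in the text: time is cut into steps of length `dt` (taken small compared with the lifetime), each system still in the unstable state decays during a step with probability `dt / tau`, and the values of `tau`, `dt` and `n_systems` are arbitrary.

```python
import numpy as np

rng = np.random.default_rng(0)

tau = 1.0          # lifetime (arbitrary units, assumed value)
dt = 0.02 * tau    # time step, small compared with tau
n_systems = 50_000

# Each system still in the unstable state decays during a step with the
# constant probability dt / tau; the index of the step at which the decay
# occurs is then geometrically distributed.
decay_step = rng.geometric(dt / tau, size=n_systems)
decay_time = decay_step * dt

mean_time = decay_time.mean()
surviving_at_tau = np.mean(decay_time > tau)

print("mean time spent in the unstable state:", mean_time)
print("fraction still excited at t = tau:", surviving_at_tau,
      "vs exp(-1) =", np.exp(-1.0))
```

Up to statistical fluctuations and the discretization of time, the mean time spent in the unstable state comes out close to `tau`, and the fraction of systems still excited at $t = \tau$ is close to $e^{-1}$.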

COMMENTS:

(i) For a stable state, $\mathscr{P}(t)$ is always equal to 1, and the lifetime $\tau$ is infinite.

(ii) A remarkable property of the lifetime $\tau$ is that it does not depend on the
procedure used to prepare the system in the unstable state, that is, on its
previous "history": the lifetime is a characteristic of the unstable state itself.

(iii) According to the time-energy uncertainty relation (§D-2-e of chapter III),
the time $\tau$ characteristic of the evolution of an unstable state is associated with
an uncertainty in the energy $\Delta E$ given by:

$$\Delta E \simeq \frac{\hbar}{\tau} \tag{7}$$

One indeed finds that the energy of an unstable state cannot be determined
with arbitrary accuracy, but only to within an uncertainty of the order of $\Delta E$.
$\Delta E$ is called the natural width of this state. For the case of the hydrogen atom,
the width of the various states is negligible compared to their separation.
This explains why we can treat them, in a first approximation, as if they were
stable.
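To give an order of magnitude for relation (7), here is a short numerical sketch. The values of $\hbar$ and of the electron volt are those of the table of constants at the beginning of the book; the lifetime of about $1.6 \times 10^{-9}$ s for the 2p level of hydrogen and the separation of about 10.2 eV between the $n = 2$ and $n = 1$ levels are assumptions introduced here for illustration, not values taken from this complement.

```python
# Constants taken from the table of physical constants of the book.
hbar = 1.054589e-34      # joule second
eV = 1.602189e-19        # joule

# Assumed inputs (not given in this complement): lifetime of the 2p level
# and energy separation between the n = 2 and n = 1 levels of hydrogen.
tau_2p = 1.6e-9          # second
separation_eV = 10.2     # eV

natural_width_eV = hbar / tau_2p / eV     # relation (7)
ratio = natural_width_eV / separation_eV

print(f"natural width ~ {natural_width_eV:.2e} eV")
print(f"width / separation ~ {ratio:.1e}")
```

With these assumed inputs the natural width is many orders of magnitude smaller than the level separation, which is the sense in which the excited states can be treated as stable in a first approximation.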


3. Phenomenological description of the instability of a state 

First let us consider a conservative system, prepared, at the initial time, in the
eigenstate $|\varphi_n\rangle$ of the Hamiltonian $H$. According to the rule (D-54) of chapter III,
the state vector, at time $t$, becomes:

$$|\psi(t)\rangle = e^{-iE_n t/\hbar}\, |\varphi_n\rangle \tag{8}$$

The probability $\mathscr{P}_n(t)$ of finding, in a measurement at time $t$, the system in the
state $|\varphi_n\rangle$ is:

$$\mathscr{P}_n(t) = \left| e^{-iE_n t/\hbar} \right|^2 \tag{9}$$

Since the energy $E_n$ is real (since $H$ is an observable), this probability is constant
and equal to 1: we again find that $|\varphi_n\rangle$ is a stationary state.

Let us examine what would happen if, in expression (9), we replaced the
energy $E_n$ by the complex number:

$$E'_n = E_n - i\hbar\, \frac{\gamma_n}{2} \tag{10}$$

The probability $\mathscr{P}_n(t)$ then becomes:

$$\mathscr{P}_n(t) = \left| e^{-i\left(E_n - i\hbar \frac{\gamma_n}{2}\right) t/\hbar} \right|^2 = e^{-\gamma_n t} \tag{11}$$
In this case, the probability of finding the system in the state $|\varphi_n\rangle$ decreases
exponentially with time, as in formula (3). Therefore, to take into account
phenomenologically the instability of a state $|\varphi_n\rangle$ whose lifetime is $\tau_n$, it suffices to
add, as in (10), an imaginary part to its energy, by setting:

$$\gamma_n = \frac{1}{\tau_n} \tag{12}$$

COMMENT:

When $E_n$ is replaced by $E'_n$, the norm of the state vector written in (8)
becomes $e^{-\gamma_n t/2}$ and therefore varies with time. This result is not surprising.
We saw in §D-1-c of chapter III that the conservation of the norm of the state
vector arose from the Hermitian nature of the Hamiltonian operator; now,
an operator whose eigenvalues are complex, as are the $E'_n$, cannot be Hermitian.
Of course, as we pointed out in §1, this is due to the fact that the system under
study is part of a larger system (it is interacting with the electromagnetic
field) and its evolution cannot be described rigorously by means of a
Hamiltonian. It is already rather remarkable that its evolution can be simply
explained by introducing a "Hamiltonian" with complex eigenvalues.
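The effect of the complex energy (10) can be verified numerically. This is a minimal sketch with arbitrary assumed values (units in which $\hbar = 1$, $E_n = 2$, $\gamma_n = 0.5$); it only evaluates the phase factor of (8) with $E_n$ replaced by $E'_n$.

```python
import numpy as np

hbar = 1.0                 # units with hbar = 1 (assumption)
energy, gamma = 2.0, 0.5   # illustrative values of E_n and gamma_n = 1/tau_n

t = np.linspace(0.0, 10.0, 101)
complex_energy = energy - 1j * hbar * gamma / 2.0     # relation (10)
amplitude = np.exp(-1j * complex_energy * t / hbar)   # factor in (8), E_n -> E'_n

norm = np.abs(amplitude)   # norm of the state vector
probability = norm**2      # probability (11)

print(np.allclose(probability, np.exp(-gamma * t)))
print(np.allclose(norm, np.exp(-gamma * t / 2.0)))
```

The real part of the energy only contributes a phase; the decay of the norm comes entirely from the imaginary part $-\hbar\gamma_n/2$.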
1. In a one-dimensional problem, consider a particle whose wave function is:

$$\psi(x) = N\, \frac{e^{i p_0 x/\hbar}}{\sqrt{x^2 + a^2}}$$

where $a$ and $p_0$ are real constants and $N$ is a normalization coefficient.

a. Determine $N$ so that $\psi(x)$ is normalized.

b. The position of the particle is measured. What is the probability of finding
a result between $-\dfrac{a}{\sqrt{3}}$ and $+\dfrac{a}{\sqrt{3}}$?

c. Calculate the mean value of the momentum of a particle which has $\psi(x)$ for
its wave function.


2. Consider, in a one-dimensional problem, a particle of mass $m$ whose wave
function at time $t$ is $\psi(x, t)$.

a. At time $t$, the distance $d$ of this particle from the origin is measured.
Write, as a function of $\psi(x, t)$, the probability $\mathscr{P}(d_0)$ of finding a result greater
than a given length $d_0$. What are the limits of $\mathscr{P}(d_0)$ when $d_0 \to 0$ and $d_0 \to \infty$?

b. Instead of performing the measurement of question a, one measures the
velocity $v$ of the particle at time $t$. Express, as a function of $\psi(x, t)$, the probability
of finding a result greater than a given value $v_0$.


3. The wave function of a free particle, in a one-dimensional problem, is given
at time $t = 0$ by:

$$\psi(x, 0) = N \int_{-\infty}^{+\infty} dk\; e^{-|k|/k_0}\, e^{ikx}$$

where $k_0$ and $N$ are constants.

a. What is the probability $\mathscr{P}(p_1, 0)$ that a measurement of the momentum,
performed at time $t = 0$, will yield a result included between $-p_1$ and $+p_1$?
Sketch the function $\mathscr{P}(p_1, 0)$.

b. What happens to this probability $\mathscr{P}(p_1, t)$ if the measurement is performed
at time $t$? Interpret.

c. What is the form of the wave packet at time $t = 0$? Calculate for
this time the product $\Delta X \cdot \Delta P$; what is your conclusion? Describe qualitatively
the subsequent evolution of the wave packet.
4. Spreading of a free wave packet

Consider a free particle.

a. Show, applying Ehrenfest's theorem, that $\langle X \rangle$ is a linear function of
time, the mean value $\langle P \rangle$ remaining constant.

b. Write the equations of motion for the mean values $\langle X^2 \rangle$ and $\langle XP + PX \rangle$.
Integrate these equations.

c. Show that, with a suitable choice of the time origin, the root-mean-square
deviation $\Delta X$ is given by:

$$(\Delta X)^2 = \frac{1}{m^2}\, (\Delta P)_0^2\, t^2 + (\Delta X)_0^2$$

where $(\Delta X)_0$ and $(\Delta P)_0$ are the root-mean-square deviations at the initial time.

How does the width of the wave packet vary as a function of time (see §3-c
of complement $G_I$)? Give a physical interpretation.


5. Particle subject to a constant force 


In a one-dimensional problem, consider a particle of potential energy
$V(X) = -fX$, where $f$ is a positive constant [$V(X)$ arises, for example, from
a gravity field or a uniform electric field].

a. Write Ehrenfest's theorem for the mean values of the position $X$ and the
momentum $P$ of the particle. Integrate these equations; compare with the classical
motion.

b. Show that the root-mean-square deviation $\Delta P$ does not vary over time.

c. Write the Schrödinger equation in the $\{ |p\rangle \}$ representation. Deduce
from it a relation between $\dfrac{\partial}{\partial t} |\langle p | \psi(t) \rangle|^2$ and $\dfrac{\partial}{\partial p} |\langle p | \psi(t) \rangle|^2$. Integrate the
equation thus obtained; give a physical interpretation.


6. Consider the three-dimensional wave function

$$\psi(x, y, z) = N\, e^{-\left[ \frac{|x|}{2a} + \frac{|y|}{2b} + \frac{|z|}{2c} \right]}$$

where $a$, $b$ and $c$ are three positive lengths.

a. Calculate the constant $N$ which normalizes $\psi$.

b. Calculate the probability that a measurement of $X$ will yield a result
included between 0 and $a$.

c. Calculate the probability that simultaneous measurements of $Y$ and $Z$ will
yield results included respectively between $-b$ and $+b$, and $-c$ and $+c$.

d. Calculate the probability that a measurement of the momentum will yield
a result included in the element $dp_x\, dp_y\, dp_z$ centered at the point $p_x = p_y = 0$;
$p_z = \hbar/c$.




7. Let $\psi(x, y, z) = \psi(\mathbf{r})$ be the normalized wave function of a particle. Express
in terms of $\psi(\mathbf{r})$ the probability for:

a. a measurement of the abscissa $X$, to yield a result included between $x_1$
and $x_2$;

b. a measurement of the component $P_x$ of the momentum, to yield a
result included between $p_1$ and $p_2$;

c. simultaneous measurements of $X$ and $P_z$, to yield:

$$x_1 \le x \le x_2, \qquad p_z \ge 0$$

d. simultaneous measurements of $P_x$, $P_y$, $P_z$, to yield:

$$p_1 \le p_x \le p_2, \qquad p_3 \le p_y \le p_4, \qquad p_5 \le p_z \le p_6$$

Show that this probability is equal to the result of b when $p_3, p_5 \to -\infty$;
$p_4, p_6 \to +\infty$;

e. a measurement of the component $U = \dfrac{1}{\sqrt{3}} (X + Y + Z)$ of the position,
to yield a result included between $u_1$ and $u_2$.


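8. Let $\mathbf{J}(\mathbf{r})$ be the probability current associated with a wave function $\psi(\mathbf{r})$
describing the state of a particle of mass $m$ [chap. III, relations (D-17) and (D-19)].

a. Show that:

$$m \int d^3r\; \mathbf{J}(\mathbf{r}) = \langle \mathbf{P} \rangle$$

where $\langle \mathbf{P} \rangle$ is the mean value of the momentum.

b. Consider the operator $\mathbf{L}$ (orbital angular momentum) defined by
$\mathbf{L} = \mathbf{R} \times \mathbf{P}$. Are the three components of $\mathbf{L}$ Hermitian operators? Establish the
relation:

$$m \int d^3r\; [\mathbf{r} \times \mathbf{J}(\mathbf{r})] = \langle \mathbf{L} \rangle$$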


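9. One wants to show that the physical state of a (spinless) particle is completely
defined by specifying the probability density $\rho(\mathbf{r}) = |\psi(\mathbf{r})|^2$ and the probability
current $\mathbf{J}(\mathbf{r})$.

a. Assume the function $\psi(\mathbf{r})$ known and let $\xi(\mathbf{r})$ be its argument:

$$\psi(\mathbf{r}) = \sqrt{\rho(\mathbf{r})}\; e^{i\xi(\mathbf{r})}$$

Show that:

$$\mathbf{J}(\mathbf{r}) = \frac{\hbar}{m}\, \rho(\mathbf{r})\, \nabla \xi(\mathbf{r})$$

Deduce that two wave functions leading to the same density $\rho(\mathbf{r})$ and current $\mathbf{J}(\mathbf{r})$
can differ only by a global phase factor.

b. Given arbitrary functions $\rho(\mathbf{r})$ and $\mathbf{J}(\mathbf{r})$, show that a quantum state $\psi(\mathbf{r})$
can be associated with them only if $\nabla \times \mathbf{v}(\mathbf{r}) = 0$, where $\mathbf{v}(\mathbf{r}) = \mathbf{J}(\mathbf{r})/\rho(\mathbf{r})$ is the
velocity associated with the probability fluid.

c. Now assume that the particle is submitted to a magnetic field $\mathbf{B}(\mathbf{r}) = \nabla \times \mathbf{A}(\mathbf{r})$
[see chap. III, definition (D-20) of the probability current in this case]. Show that:

$$\mathbf{J} = \frac{\rho}{m} \left[ \hbar \nabla \xi(\mathbf{r}) - q \mathbf{A}(\mathbf{r}) \right]$$

and:

$$\nabla \times \mathbf{v}(\mathbf{r}) = -\frac{q}{m}\, \mathbf{B}(\mathbf{r})$$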


10. Virial theorem 


a. In a one-dimensional problem, consider a particle with the Hamiltonian:

$$H = \frac{P^2}{2m} + V(X)$$

where:

$$V(X) = \lambda X^n$$

Calculate the commutator $[H, XP]$. If there exists one or several stationary
states $|\varphi\rangle$ in the potential $V$, show that the mean values $\langle T \rangle$ and $\langle V \rangle$ of the
kinetic and potential energies in these states satisfy the relation: $2\langle T \rangle = n \langle V \rangle$.

b. In a three-dimensional problem, $H$ is written:

$$H = \frac{\mathbf{P}^2}{2m} + V(\mathbf{R})$$

Calculate the commutator $[H, \mathbf{R} \cdot \mathbf{P}]$. Assume that $V(\mathbf{R})$ is a homogeneous function
of $n$th order in the variables $X$, $Y$, $Z$. What relation necessarily exists between the
mean kinetic energy and the mean potential energy of the particle in a stationary
state?

Apply this to a particle moving in the potential $V(r) = -e^2/r$ (hydrogen
atom).

Recall that a homogeneous function $V$ of $n$th degree in the variables $x$, $y$ and $z$
by definition satisfies the relation:

$$V(\alpha x, \alpha y, \alpha z) = \alpha^n\, V(x, y, z)$$

and satisfies Euler's identity:

$$x \frac{\partial V}{\partial x} + y \frac{\partial V}{\partial y} + z \frac{\partial V}{\partial z} = n\, V(x, y, z)$$

c. Consider a system of $N$ particles of positions $\mathbf{R}_i$ and momenta $\mathbf{P}_i$
($i = 1, 2, \ldots, N$). When their potential energy is a homogeneous ($n$th degree) function
of the set of components $X_i$, $Y_i$, $Z_i$, can the results obtained above be generalized?
An application of this can be made to the study of an arbitrary molecule
formed of nuclei of charges $-Z_i q$ and electrons of charge $q$. All these particles
interact by pairs through Coulomb forces. In a stationary state of the molecule,
what relation exists between the kinetic energy of the system of particles and their
energy of mutual interaction?


11. Two-particle wave function 

In a one-dimensional problem, consider a system of two particles (1) and (2)
with which is associated the wave function $\psi(x_1, x_2)$.

a. What is the probability of finding, in a measurement of the positions $X_1$
and $X_2$ of the two particles, a result such that:

$$x \le x_1 \le x + dx$$

$$\alpha \le x_2 \le \beta$$

b. What is the probability of finding particle (1) between $x$ and $x + dx$ [when
no observations are made on particle (2)]?

c. Give the probability of finding at least one of the particles between $\alpha$ and $\beta$.

d. Give the probability of finding one and only one particle between $\alpha$ and $\beta$.

e. What is the probability of finding the momentum of particle (1) included
between $p'$ and $p''$ and the position of particle (2) between $\alpha$ and $\beta$?

f. The momenta $P_1$ and $P_2$ of the two particles are measured; what is the
probability of finding $p' \le p_1 \le p''$; $p''' \le p_2 \le p''''$?

g. The only quantity measured is the momentum $P_1$ of the first particle.
Calculate, first from the results of e and then from those of f, the probability of
finding this momentum included between $p'$ and $p''$. Compare the two results
obtained.

h. The algebraic distance $X_1 - X_2$ between the two particles is measured;
what is the probability of finding a result included between $-d$ and $+d$? What is
the mean value of this distance?


12. Infinite one-dimensional well 


Consider a particle of mass $m$ submitted to the potential:

$V(x) = 0$ if $0 \le x \le a$

$V(x) = +\infty$ if $x < 0$ or $x > a$

$|\varphi_n\rangle$ are the eigenstates of the Hamiltonian $H$ of the system, and their eigenvalues
are $E_n = \dfrac{n^2 \pi^2 \hbar^2}{2ma^2}$ (cf. complement $H_I$). The state of the particle at the instant $t = 0$
is:

$$|\psi(0)\rangle = a_1 |\varphi_1\rangle + a_2 |\varphi_2\rangle + a_3 |\varphi_3\rangle + a_4 |\varphi_4\rangle$$

a. What is the probability, when the energy of the particle in the state $|\psi(0)\rangle$
is measured, of finding a value smaller than $\dfrac{3\pi^2\hbar^2}{ma^2}$?

b. What is the mean value and what is the root-mean-square deviation of
the energy of the particle in the state $|\psi(0)\rangle$?

c. Calculate the state vector $|\psi(t)\rangle$ at the instant $t$. Do the results found
in a and b at the instant $t = 0$ remain valid at an arbitrary time $t$?

d. When the energy is measured, the result $\dfrac{8\pi^2\hbar^2}{ma^2}$ is found. After the
measurement, what is the state of the system? What is the result if the energy is measured
again?


13. Infinite two-dimensional well (cf. complement $G_{II}$)

In a two-dimensional problem, consider a particle of mass $m$; its Hamiltonian
$H$ is written:

$$H = H_x + H_y$$

with:

$$H_x = \frac{P_x^2}{2m} + V(X), \qquad H_y = \frac{P_y^2}{2m} + V(Y)$$

The potential energy $V(x)$ [or $V(y)$] is zero when $x$ (or $y$) is included in the
interval $[0, a]$ and is infinite everywhere else.

a. Of the following sets of operators, which form a C.S.C.O.?

$$\{H\},\ \{H_x\},\ \{H_x, H_y\},\ \{H, H_x\}$$

b. Consider a particle whose wave function is:

$$\psi(x, y) = N \cos\frac{\pi x}{a}\, \cos\frac{\pi y}{a}\, \sin\frac{2\pi x}{a}\, \sin\frac{2\pi y}{a}$$

when $0 \le x \le a$ and $0 \le y \le a$, and is zero everywhere else (where $N$ is a constant).

α. What is the mean value $\langle H \rangle$ of the energy of the particle? If the energy $H$
is measured, what results can be found, and with what probabilities?

β. The observable $H_x$ is measured; what results can be found, and with what
probabilities? If this measurement yields the result $\dfrac{\pi^2\hbar^2}{2ma^2}$, what will be the results
of a subsequent measurement of $H_y$, and with what probabilities?

γ. Instead of performing the preceding measurements, one now performs
a simultaneous measurement of $H_x$ and $P_y$. What are the probabilities of finding :

and :

$$p_0 \le P_y \le p_0 + dp \ ?$$


14. Consider a physical system whose state space, which is three-dimensional,
is spanned by the orthonormal basis formed by the three kets $|u_1\rangle$, $|u_2\rangle$, $|u_3\rangle$.
In this basis, the Hamiltonian operator $H$ of the system and the two observables $A$
and $B$ are written:

$$H = \hbar\omega_0 \begin{pmatrix} 1 & 0 & 0 \\ 0 & 2 & 0 \\ 0 & 0 & 2 \end{pmatrix}; \qquad
A = a \begin{pmatrix} 1 & 0 & 0 \\ 0 & 0 & 1 \\ 0 & 1 & 0 \end{pmatrix}; \qquad
B = b \begin{pmatrix} 0 & 1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 1 \end{pmatrix}$$

where $\omega_0$, $a$ and $b$ are positive real constants.

The physical system at time $t = 0$ is in the state:

$$|\psi(0)\rangle = \frac{1}{\sqrt{2}}\, |u_1\rangle + \frac{1}{2}\, |u_2\rangle + \frac{1}{2}\, |u_3\rangle$$

a. At time $t = 0$, the energy of the system is measured. What values
can be found, and with what probabilities? Calculate, for the system in the
state $|\psi(0)\rangle$, the mean value $\langle H \rangle$ and the root-mean-square deviation $\Delta H$.

b. Instead of measuring $H$ at time $t = 0$, one measures $A$; what results can
be found, and with what probabilities? What is the state vector immediately after
the measurement?

c. Calculate the state vector $|\psi(t)\rangle$ of the system at time $t$.

d. Calculate the mean values $\langle A \rangle(t)$ and $\langle B \rangle(t)$ of $A$ and $B$ at time $t$. What
comments can be made?

e. What results are obtained if the observable $A$ is measured at time $t$?
Same question for the observable $B$. Interpret.


15. Interaction picture 

(It is recommended that complement $F_{III}$ and perhaps complement $G_{III}$ be
read before this exercise is undertaken.)

Consider an arbitrary physical system. Denote its Hamiltonian by $H_0(t)$ and
the corresponding evolution operator by $U_0(t, t_0)$:

$$i\hbar\, \frac{\partial}{\partial t} U_0(t, t_0) = H_0(t)\, U_0(t, t_0)$$

$$U_0(t_0, t_0) = \mathbb{1}$$

Now assume that the system is perturbed in such a way that its Hamiltonian
becomes:

$$H(t) = H_0(t) + W(t)$$

The state vector of the system in the "interaction picture", $|\psi_I(t)\rangle$, is defined
from the state vector $|\psi_S(t)\rangle$ in the Schrödinger picture by:

$$|\psi_I(t)\rangle = U_0^\dagger(t, t_0)\, |\psi_S(t)\rangle$$

a. Show that the evolution of $|\psi_I(t)\rangle$ is given by:

$$i\hbar\, \frac{d}{dt} |\psi_I(t)\rangle = W_I(t)\, |\psi_I(t)\rangle$$

where $W_I(t)$ is the transform operator of $W(t)$ under the unitary transformation
associated with $U_0^\dagger(t, t_0)$:

$$W_I(t) = U_0^\dagger(t, t_0)\, W(t)\, U_0(t, t_0)$$

Explain qualitatively why, when the perturbation $W(t)$ is much smaller than $H_0(t)$,
the motion of the vector $|\psi_I(t)\rangle$ is much slower than that of $|\psi_S(t)\rangle$.

b. Show that the preceding differential equation is equivalent to the integral
equation:

$$|\psi_I(t)\rangle = |\psi_I(t_0)\rangle + \frac{1}{i\hbar} \int_{t_0}^{t} dt'\, W_I(t')\, |\psi_I(t')\rangle$$

where: $|\psi_I(t_0)\rangle = |\psi_S(t_0)\rangle$.

c. Solving this integral equation by iteration, show that the ket $|\psi_I(t)\rangle$ can
be expanded in a power series in $W$ of the form:

$$|\psi_I(t)\rangle = \left\{ \mathbb{1} + \frac{1}{i\hbar} \int_{t_0}^{t} dt'\, W_I(t') + \frac{1}{(i\hbar)^2} \int_{t_0}^{t} dt'\, W_I(t') \int_{t_0}^{t'} dt''\, W_I(t'') + \ldots \right\} |\psi_I(t_0)\rangle$$


16. Correlations between two particles 


(It is recommended that the complement $E_{III}$ be read in order to answer
question e of this exercise.)

Consider a physical system formed by two particles (1) and (2), of the same
mass $m$, which do not interact with each other and which are both placed in an
infinite potential well of width $a$ (cf. complement $H_I$, §2-c). Denote by $H(1)$ and
$H(2)$ the Hamiltonians of each of the two particles and by $|\varphi_n(1)\rangle$ and $|\varphi_q(2)\rangle$ the
corresponding eigenstates of the first and second particle, of energies $\dfrac{n^2\pi^2\hbar^2}{2ma^2}$ and
$\dfrac{q^2\pi^2\hbar^2}{2ma^2}$. In the state space of the global system, the basis chosen is composed of
the states $|\varphi_n \varphi_q\rangle$ defined by:

$$|\varphi_n \varphi_q\rangle = |\varphi_n(1)\rangle \otimes |\varphi_q(2)\rangle$$

a. What are the eigenstates and the eigenvalues of the operator
$H = H(1) + H(2)$, the total Hamiltonian of the system? Give the degree of
degeneracy of the two lowest energy levels.
b. Assume that the system, at time $t = 0$, is in the state:

$$|\psi(0)\rangle = \frac{1}{\sqrt{6}}\, |\varphi_1 \varphi_1\rangle + \frac{1}{\sqrt{3}}\, |\varphi_1 \varphi_2\rangle + \frac{1}{\sqrt{6}}\, |\varphi_2 \varphi_1\rangle + \frac{1}{\sqrt{3}}\, |\varphi_2 \varphi_2\rangle$$

α. What is the state of the system at time $t$?

β. The total energy $H$ is measured. What results can be found, and with
what probabilities?

γ. Same questions if, instead of measuring $H$, one measures $H(1)$.

c. α. Show that $|\psi(0)\rangle$ is a tensor product state. When the system is in this
state, calculate the following mean values: $\langle H(1) \rangle$, $\langle H(2) \rangle$ and $\langle H(1)H(2) \rangle$.
Compare $\langle H(1) \rangle \langle H(2) \rangle$ with $\langle H(1)H(2) \rangle$; how can this result be explained?

β. Show that the preceding results remain valid when the state of the
system is the state $|\psi(t)\rangle$ calculated in b.

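d. Now assume that the state $|\psi(0)\rangle$ is given by:

$$|\psi(0)\rangle = \frac{1}{\sqrt{5}}\, |\varphi_1 \varphi_1\rangle + \sqrt{\frac{3}{5}}\, |\varphi_1 \varphi_2\rangle + \frac{1}{\sqrt{5}}\, |\varphi_2 \varphi_1\rangle$$

Show that $|\psi(0)\rangle$ cannot be put in the form of a tensor product. Answer for
this case all the questions asked in c.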

e. α. Write the matrix, in the basis of the vectors $|\varphi_n \varphi_p\rangle$, which represents
the density operator $\rho(0)$ corresponding to the ket $|\psi(0)\rangle$ given in b. What is the
density matrix $\rho(t)$ at time $t$? Calculate, at the instant $t = 0$, the partial traces:

$$\rho(1) = \mathrm{Tr}_2\, \rho \quad \text{and} \quad \rho(2) = \mathrm{Tr}_1\, \rho$$

Do the density operators $\rho$, $\rho(1)$ and $\rho(2)$ describe pure states? Compare $\rho$ with
$\rho(1) \otimes \rho(2)$; what is your interpretation?

β. Answer the same questions as in α, but choosing for $|\psi(0)\rangle$ the ket
given in d.

The subject of the following exercises is the density operator: they therefore
assume the concepts and results of complement $E_{III}$ to be known.


17. Let $\rho$ be the density operator of an arbitrary system, where $|\chi_l\rangle$ and $\pi_l$ are the
eigenvectors and eigenvalues of $\rho$. Write $\rho$ and $\rho^2$ in terms of the $|\chi_l\rangle$ and $\pi_l$. What
do the matrices representing these two operators in the $\{ |\chi_l\rangle \}$ basis look like: first,
in the case where $\rho$ describes a pure state and then, in the case of a statistical
mixture of states? (Begin by showing that, in a pure case, $\rho$ has only one non-zero
diagonal element, equal to 1, while for a statistical mixture, $\rho$ has several diagonal
elements included between 0 and 1.) Show that $\rho$ corresponds to a pure case if and
only if the trace of $\rho^2$ is equal to 1.


18. Consider a system whose density operator is $\rho(t)$, evolving under the
influence of a Hamiltonian $H(t)$. Show that the trace of $\rho^2$ does not vary over time.
Conclusion: can the system evolve so as to be successively in a pure state and
a statistical mixture of states?
19. Let (1) + (2) be a global system, composed of two subsystems (1) and (2).
$A$ and $B$ denote two operators acting in the state space $\mathscr{E}(1) \otimes \mathscr{E}(2)$. Show that the
two partial traces $\mathrm{Tr}_1 \{ AB \}$ and $\mathrm{Tr}_1 \{ BA \}$ are equal when $A$ (or $B$) actually
acts only in the space $\mathscr{E}(1)$, that is, when $A$ (or $B$) can be written:

$$A = A(1) \otimes \mathbb{1}(2) \quad [\text{or } B = B(1) \otimes \mathbb{1}(2)]$$

Application: if the operator $H$, the Hamiltonian of the global system, is the sum
of two operators which act, respectively, only in $\mathscr{E}(1)$ and only in $\mathscr{E}(2)$:

$$H = H(1) + H(2),$$

calculate the variation $\dfrac{d}{dt} \rho(1)$ of the reduced density operator $\rho(1)$. Give the
physical interpretation of the result obtained.


Exercise 5 

References: Flügge (1.24), §§40 and 41; Landau and Lifshitz (1.19), §22.


Exercise 10 

References: Levine (12.3), chap. 14; Eyring et al (12.5), §18 b 


Exercise 15 

References: see references of complement $G_{III}$.
BOUND STATES IN A "POTENTIAL WELL" OF ARBITRARY SHAPE


RETURN TO ONE-DIMENSIONAL PROBLEMS 


Now that we are more familiar with the mathematical formalism and the
physical content of quantum mechanics, we can go into some of the results
obtained in chapter I in more detail. In the three complements which follow, we
shall study in a general way the quantum properties of a particle subject to a
scalar potential* of arbitrary form, confining ourselves for simplicity to
one-dimensional problems. We shall treat the bound stationary states of a particle,
whose energies form a discrete spectrum (complement $M_{III}$), and then the unbound
states corresponding to an energy continuum (complement $N_{III}$). In addition, we
shall examine a special case which is very important because of its applications,
particularly in solid state physics, that of a periodic potential (complement $O_{III}$).


Complement $M_{III}$

BOUND STATES OF A PARTICLE 

IN A “POTENTIAL WELL'' OF ARBITRARY SHAPE 


1. Quantization of the bound state energies 

2. Minimum value of the ground state energy 


In complement $H_I$, we studied, for a special case (finite or infinite "square" well),
the bound states of a particle in a potential well. We derived certain properties
of these bound states: a discrete energy spectrum and a ground state energy
greater than the classical minimum energy. These properties are, in fact, general,
and have numerous physical consequences, as we shall show in this complement.

When the potential energy of a particle possesses a minimum (see figure 1-a), the
particle is said to be placed in a "potential well"**. Before studying qualitatively
the stationary states of a quantum particle in such a well, let us recall the
corresponding motion of a classical particle. When its energy $E_{cl}$ takes on the
minimum possible value $E_{cl} = -V_0$ (where $V_0$ is the depth of the well), the particle is
motionless at the point $M_0$ whose abscissa is $x_0$. In the case where $-V_0 < E_{cl} < 0$,
the particle oscillates in the well, with an amplitude which increases with $E_{cl}$. Finally,
when $E_{cl} > 0$, the particle does not remain in the well, but moves off towards
infinity. The "bound states" of the classical particle therefore correspond to all
negative energy values between $-V_0$ and 0.


* The effects of a vector potential $\mathbf{A}$ will be studied later, in particular in complement $E_{VI}$.

** The potential energy, of course, is only defined to within a constant. By convention, we set
the potential equal to zero at infinity.
For a quantum particle, the situation is very different. Well-defined energy
states $E$ are stationary states whose wave functions $\varphi(x)$ are solutions of the
eigenvalue equation of the Hamiltonian $H$:

$$\left[ -\frac{\hbar^2}{2m} \frac{d^2}{dx^2} + V(x) \right] \varphi(x) = E\, \varphi(x) \tag{1}$$

Such a second-order differential equation has an infinite number of solutions,
whatever the value chosen for $E$: if we pick arbitrary values of $\varphi(x)$ and its
derivative at any given point, we can obtain $\varphi$ for any other value of $x$. Equation (1)
alone cannot, therefore, restrict the possible energy values. However, we shall show
here that if, in addition, we impose certain boundary conditions on $\varphi(x)$, only
a certain number of values of $E$ remain possible (quantization of energy levels).


1. Quantization of the bound state energies 


We shall call “bound states of the particle” states whose wave functions $\varphi(x)$
satisfy the eigenvalue equation (1) and are square-integrable [indispensable if $\varphi(x)$
is actually to describe the physical state of a particle]. These are therefore stationary
states, for which the position probability density $|\varphi(x)|^2$ takes on non-negligible
values only in a limited region of space [for $\int_{-\infty}^{+\infty} dx\, |\varphi(x)|^2$ to converge,
$|\varphi(x)|^2$ must approach zero sufficiently rapidly when $x \to \pm\infty$]. Bound states
remind us of classical motion where the particle oscillates inside the well without ever
being able to emerge (energy $E_{cl}$ negative, but greater than $-V_0$).

We shall see that in quantum mechanics, the fact that $\varphi(x)$ is required to be
square-integrable implies that the possible energies form a discrete set of values
which are also included between $-V_0$ and 0. To understand this, let us return to
the potential shown in figure 1-a. For simplicity, we shall assume that $V(x)$ is
identically equal to zero outside an interval $[x_1, x_2]$. If $x < x_1$ (region I), $V(x) = 0$,
and the solution to equation (1) can immediately be written:

- if $E > 0$:

$$\varphi_I(x) = A\, e^{ikx} + A'\, e^{-ikx} \tag{2}$$

with:

$$k = \sqrt{\frac{2mE}{\hbar^2}} \tag{3}$$

- if $E < 0$:

$$\varphi_I(x) = B\, e^{\rho x} + B'\, e^{-\rho x} \tag{4}$$

with:

$$\rho = \sqrt{-\frac{2mE}{\hbar^2}} \tag{5}$$

We are looking for a square-integrable solution; we must therefore eliminate
the form (2), in which $\varphi_I(x)$ is a superposition of plane waves of constant modulus
which cause the integral:

$$\int_{-\infty}^{x_1} dx\, |\varphi_I(x)|^2 \tag{6}$$

to diverge. Only possibility (4) remains, and we obtain our first result: the bound
states of the particle all have a negative energy. In (4), we cannot retain the term
in $e^{-\rho x}$, which diverges when $x \to -\infty$. We are therefore left with:

$$\varphi_I(x) = e^{\rho x} \quad \text{if } x < x_1 \tag{7}$$

[We have omitted the proportionality factor $B$ since the homogeneity of equation (1)
allows us to define $\varphi(x)$ to within a multiplicative coefficient].

The value of $\varphi(x)$ in the interval $x_1 \le x \le x_2$ (region II) is obtained by
extending $\varphi_I(x)$: we must look for the solution of equation (1) which is equal
to $e^{\rho x_1}$ for $x = x_1$ and whose derivative at this point is equal to $\rho\, e^{\rho x_1}$. The
function $\varphi_{II}(x)$ thus obtained depends on $\rho$ and, of course, on the exact
expression for $V(x)$. Nevertheless, since (1) is a second-order differential equation,
$\varphi_{II}(x)$ is determined uniquely by the preceding boundary conditions; it is, moreover,
real (which enables us to trace curves such as those in figures 1-b, 1-c and 1-d).

All that now remains to be done is to obtain the solution when $x > x_2$
(region III); this solution can be written:

$$\varphi_{III}(x) = \tilde{B}\, e^{\rho x} + \tilde{B}'\, e^{-\rho x} \tag{8}$$

where $\tilde{B}$ and $\tilde{B}'$ are real constants determined by the two continuity conditions
for $\varphi(x)$ and $d\varphi/dx$ at the point $x = x_2$. $\tilde{B}$ and $\tilde{B}'$ depend on $\rho$, as well as
on the function $V(x)$.

We have therefore constructed a solution of equation (1), such as the one
shown in figure 1-b. Is this solution square-integrable? We see from (8) that, in
general, it is not, except when the coefficient $\tilde{B}$ of the growing exponential $e^{\rho x}$
is zero (this special case is shown in figure 1-c). Now, for a given function $V(x)$,
$\tilde{B}$ is a function of $E$ through the intermediary of $\rho$. The only values of $E$ for
which a bound state exists are therefore solutions of the equation $\tilde{B}(E) = 0$. These
solutions $E_1, E_2, \ldots$ (cf. fig. 2) form a discrete spectrum which, of course, depends
on the potential $V(x)$ chosen (we shall see in the following section that all the
energies $E_i$ are greater than $-V_0$).
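
The construction above can be tried numerically. The following is a minimal sketch, assuming a square well ($V = -V_0$ on an interval of width $a$, zero outside) and units $\hbar = m = 1$; the function names, the well parameters and the grid-plus-bisection root search are illustrative choices, not taken from the text. For this well the solution of region II is known in closed form, so the coefficient of $e^{\rho x}$ in (8) can be written explicitly (up to a positive factor) and its zeros located.

```python
import math

def b_tilde(E, V0=10.0, a=2.0):
    """Coefficient of the growing exponential in region III, up to a positive factor.

    Assumes a square well of depth V0 and width a, with hbar = m = 1.
    Start from phi = exp(rho*x) on the left, propagate through the well
    with wave number K, then project onto exp(rho*x) on the right.
    """
    rho = math.sqrt(-2.0 * E)          # decay constant outside the well
    K = math.sqrt(2.0 * (E + V0))      # wave number inside the well
    return 2.0 * math.cos(K * a) + (rho / K - K / rho) * math.sin(K * a)

def bound_state_energies(V0=10.0, a=2.0, n_grid=2000, tol=1e-12):
    """Scan -V0 < E < 0 for sign changes of b_tilde and refine each by bisection."""
    eps = 1e-9 * V0
    grid = [-V0 + eps + (V0 - 2.0 * eps) * i / (n_grid - 1) for i in range(n_grid)]
    levels = []
    for lo, hi in zip(grid, grid[1:]):
        f_lo, f_hi = b_tilde(lo, V0, a), b_tilde(hi, V0, a)
        if f_lo * f_hi < 0.0:
            while hi - lo > tol:
                mid = 0.5 * (lo + hi)
                f_mid = b_tilde(mid, V0, a)
                if f_lo * f_mid <= 0.0:
                    hi = mid
                else:
                    lo, f_lo = mid, f_mid
            levels.append(0.5 * (lo + hi))
    return levels

if __name__ == "__main__":
    for n, E in enumerate(bound_state_energies(), start=1):
        print(f"E_{n} = {E:.6f}")
```

For any energy that is not one of the returned values, `b_tilde` is non-zero and the wave function grows exponentially on the right, which is the situation of figure 1-b.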

We thus arrive at the following result: the bound state energy values
possible for a particle placed in a potential well of arbitrary shape form a discrete
set (it is often said that the bound state energies are quantized). This result can be
compared to the quantization of electromagnetic modes in a cavity. There is no
analogue in classical mechanics, where, as we have seen, all energy values
included between $-V_0$ and 0 are acceptable. In quantum mechanics, the lowest
energy level $E_1$ is called the ground state, the energy level $E_2$ immediately
above, the first excited state, the next energy level $E_3$, the second excited state, etc.
The following schematic diagram is often associated with each of these states:
inside the potential well representing $V(x)$, a horizontal line is drawn whose
vertical position corresponds to the energy of the state and whose length gives
an idea of the spatial extension of the wave function (this line actually covers
the points of the axis which would be reached by a classical particle of the
same energy). For the set of energy levels, we obtain a schematic diagram of
the type shown in figure 3.

FIGURE 2

Graphical representation of the function $\tilde{B}(E)$. The zeros of $\tilde{B}(E)$ give the values of $E$
for which $\varphi(x)$ is square-integrable (the situation in figure 1-c), that is, the energies
$E_1, E_2, \ldots$ of the bound states; all these energies are included between $-V_0$ and 0.

As we saw in chapter I, the phenomenon of energy quantization was one
of the factors which led to the introduction of quantum mechanics. Discrete energy
levels appear in a very large number of physical systems: atoms (cf. chap. VII,
hydrogen atom), the harmonic oscillator (cf. chap. V), atomic nuclei, etc.



FIGURE 3

Schematic representation of the bound states of a particle in a potential well. For each of these
stationary states, one draws a horizontal line whose ordinate is equal to the energy of the
corresponding level. The ends of this line are the points of intersection with the curve which
represents the potential $V(x)$; that is, the line is confined to the region of classical motion for
the same energy; this gives an idea of the extension of the wave function.

2. Minimum value of the ground state energy


In this section, we shall show that the energies $E_1$, $E_2$, etc. are all
greater than the minimum value $-V_0$ of the potential energy $V(x)$. We shall then
see how this result can be easily understood using Heisenberg’s uncertainty
relation.

If $\varphi(x)$ is a solution of (1), we obtain, multiplying this equation by
$\varphi^*(x)$ and integrating the relation thus obtained:

$$-\frac{\hbar^2}{2m} \int_{-\infty}^{+\infty} dx\, \varphi^*(x) \frac{d^2}{dx^2} \varphi(x) + \int_{-\infty}^{+\infty} dx\, V(x)\, |\varphi(x)|^2 = E \int_{-\infty}^{+\infty} dx\, |\varphi(x)|^2 \tag{9}$$
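For a bound state, the function $\varphi(x)$ can be normalized, and equation (9)
can then be written simply:

$$E = \langle T \rangle + \langle V \rangle \tag{10}$$

with:

$$\langle T \rangle = -\frac{\hbar^2}{2m} \int_{-\infty}^{+\infty} dx\, \varphi^*(x) \frac{d^2}{dx^2} \varphi(x) = \frac{\hbar^2}{2m} \int_{-\infty}^{+\infty} dx \left| \frac{d\varphi}{dx} \right|^2 \tag{11}$$

[where we have performed an integration by parts and used the fact that
$\varphi(x)$ goes to zero when $|x| \to \infty$] and:

$$\langle V \rangle = \int_{-\infty}^{+\infty} dx\, V(x)\, |\varphi(x)|^2 \tag{12}$$

Relation (10) shows simply that $E$ is the sum of the mean value of the kinetic
energy:

$$\langle T \rangle = \langle \varphi | \frac{P^2}{2m} | \varphi \rangle \tag{13}$$

and that of the potential energy:

$$\langle V \rangle = \langle \varphi | V(X) | \varphi \rangle \tag{14}$$

From relations (11) and (12), it follows immediately that:

$$\langle T \rangle > 0 \tag{15}$$

$$\langle V \rangle \ge \int_{-\infty}^{+\infty} dx\, (-V_0)\, |\varphi(x)|^2 = -V_0 \tag{16}$$

Consequently:

$$E = \langle T \rangle + \langle V \rangle > \langle V \rangle \ge -V_0 \tag{17}$$

Since $E$ is negative, as we showed in § 1, we see that, as in classical mechanics, the
bound state energies are always between $-V_0$ and 0.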

There exists, nevertheless, an important difference between the classical
and quantum situations: while, in classical mechanics, the particle can have
an energy equal to $-V_0$ (case of a particle at rest at $M_0$) or slightly greater
than $-V_0$ (case of small oscillations), the same is not true in quantum mechanics,
where the lowest possible energy is the energy $E_1$ of the ground state, which
is necessarily greater than $-V_0$ (cf. fig. 3). The Heisenberg uncertainty relations
enable us to understand the physical origin of this result, as we shall now show.

If we try to construct a state of the particle for which the mean potential
energy is as small as possible, we see from (12) that we must choose a wave
function which is practically localized at the point $M_0$. The root-mean-square
deviation $\Delta X$ is then very small, so $\Delta P$ is necessarily very large. Since:

$$\langle P^2 \rangle = (\Delta P)^2 + \langle P \rangle^2 \ge (\Delta P)^2 \tag{18}$$

the kinetic energy $\langle T \rangle = \langle P^2 \rangle / 2m$ is then also very large. Therefore, if the
potential energy of the particle approaches its minimum, the kinetic energy
increases without bound. The wave function of the ground state corresponds to a
compromise, for which the sum of these two energies is a minimum. The ground
state of the quantum particle is thus characterized by a wave function which
has a certain spatial extension (cf. fig. 3), and its energy is necessarily greater
than $-V_0$. Unlike the situation in classical mechanics, there exists no
well-defined energy state in quantum mechanics where the particle is “at rest” at the
bottom of the potential well.
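
The compromise between kinetic and potential energy can be made concrete with a short numerical sketch. It rests on assumptions that are not in the text: a Gaussian-shaped well $V(x) = -V_0\, e^{-x^2/a^2}$, a Gaussian trial wave function whose probability density has standard deviation $\sigma = \Delta X$ (so that $\Delta P = \hbar/2\sigma$), and units $\hbar = m = 1$. For these choices both mean values have closed forms, $\langle T \rangle = 1/(8\sigma^2)$ and $\langle V \rangle = -V_0/\sqrt{1 + 2\sigma^2/a^2}$. The minimum found is only a variational estimate of the ground state energy, not its exact value.

```python
import math

def trial_energies(sigma, V0=1.0, a=1.0):
    """Mean kinetic and potential energy of a Gaussian trial state (hbar = m = 1).

    sigma is the root-mean-square deviation Delta X of the probability density;
    the well V(x) = -V0 * exp(-x**2 / a**2) is a hypothetical example.
    """
    kinetic = 1.0 / (8.0 * sigma ** 2)                            # <P^2>/2m with Delta P = 1/(2 sigma)
    potential = -V0 / math.sqrt(1.0 + 2.0 * sigma ** 2 / a ** 2)  # Gaussian average of V(x)
    return kinetic, potential

def best_width(V0=1.0, a=1.0, n=4000):
    """Scan sigma on a logarithmic grid and return (sigma, energy) at the minimum."""
    best = None
    for i in range(n):
        sigma = 10.0 ** (-3.0 + 5.0 * i / (n - 1))   # from 1e-3 to 1e2
        kinetic, potential = trial_energies(sigma, V0, a)
        if best is None or kinetic + potential < best[1]:
            best = (sigma, kinetic + potential)
    return best

if __name__ == "__main__":
    for sigma in (0.01, 0.1, 1.0, 10.0):
        T, V = trial_energies(sigma)
        print(f"sigma = {sigma:6.2f}   <T> = {T:12.4f}   <V> = {V:8.4f}   E = {T + V:12.4f}")
    print("variational minimum:", best_width())
```

As $\sigma \to 0$ the mean potential energy approaches $-V_0$ while the kinetic term diverges, so the total never reaches $-V_0$; the minimum occurs at a finite width.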


COMMENT:

Since the energy of the bound states is included between $-V_0$ and 0,
such states can exist only if the potential $V(x)$ takes on negative values
in one or several regions of the $x$-axis. This is why we have chosen for
this complement a potential “well” like the one shown in figure 1-a
(while in the following complement, we shall not confine ourselves to the
case of a potential well).

However, there is nothing to prevent $V(x)$ from being positive for
certain values of $x$; for example, the “well” can be surrounded by potential
“barriers” as is shown in figure 4 (we shall always assume the potential to
be zero at infinity). In this case, certain classical motions of positive
energy remain bounded, while in quantum mechanics, the same reasoning as
above shows that the bound states always have an energy between $-V_0$
and 0. Physically, this difference arises from the fact that a potential
barrier of finite height is never able to make a quantum particle turn back
completely: the particle always has a non-zero probability of passing through
by the tunnel effect.

FIGURE 4

Potential well of depth $V_0$ situated between two potential barriers of height $V_1$ and $V_2$ (assuming,
for example, $V_1 \le V_2$). Classically, there exist particle states whose energy is between $-V_0$ and $V_1$
which remain confined between the two barriers. In quantum mechanics, a particle whose energy
is between 0 and $V_1$ can penetrate the barrier by the tunnel effect; consequently, the bound states
always have energies between $-V_0$ and 0.


References and suggestions for further reading: 

Feynman III (1.2), §16-6; Messiah (1.17), chap. III, §11; Ayant and Belorizky
(1.10), chap. IV, §§1, 2, 3; Schiff (1.18), §8.

Complement $N_{III}$

UNBOUND STATES OF A PARTICLE IN THE PRESENCE
OF A POTENTIAL WELL OR BARRIER OF ARBITRARY SHAPE


1. Transmission matrix M(k)

a. Definition of M(k)

b. Properties of M(k)

2. Reflection and transmission coefficients

3. Example


In complement $M_{III}$, we showed that bound states of a particle placed in
a potential $V(x)$ have negative energies* and that they exist only if $V(x)$ is an
attractive potential (a potential well which allows classical bounded motion).
We had to reject positive energy values since they led to eigenfunctions $\varphi_k(x)$
of the Hamiltonian $H$ which, at infinity, behaved like superpositions of
non-square-integrable exponentials $e^{\pm ikx}$. Nevertheless, we saw as early as chapter I
that, by superposing such functions linearly, one can construct square-integrable
wave functions $\psi(x)$ (wave packets) which can therefore represent the physical
state of a particle. It is clear that, since the states thus obtained involve several
values of $k$ (that is, of the energy), they are no longer stationary states; the
wave function $\psi(x)$ therefore evolves over time, propagating and becoming deformed.
However, the fact that $\psi(x)$ is already expanded in terms of the eigenfunctions
$\varphi_k(x)$ enables us to calculate this evolution very simply [as we did, for example,
in complement $J_I$, where we used the properties of the $\varphi_k(x)$ to calculate the
transmission and reflection coefficients of a potential barrier, the delay upon
reflection, etc.]. This is why, despite the fact that each of the $\varphi_k(x)$ cannot alone
represent a physical state, it is useful to study the positive energy eigenfunctions**
of $H$, as we have already done, in complement $H_I$, for certain square potentials.

In this complement, we are going to study in a general way (confining
ourselves, nevertheless, to one-dimensional problems) the effect of a potential
$V(x)$ on the positive energy eigenfunctions $\varphi_k(x)$. We shall assume nothing about
the shape of $V(x)$, which may present one or several barriers, wells, etc., except
that $V(x)$ goes to zero outside a finite interval $[x_1, x_2]$ of the $x$-axis. We shall
show that, in all cases, the effect of $V(x)$ on the functions $\varphi_k(x)$ can be
described by a 2 x 2 matrix, $M(k)$, which possesses a certain number of general
properties. We shall thus obtain various results which are independent of the
shape of the potential $V(x)$ chosen. For example, we shall see that the transmission
and reflection coefficients of a barrier (whether symmetrical or not) are the same
for a particle coming from the left and for a particle of the same energy coming
from the right. An additional aim of this complement $N_{III}$ is to serve as the point
of departure for the calculations of complement $O_{III}$, in which we study the
properties of a particle in a periodic potential $V(x)$.


* Recall that we chose the energy origin so as to make $V(x)$ zero at infinity.

** One might also consider studying the non-square-integrable negative energy eigenfunctions
of $H$ (those whose energies do not belong to the discrete spectrum obtained in complement $M_{III}$). However,
these functions diverge very rapidly (exponentially) at infinity, and one could not obtain square-integrable
wave functions by linearly superposing them.


1. Transmission matrix M(k) 


a. DEFINITION OF M(k)


In a one-dimensional problem, consider a potential $V(x)$ which is zero
outside an interval $[x_1, x_2]$ of length $l$, but which varies in an arbitrary way
inside this interval (fig. 1). We choose the $x$ origin to be in the middle of the
interval $[x_1, x_2]$, so as to have $V(x)$ vary only for $|x| < l/2$. The equation
satisfied by every wave function $\varphi(x)$ associated with a stationary state of energy $E$
is:

$$\left\{ \frac{d^2}{dx^2} + \frac{2m}{\hbar^2} \left[ E - V(x) \right] \right\} \varphi(x) = 0 \tag{1}$$

In the rest of this complement, we shall choose, to characterize the energy,
the parameter $k$ given by:

$$k = \sqrt{\frac{2mE}{\hbar^2}} \tag{2}$$

FIGURE 1

The potential $V(x)$ under consideration varies in an arbitrary way within the
interval $-l/2 \le x \le l/2$ and goes to zero outside this interval.

In the region $x < -l/2$, the function $e^{ikx}$ satisfies equation (1); let us call
$v_k(x)$ the solution of this equation that is identical to $e^{ikx}$ for $x < -l/2$.
When $x > +l/2$, $v_k(x)$ is necessarily a linear combination of two independent
solutions $e^{ikx}$ and $e^{-ikx}$ of (1). This gives us:

$$\text{if } x < -\frac{l}{2}: \quad v_k(x) = e^{ikx} \tag{3-a}$$

$$\text{if } x > +\frac{l}{2}: \quad v_k(x) = F(k)\, e^{ikx} + G(k)\, e^{-ikx} \tag{3-b}$$

where $F(k)$ and $G(k)$ are coefficients which depend on $k$, as well as on the
shape of the potential under study. Similarly, we can introduce the solution
$v'_k(x)$, which, for $x < -l/2$, is equal to $e^{-ikx}$:

$$\text{if } x < -\frac{l}{2}: \quad v'_k(x) = e^{-ikx} \tag{4-a}$$

$$\text{if } x > +\frac{l}{2}: \quad v'_k(x) = F'(k)\, e^{ikx} + G'(k)\, e^{-ikx} \tag{4-b}$$

The most general solution $\varphi_k(x)$ of equation (1) (of second order in $x$),
for a given value of $E$ (that is, of $k$), is a linear combination of $v_k$ and $v'_k$:

$$\varphi_k(x) = A\, v_k(x) + A'\, v'_k(x) \tag{5}$$

Relations (3-a) and (4-a) imply that:

$$\text{if } x < -\frac{l}{2}: \quad \varphi_k(x) = A\, e^{ikx} + A'\, e^{-ikx} \tag{6-a}$$

while relations (3-b) and (4-b) yield:

$$\text{if } x > +\frac{l}{2}: \quad \varphi_k(x) = \tilde{A}\, e^{ikx} + \tilde{A}'\, e^{-ikx} \tag{6-b}$$

with:

$$\tilde{A} = F(k)\, A + F'(k)\, A'$$
$$\tilde{A}' = G(k)\, A + G'(k)\, A' \tag{7}$$

By definition, the matrix $M(k)$ is the 2 x 2 matrix:

$$M(k) = \begin{pmatrix} F(k) & F'(k) \\ G(k) & G'(k) \end{pmatrix} \tag{8}$$

which allows us to write relations (7) in the matrix form:

$$\begin{pmatrix} \tilde{A} \\ \tilde{A}' \end{pmatrix} = M(k) \begin{pmatrix} A \\ A' \end{pmatrix} \tag{9}$$

$M(k)$ therefore enables us to determine, given the behavior (6-a) of the wave
function to the left of the potential, its behavior (6-b) to the right. We call $M(k)$
the “transmission matrix” of the potential.

COMMENT:

The current associated with a wave function $\varphi(x)$ is:

$$J(x) = \frac{\hbar}{2mi} \left[ \varphi^*(x) \frac{d\varphi}{dx} - \varphi(x) \frac{d\varphi^*}{dx} \right] \tag{10}$$

Differentiating, we find:

$$\frac{d}{dx} J(x) = \frac{\hbar}{2mi} \left[ \varphi^*(x) \frac{d^2\varphi}{dx^2} - \varphi(x) \frac{d^2\varphi^*}{dx^2} \right] \tag{11}$$

Taking (1) into account, we obtain:

$$\frac{d}{dx} J(x) = 0 \tag{12}$$

Therefore, the current $J(x)$ associated with a stationary state is the same
at all points of the $x$-axis. Note, moreover, that (12) is simply the
one-dimensional analogue of the relation:

$$\operatorname{div} \mathbf{J}(\mathbf{r}) = 0 \tag{13}$$

which is valid, according to relation (D-11) of chapter III, for any stationary
state of a particle moving in three-dimensional space. According to (12),
the current $J_k(x)$ associated with $\varphi_k(x)$ can therefore be calculated for any $x$,
choosing either the form (6-a) or the form (6-b) of $\varphi_k(x)$:

$$J_k(x) = \frac{\hbar k}{m} \left[ |A|^2 - |A'|^2 \right] = \frac{\hbar k}{m} \left[ |\tilde{A}|^2 - |\tilde{A}'|^2 \right] \tag{14}$$

b. PROPERTIES OF M(k)


α. It is easy to show, using the fact that the function $V(x)$ is real, that
if $\varphi(x)$ is a solution of equation (1), $\varphi^*(x)$ is also. Now consider the function
$v_k^*(x)$, which is a solution of (1); comparison of (3-a) and (4-a) shows that it is
identical to $v'_k(x)$ when $x < -l/2$. We therefore have, for all $x$:

$$v_k^*(x) = v'_k(x) \tag{15}$$

Substituting relations (3-b) and (4-b) into this relation, we obtain:

$$F^*(k) = G'(k) \tag{16}$$

$$G^*(k) = F'(k) \tag{17}$$

It follows that the matrix $M(k)$ can be written in the simplified form:

$$M(k) = \begin{pmatrix} F(k) & G^*(k) \\ G(k) & F^*(k) \end{pmatrix} \tag{18}$$

β. We saw above [cf. (12)] that the probability current $J(x)$ does not
depend on $x$ for a stationary state. We must therefore have [cf. (14)]:

$$|\tilde{A}|^2 - |\tilde{A}'|^2 = |A|^2 - |A'|^2 \tag{19}$$

for any $A$ and $A'$. Now relations (9) and (18) yield:

$$|\tilde{A}|^2 - |\tilde{A}'|^2 = [F(k) A + G^*(k) A'][F^*(k) A^* + G(k) A'^*] - [G(k) A + F^*(k) A'][G^*(k) A^* + F(k) A'^*]$$
$$= \left[ |F(k)|^2 - |G(k)|^2 \right] \left[ |A|^2 - |A'|^2 \right] \tag{20}$$

Condition (19) is therefore equivalent to:

$$|F(k)|^2 - |G(k)|^2 = \operatorname{Det} M(k) = 1 \tag{21}$$

COMMENTS:


(i) We have made no particular assumptions about the shape of the potential.
If it is even, that is, if $V(x) = V(-x)$, the matrix $M(k)$ possesses an
additional property: it can be shown that $G(k)$ is a pure imaginary.

(ii) Relations (6) show that $A$ and $\tilde{A}'$ are the coefficients of “incoming”
plane waves, i.e. waves associated with particles arriving respectively from
$x = -\infty$ and $x = +\infty$ and moving towards the zone of influence of the
potential (incident particles). On the other hand, $\tilde{A}$ and $A'$ are the
coefficients corresponding to “outgoing” waves, associated with particles moving
away from the potential (transmitted or reflected particles). It is useful
to introduce the matrix $S$, which allows us to calculate the amplitude
of the outgoing waves in terms of that of the incoming waves:

$$\begin{pmatrix} \tilde{A} \\ A' \end{pmatrix} = S(k) \begin{pmatrix} A \\ \tilde{A}' \end{pmatrix} \tag{22}$$

$S(k)$ can easily be expressed in terms of the elements of the matrix $M(k)$,
as we now show. The relations:

$$\tilde{A} = F(k)\, A + G^*(k)\, A' \tag{23-a}$$

$$\tilde{A}' = G(k)\, A + F^*(k)\, A' \tag{23-b}$$

imply that:

$$A' = \frac{1}{F^*(k)} \left[ \tilde{A}' - G(k)\, A \right] \tag{24}$$

Substituting this relation into (23-a), we obtain:

$$\tilde{A} = \frac{1}{F^*(k)} \left\{ \left[ F(k) F^*(k) - G(k) G^*(k) \right] A + G^*(k)\, \tilde{A}' \right\} \tag{25}$$

Taking (21) into account, we can then write the matrix $S(k)$:

$$S(k) = \frac{1}{F^*(k)} \begin{pmatrix} 1 & G^*(k) \\ -G(k) & 1 \end{pmatrix} \tag{26}$$

It is easy to verify, using (21) again, that:

$$S(k)\, S^\dagger(k) = S^\dagger(k)\, S(k) = 1 \tag{27}$$

$S(k)$ is therefore unitary. This matrix plays an important role in collision
theory; we could have proved its unitary property from that of the
evolution operator (cf. complement $F_{III}$), which simply expresses the
conservation over time of the total probability of finding the particle somewhere
on the $Ox$ axis (norm of the wave function).
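
The unitarity of $S(k)$ can be checked numerically for any pair $F$, $G$ that satisfies $|F|^2 - |G|^2 = 1$. The sketch below is only an illustration: the helper names and the parametrization $F = \cosh\theta\, e^{ia}$, $G = \sinh\theta\, e^{ib}$ are assumptions introduced here to generate admissible test values, and the matrix is built in the form $S = \frac{1}{F^*}\begin{pmatrix} 1 & G^* \\ -G & 1 \end{pmatrix}$ obtained by solving relations (23-a) and (23-b) for the outgoing amplitudes.

```python
import cmath
import math

def s_matrix(F, G):
    """S(k) expressed through the elements F, G of the transmission matrix."""
    Fc = F.conjugate()
    return [[1 / Fc, G.conjugate() / Fc],
            [-G / Fc, 1 / Fc]]

def dagger(S):
    """Hermitian conjugate of a 2 x 2 matrix stored as nested lists."""
    return [[S[0][0].conjugate(), S[1][0].conjugate()],
            [S[0][1].conjugate(), S[1][1].conjugate()]]

def matmul(A, B):
    """Product of two 2 x 2 matrices."""
    return [[sum(A[i][n] * B[n][j] for n in range(2)) for j in range(2)]
            for i in range(2)]

# Hypothetical test values: any pair with |F|^2 - |G|^2 = 1 would do.
theta, a, b = 0.7, 0.3, -1.1
F = math.cosh(theta) * cmath.exp(1j * a)
G = math.sinh(theta) * cmath.exp(1j * b)

S = s_matrix(F, G)
product = matmul(S, dagger(S))

if __name__ == "__main__":
    for row in product:
        print([f"{z.real:+.3e}{z.imag:+.3e}j" for z in row])
```

The first column of `S` holds the outgoing amplitudes for a unit wave incident from the left; the sum of their squared moduli equals 1, which is the statement of probability conservation for that column.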


2. Transmission and reflection coefficients 


To calculate the reflection and transmission coefficients for a particle
encountering the potential $V(x)$, one should (as in complement $J_I$) construct
a wave packet with the eigenfunctions of $H$ which we have just studied. Consider,
for example, an incident particle of energy $E_i$ coming from the left. The
corresponding wave packet is obtained by superposing functions $\varphi_k(x)$, for which we
set $\tilde{A}' = 0$, with coefficients given by a function $g(k)$ which has a marked peak in the
neighborhood of $k = k_i = \sqrt{2mE_i/\hbar^2}$. We shall not go into these calculations
in detail here; they are analogous in every way to those of complement $J_I$. They
show that the reflection and transmission coefficients are equal, respectively, to
$|A'(k_i)/A(k_i)|^2$ and $|\tilde{A}(k_i)/A(k_i)|^2$.

Since $\tilde{A}' = 0$, relations (22) and (26) yield:

$$\tilde{A}(k) = \frac{1}{F^*(k)}\, A(k)$$
$$A'(k) = -\frac{G(k)}{F^*(k)}\, A(k) \tag{28}$$

The reflection and transmission coefficients are therefore equal to:

$$R_1(k_i) = \left| \frac{A'(k_i)}{A(k_i)} \right|^2 = \left| \frac{G(k_i)}{F(k_i)} \right|^2 \tag{29-a}$$

$$T_1(k_i) = \left| \frac{\tilde{A}(k_i)}{A(k_i)} \right|^2 = \frac{1}{|F(k_i)|^2} \tag{29-b}$$

[it is easy to verify that condition (21) ensures that $R_1(k_i) + T_1(k_i) = 1$].

If we now consider a particle coming from the right, we must take $A = 0$,
which gives:

$$\tilde{A}(k) = \frac{G^*(k)}{F^*(k)}\, \tilde{A}'(k)$$
$$A'(k) = \frac{1}{F^*(k)}\, \tilde{A}'(k) \tag{30}$$

The transmission and reflection coefficients are now equal to:

$$T_2(k) = \left| \frac{A'(k)}{\tilde{A}'(k)} \right|^2 = \frac{1}{|F(k)|^2} \tag{31-a}$$

and:

$$R_2(k) = \left| \frac{\tilde{A}(k)}{\tilde{A}'(k)} \right|^2 = \left| \frac{G(k)}{F(k)} \right|^2 \tag{31-b}$$

Comparison of (29) and (31) shows that $T_1(k) = T_2(k)$ and that $R_1(k) = R_2(k)$:
for a given energy, the transparency of a barrier (whether symmetrical or not)
is therefore always the same for particles coming from the right and from
the left.

In addition, from (21) we have:

$$|F(k)| \ge 1 \tag{32}$$

When the equality is realized, the reflection coefficient is zero and the transmission
coefficient is equal to 1 (resonance). On the other hand, the inverse situation
is not possible: since (21) imposes that $|F(k)| > |G(k)|$, one can never have
$T = 0$ and $R = 1$ [except in the case where $F$ and $G$ tend simultaneously towards
infinity]. Actually, such a situation can only occur for $k = 0$. To see this, divide the
function $v_k(x)$ defined in (3) by $F(k)$. If $F(k)$ goes to infinity, the wave function will be
identically zero on the left hand side, and hence necessarily, by extension, zero on the
right hand side. However, this is impossible unless $k = 0$ and $F = -G$.

3. Example 

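Let us return to the square potentials studied in § 2-b of complement $H_I$:
in the region $-l/2 < x < +l/2$, $V(x)$ is equal to a constant $V_0$ * (see figure 2,
where $V_0$ has been chosen to be positive).

First, let us assume that $E$ is smaller than $V_0$, and set:

$$\rho = \sqrt{2m(V_0 - E)/\hbar^2} \tag{33}$$

FIGURE 2

Square potential barrier.

An elementary calculation analogous to that of complement $H_I$ yields:

$$M(k) = \begin{pmatrix} \left[ \cosh \rho l + i\, \dfrac{k^2 - \rho^2}{2k\rho} \sinh \rho l \right] e^{-ikl} & -i\, \dfrac{k_0^2}{2k\rho} \sinh \rho l \\[2ex] i\, \dfrac{k_0^2}{2k\rho} \sinh \rho l & \left[ \cosh \rho l - i\, \dfrac{k^2 - \rho^2}{2k\rho} \sinh \rho l \right] e^{ikl} \end{pmatrix} \tag{34}$$
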
* In fact, we are considering here a barrier which is displaced relative to that of complement $H_I$,
since we are assuming it to be situated between $x = -l/2$ and $x = +l/2$, instead of between $x = 0$
and $x = l$.

with:

$$k_0 = \sqrt{\frac{2mV_0}{\hbar^2}} \tag{35}$$

($V_0$ is necessarily positive here, since we have assumed $E < V_0$).
If now we assume that $E > V_0$, we set:

$$k' = \sqrt{\frac{2m}{\hbar^2} (E - V_0)} \tag{36}$$

and:

$$k_0 = \sqrt{\varepsilon\, \frac{2mV_0}{\hbar^2}} \tag{37}$$

(where $\varepsilon = +1$ if $V_0 > 0$ and $-1$ if $V_0 < 0$). We thus obtain:

$$M(k) = \begin{pmatrix} \left[ \cos k'l + i\, \dfrac{k^2 + k'^2}{2kk'} \sin k'l \right] e^{-ikl} & -i \varepsilon\, \dfrac{k_0^2}{2kk'} \sin k'l \\[2ex] i \varepsilon\, \dfrac{k_0^2}{2kk'} \sin k'l & \left[ \cos k'l - i\, \dfrac{k^2 + k'^2}{2kk'} \sin k'l \right] e^{ikl} \end{pmatrix} \tag{38}$$

It is easy to verify that the matrices $M(k)$ written in (34) and (38) satisfy
relations (16), (17) and (21).
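
The verification mentioned above can also be done numerically. The sketch below assumes units $\hbar = m = 1$ and hypothetical values of $V_0$ and $l$; the function names are illustrative. It uses a single complex quantity `q` equal to $\rho$ when $E < V_0$ and to $ik'$ when $E > V_0$, so that one expression, written in the form of (34), covers both (34) and (38) through $\cosh(ik'l) = \cos k'l$ and $\sinh(ik'l) = i \sin k'l$. The case $E = V_0$ is excluded because `q` then vanishes.

```python
import cmath
import math

def square_barrier_FG(E, V0=1.0, l=1.0):
    """Elements F(k), G(k) of M(k) for a square potential of height V0 on |x| < l/2.

    Units hbar = m = 1. q is rho for E < V0 and i*k' for E > V0.
    """
    k = math.sqrt(2.0 * E)
    q = cmath.sqrt(2.0 * (V0 - E))
    F = (cmath.cosh(q * l)
         + 1j * (k * k - q * q) / (2.0 * k * q) * cmath.sinh(q * l)) * cmath.exp(-1j * k * l)
    G = 1j * (k * k + q * q) / (2.0 * k * q) * cmath.sinh(q * l)   # k^2 + q^2 = 2 V0
    return F, G

def coefficients(E, V0=1.0, l=1.0):
    """Return Det M, the reflection coefficient R and the transmission coefficient T."""
    F, G = square_barrier_FG(E, V0, l)
    det = abs(F) ** 2 - abs(G) ** 2
    R = abs(G / F) ** 2
    T = 1.0 / abs(F) ** 2
    return det, R, T

if __name__ == "__main__":
    for E in (0.1, 0.5, 2.0, 5.0):
        det, R, T = coefficients(E)
        print(f"E = {E:4.1f}   Det M = {det:.12f}   R = {R:.6f}   T = {T:.6f}")
```

Since the square barrier is an even potential, `G` comes out purely imaginary, in agreement with comment (i) of § 1-b.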


References and suggestions for further reading: 

Merzbacher (1.16), chap. 6, §§5, 6 and 8; see also the references of
complement $M_{III}$.

Complement $O_{III}$

QUANTUM PROPERTIES OF A PARTICLE
IN A ONE-DIMENSIONAL PERIODIC STRUCTURE


1. Passage through several successive identical potential barriers

a. Notation

b. Matching conditions

c. Iteration matrix Q(α)

d. Eigenvalues of Q(α)

2. Discussion: the concept of an allowed or forbidden energy band

a. Behavior of the wave function $\varphi_\alpha(x)$

b. Bragg reflection; possible energies for a particle in a periodic potential

b. Allowed energy bands: stationary states of the particle inside the lattice

c. Forbidden bands: stationary states localized on the edges


In this complement, we are going to study the quantum properties of a particle
placed in a potential $V(x)$ having a periodic structure. The functions $V(x)$ which
we shall consider will not necessarily be periodic in the strict sense of the term;
it suffices for them to have the shape of a periodic function in a finite region
of the $x$-axis (fig. 1), that is, to be the result of juxtaposing $N$ times the same
motif at regular intervals [$V(x)$ is truly periodic only in the limit $N \to \infty$].

FIGURE 1

Potential $V(x)$ having a periodic structure obtained by juxtaposing $N$ times the same motif
($N = 4$ in the figure).



Such periodic structures are encountered, for example, in the study of a
linear molecule formed by $N$ atoms (or groups of atoms) which are identical
and equally spaced. They are also encountered in solid state physics, when
one chooses a one-dimensional model in order to understand the disposition
of the energy levels of an electron in a crystal. If $N$ is very large (as in the case
of a linear macromolecule or a macroscopic crystal), the potential $V(x)$ is
given in a wide region of space by a periodic function, and the properties
of the particle can be expected to be practically the same as they would be
if $V(x)$ were really periodic. However, from a physical point of view, the limit
of infinite $N$ is never attained, and we shall be concerned here with the case
where $N$ is arbitrary.

To study the effect of the potential $V(x)$ on an eigenfunction $\varphi(x)$ of the
Hamiltonian $H$, of eigenvalue $E$, we shall introduce a 2 x 2 matrix, the iteration
matrix $Q$, which depends on $E$. We shall show that the behavior of $\varphi(x)$ is
totally different depending on whether the eigenvalues of the iteration matrix are
real or complex. Since these eigenvalues depend on the energy $E$ chosen,
we shall find it useful to distinguish between domains of energy corresponding
to real eigenvalues and those which lead to complex eigenvalues. The concept
of an allowed or forbidden energy band will thus be introduced.


COMMENTS:


(i) For the sake of convenience, we shall speak of a “potential barrier”
to designate the motif which, repeated $N$ times, gives the potential
$V(x)$ (fig. 1). However, this motif can also be a “potential well” or
have an arbitrary shape.

(ii) Common usage in solid state physics reserves the letter $k$ to designate a
parameter which is involved in the expression for stationary wave functions
and which is not simply proportional to the square root of the energy.
To conform to this usage, we shall henceforth use a notation slightly different
from that of complement $N_{III}$; we shall replace $k$ by $\alpha$, setting:

$$\alpha = \sqrt{\frac{2mE}{\hbar^2}} \tag{1}$$

and we shall not introduce the letter $k$ until later (we shall see that $k$ is
directly related to the eigenvalues of the matrix $Q$ when they are complex).


1. Passage through several successive identical potential barriers

Consider a potential $V(x)$ which is obtained by juxtaposing $N$ barriers as
in figure 1: the first barrier is centered at $x = 0$, the second at $x = l$, the third
at $x = 2l$, ..., the last at $x = (N - 1)l$. We intend to study the behavior,
during passage through this set of barriers, of an eigenfunction $\varphi_\alpha(x)$ which is a
solution of the eigenvalue equation of $H$:

$$\left\{ \frac{d^2}{dx^2} + \frac{2m}{\hbar^2} \left[ E - V(x) \right] \right\} \varphi_\alpha(x) = 0 \tag{2}$$

where $E$ and $\alpha$ are related by (1).




a. NOTATION

To the left of the $N$ barriers, that is, for $x \le -l/2$, $V(x)$ is zero, and
the general solution of equation (2) is:

$$\text{if } x \le -\frac{l}{2}: \quad \varphi_\alpha(x) = A_0\, e^{i\alpha x} + A'_0\, e^{-i\alpha x} \tag{3-a}$$

Consider, as in § 1-a of complement $N_{III}$, the two functions $v_k(x)$ and
$v'_k(x)$, which here become $v_\alpha(x)$ and $v'_\alpha(x)$. In the region of the first barrier,
centered at $x = 0$, the general solution of (2) is written:

$$\text{if } -\frac{l}{2} \le x \le \frac{l}{2}: \quad \varphi_\alpha(x) = A_1\, v_\alpha(x) + A'_1\, v'_\alpha(x) \tag{3-b}$$

Similarly, in the region of the second barrier, centered at $x = l$, we obtain:

$$\text{if } \frac{l}{2} \le x \le \frac{3l}{2}: \quad \varphi_\alpha(x) = A_2\, v_\alpha(x - l) + A'_2\, v'_\alpha(x - l) \tag{3-c}$$

and, more generally, in the region of the $n$th barrier, centered at $x = (n - 1)l$:

$$\text{if } (n - 1)l - \frac{l}{2} \le x \le (n - 1)l + \frac{l}{2}: \quad \varphi_\alpha(x) = A_n\, v_\alpha[x - (n - 1)l] + A'_n\, v'_\alpha[x - (n - 1)l] \tag{3-d}$$

Finally, to the right of the $N$ barriers, that is, for $x \ge (N - 1)l + l/2$,
$V(x)$ is again zero, and we have:

$$\text{if } x \ge (N - 1)l + \frac{l}{2}: \quad \varphi_\alpha(x) = C_0\, e^{i\alpha[x - (N - 1)l]} + C'_0\, e^{-i\alpha[x - (N - 1)l]} \tag{3-e}$$

We must now match these various expressions for $\varphi_\alpha(x)$ at
$x = -l/2, \ldots, (N - 1)l + l/2$. This is what we shall do in the following
section.


b. MATCHING CONDITIONS

The functions $v_\alpha$ and $v'_\alpha$ depend on the form of the potential chosen.
We shall show, however, that it is simple to calculate them, and their derivatives
as well, at the two edges of each barrier, by using the results of complement $N_{III}$.

To do so, let us imagine that all but one of the barriers are removed,
leaving, for example, the $n$th one, centered at $x = (n - 1)l$. Solution (3-d),
always valid inside this barrier, must then be extended to the left and to the right
by superposing plane waves. These waves are obtained by replacing, in
formulas (6-a) and (6-b) of $N_{III}$, $x$ by $x - (n - 1)l$ and $k$ by $\alpha$, and adding an index $n$
to $A$, $A'$, $\tilde{A}$, $\tilde{A}'$. Thus we have, if the $n$th barrier is isolated:

$$\text{for } x \le (n - 1)l - \frac{l}{2}: \quad A_n\, e^{i\alpha[x - (n - 1)l]} + A'_n\, e^{-i\alpha[x - (n - 1)l]} \tag{4}$$

$$\text{for } x \ge (n - 1)l + \frac{l}{2}: \quad \tilde{A}_n\, e^{i\alpha[x - (n - 1)l]} + \tilde{A}'_n\, e^{-i\alpha[x - (n - 1)l]} \tag{5}$$

with:

$$\begin{pmatrix} \tilde{A}_n \\ \tilde{A}'_n \end{pmatrix} = M(\alpha) \begin{pmatrix} A_n \\ A'_n \end{pmatrix} \tag{6}$$

where, with the change in notation taken into account, $M(\alpha)$ is the matrix
$M(k)$ introduced in complement $N_{III}$. Consequently, at the left edge of the $n$th barrier,
the function $\varphi_\alpha(x)$ defined in (3-d) has the same value and the same derivative
as the superposition of plane waves (4). Similarly, at the right edge of this
barrier, it has the same value and the same derivative as (5). These results
enable us to write simply the matching conditions in the periodic structure.

Thus, at the left edge of the first barrier (that is, at $x = -l/2$), it is
sufficient to note that (3-a) has the same value and the same derivative as
$A_1\, e^{i\alpha x} + A'_1\, e^{-i\alpha x}$, which yields directly:

$$\begin{pmatrix} A_1 \\ A'_1 \end{pmatrix} = \begin{pmatrix} A_0 \\ A'_0 \end{pmatrix} \tag{7}$$

(a result which was obvious from $N_{III}$).

At the right edge of the first barrier, which is the same as the left edge of the 

second one, we must write that A 1 e ,ax + A' e -1 ** and A 2 e'® (jc_,) + A' 2 
have the same value and the same derivative, which yields: 


r a 2 = a x c ic “ 

[ A 2 = A\ 

Similarly, at the junction of the nth and (n + 1)th barriers ($x = nl - \frac{l}{2}$), we
obtain, setting equal the value and derivative of (5) and those of the expression
obtained by replacing $n$ by $n + 1$ in (4):

$$\begin{cases} A_{n+1} = \tilde{A}_n\, e^{i\alpha l} \\ A'_{n+1} = \tilde{A}'_n\, e^{-i\alpha l} \end{cases} \tag{9}$$

Finally, at the right edge of the last barrier $\left[x = (N-1)l + \frac{l}{2}\right]$, we must
write that (3-e) has the same value and the same derivative as the expression
obtained by replacing $n$ by $N$ in (5), which yields:

$$\begin{cases} C_0 = \tilde{A}_N \\ C'_0 = \tilde{A}'_N \end{cases} \tag{10}$$

c. ITERATION MATRIX $Q(\alpha)$


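Let us introduce the matrix $D(\alpha)$ defined by:

$$D(\alpha) = \begin{pmatrix} e^{i\alpha l} & 0 \\ 0 & e^{-i\alpha l} \end{pmatrix} \tag{11}$$

It enables us to write the matching condition (9) in the form:

$$\begin{pmatrix} A_{n+1} \\ A'_{n+1} \end{pmatrix} = D(\alpha) \begin{pmatrix} \tilde{A}_n \\ \tilde{A}'_n \end{pmatrix} \tag{12}$$

that is, taking (6) into account:

$$\begin{pmatrix} A_{n+1} \\ A'_{n+1} \end{pmatrix} = D(\alpha)\, M(\alpha) \begin{pmatrix} A_n \\ A'_n \end{pmatrix} \tag{13}$$

Iterating this equation and using (7), we then obtain:

$$\begin{pmatrix} A_n \\ A'_n \end{pmatrix} = [D(\alpha) M(\alpha)]^{n-1} \begin{pmatrix} A_1 \\ A'_1 \end{pmatrix} = [D(\alpha) M(\alpha)]^{n-1} \begin{pmatrix} A_0 \\ A'_0 \end{pmatrix} \tag{14}$$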


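Finally, matching condition (10) can be transformed by using (6) and (14):

$$\begin{pmatrix} C_0 \\ C'_0 \end{pmatrix} = M(\alpha) \begin{pmatrix} A_N \\ A'_N \end{pmatrix} = M(\alpha)\,[D(\alpha) M(\alpha)]^{N-1} \begin{pmatrix} A_0 \\ A'_0 \end{pmatrix} \tag{15}$$

that is:

$$\begin{pmatrix} C_0 \\ C'_0 \end{pmatrix} = \underbrace{M(\alpha) D(\alpha) M(\alpha) D(\alpha) \cdots D(\alpha) M(\alpha)}_{N \text{ matrices } M(\alpha)} \begin{pmatrix} A_0 \\ A'_0 \end{pmatrix} \tag{16}$$

In this formula, which enables us to go from $\begin{pmatrix} A_0 \\ A'_0 \end{pmatrix}$ to $\begin{pmatrix} C_0 \\ C'_0 \end{pmatrix}$, a matrix $M(\alpha)$
is associated with each barrier, and a matrix $D(\alpha)$ with each interval between
two successive barriers.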

Relations (13) and (14) demonstrate the importance of the role played by
the matrix:

$$Q(\alpha) = D(\alpha)\, M(\alpha) \tag{17}$$

which enters to the nth power when one goes from $\begin{pmatrix} A_1 \\ A'_1 \end{pmatrix}$ to $\begin{pmatrix} A_{n+1} \\ A'_{n+1} \end{pmatrix}$, that is,
when one performs a translation through a distance $nl$ along the periodic
structure. For this reason, we shall call $Q(\alpha)$ the “iteration matrix”. Using
formula (18) of complement N_III and expression (11) for $D(\alpha)$, we obtain:

$$Q(\alpha) = \begin{pmatrix} e^{i\alpha l} F(\alpha) & e^{i\alpha l} G^*(\alpha) \\ e^{-i\alpha l} G(\alpha) & e^{-i\alpha l} F^*(\alpha) \end{pmatrix} \tag{18}$$

The calculation of $[Q(\alpha)]^n$ is facilitated if we change bases so as to make
$Q(\alpha)$ diagonal; for this reason we shall study the eigenvalues of $Q(\alpha)$.

d. EIGENVALUES OF $Q(\alpha)$

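Let $\lambda$ be an eigenvalue of $Q(\alpha)$. The characteristic equation of the matrix (18)
is written:

$$[e^{i\alpha l} F(\alpha) - \lambda]\,[e^{-i\alpha l} F^*(\alpha) - \lambda] - |G(\alpha)|^2 = 0 \tag{19}$$

that is, taking into account relation (21) of complement N_III:

$$\lambda^2 - 2\lambda X(\alpha) + 1 = 0 \tag{20}$$

where $X(\alpha)$ is the real part of the complex number $e^{i\alpha l} F(\alpha)$:

$$X(\alpha) = \mathrm{Re}\,[e^{i\alpha l} F(\alpha)] = \tfrac{1}{2}\,\mathrm{Tr}\, Q(\alpha) \tag{21}$$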
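The passage from (19) to (20) and (21) can be spelled out as follows. This is only a sketch: it assumes that relation (21) of complement N_III, which is not reproduced here, is the normalization $|F(\alpha)|^2 - |G(\alpha)|^2 = 1$ (an assumption consistent with the statement that $|F(\alpha)| > 1$).

```latex
\begin{aligned}
0 &= [e^{i\alpha l}F(\alpha) - \lambda]\,[e^{-i\alpha l}F^*(\alpha) - \lambda] - |G(\alpha)|^2 \\
  &= \lambda^2 - \lambda\,[e^{i\alpha l}F(\alpha) + e^{-i\alpha l}F^*(\alpha)] + |F(\alpha)|^2 - |G(\alpha)|^2 \\
  &= \lambda^2 - 2\lambda\,\mathrm{Re}\,[e^{i\alpha l}F(\alpha)] + 1
   = \lambda^2 - 2\lambda X(\alpha) + 1 .
\end{aligned}
```

The bracket in the second line is the sum of the diagonal elements of (18), which is why $X(\alpha)$ also equals $\tfrac{1}{2}\,\mathrm{Tr}\,Q(\alpha)$; the constant term is the determinant of $Q(\alpha)$, so the product of the two eigenvalues is always 1.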

Recall [cf. complement N_III, relation (21)] that the modulus of $F(\alpha)$ is greater
than 1; the same is therefore true of $e^{i\alpha l} F(\alpha)$.

The discriminant of the second-degree equation (20) is:

$$\Delta' = [X(\alpha)]^2 - 1 \tag{22}$$

Two cases may then arise:

(i) If the energy $E$ is such that:

$$|X(\alpha)| < 1 \tag{23}$$

(for example, if, in figure 2, $\alpha$ is between $\alpha_0$ and $\alpha_1$), one can set:

$$X(\alpha) = \cos[k(\alpha)\, l] \tag{24}$$

with:

$$0 \le k(\alpha) \le \frac{\pi}{l} \tag{25}$$


FIGURE 2

Variation with respect to $\alpha$ of the complex number
$e^{i\alpha l} F(\alpha) = X(\alpha) + iY(\alpha)$.
Since $|F(\alpha)| > 1$, the curve obtained in the complex plane falls outside
the circle centered at O of unit radius. The following discussion
shows that if $|X(\alpha)|$ is less than 1, that is, if the value of $\alpha$ chosen
gives a point of the curve which is between the two vertical dashed
lines of the figure, the corresponding energy falls in an “allowed
band”; in the opposite case, it falls in a “forbidden band”.

A simple calculation then shows that the eigenvalues of $Q(\alpha)$ are given by:

$$\lambda = e^{\pm i k(\alpha) l} \tag{26}$$

There are therefore two eigenvalues, which are complex conjugates and whose
modulus is equal to 1.

(ii) If, on the other hand, the energy $E$ gives a value of $\alpha$ such that:

$$|X(\alpha)| > 1 \tag{27}$$

(for example, if, in figure 2, $\alpha$ is between $\alpha_1$ and $\alpha_2$), one sets:

$$X(\alpha) = \varepsilon \cosh[\rho(\alpha)\, l] \tag{28}$$

with:

$$\rho(\alpha) \ge 0 \tag{29}$$

and $\varepsilon = +1$ if $X(\alpha)$ is positive, $\varepsilon = -1$ if $X(\alpha)$ is negative. We then find:

$$\lambda = \varepsilon\, e^{\pm \rho(\alpha) l} \tag{30}$$

In this case, both eigenvalues of $Q(\alpha)$ are real, and they are each other’s inverse.
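The two cases (26) and (30) can be checked numerically. The sketch below is not taken from the text: it assumes one particular barrier, a delta-function barrier of strength $g$, for which matching the wave function at the barrier gives $F(\alpha) = 1 - i\beta$ and $G(\alpha) = i\beta$ with $\beta = mg/\hbar^2\alpha$. The names `iteration_matrix` and `classify`, and the parameter `P` (standing for $mg/\hbar^2$), are hypothetical and chosen only for illustration.

```python
import numpy as np

def iteration_matrix(alpha, l=1.0, P=3.0):
    """Q(alpha) = D(alpha) M(alpha), eq. (17), for delta barriers (assumed model)."""
    beta = P / alpha                      # beta = m g / (hbar^2 alpha), assumption
    F, G = 1 - 1j * beta, 1j * beta       # satisfies |F|^2 - |G|^2 = 1
    M = np.array([[F, np.conj(G)], [G, np.conj(F)]])
    D = np.diag([np.exp(1j * alpha * l), np.exp(-1j * alpha * l)])  # eq. (11)
    return D @ M

def classify(alpha, l=1.0, P=3.0):
    """Return band type, X(alpha), numerical eigenvalues, and the closed forms (26)/(30)."""
    Q = iteration_matrix(alpha, l, P)
    X = 0.5 * np.trace(Q).real            # eq. (21)
    lam = np.linalg.eigvals(Q)
    if abs(X) < 1:
        k = np.arccos(X) / l              # eqs. (24)-(25): 0 <= k <= pi/l
        expected = np.exp(np.array([1j, -1j]) * k * l)               # eq. (26)
        return "allowed", X, lam, expected
    rho = np.arccosh(abs(X)) / l          # eqs. (28)-(29): rho >= 0
    expected = np.sign(X) * np.exp(np.array([1.0, -1.0]) * rho * l)  # eq. (30)
    return "forbidden", X, lam, expected

if __name__ == "__main__":
    for alpha in (1.0, 2.5):
        band, X, lam, expected = classify(alpha)
        print(f"alpha={alpha}: X={X:+.4f} ({band}), eigenvalues={lam}")
```

With these assumed parameters, $\alpha = 1$ falls where $|X(\alpha)| > 1$ and $\alpha = 2.5$ where $|X(\alpha)| < 1$, so the two calls exercise both branches.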


2. Discussion: the concept of an allowed 
or forbidden energy band 

a. BEHAVIOR OF THE WAVE FUNCTION $\varphi_\alpha(x)$

To apply (14), we begin by calculating the two column matrices $\Lambda_1(\alpha)$
and $\Lambda_2(\alpha)$ associated with the eigenvectors of $Q(\alpha)$ and corresponding respectively
to the eigenvalues $\lambda_1$ and $\lambda_2$. We then decompose the matrix $\begin{pmatrix} A_1 \\ A'_1 \end{pmatrix}$ into the form:

$$\begin{pmatrix} A_1 \\ A'_1 \end{pmatrix} = c_1(\alpha)\, \Lambda_1(\alpha) + c_2(\alpha)\, \Lambda_2(\alpha) \tag{31}$$

which enables us to obtain directly:

$$\begin{pmatrix} A_n \\ A'_n \end{pmatrix} = \lambda_1^{\,n-1}\, c_1(\alpha)\, \Lambda_1(\alpha) + \lambda_2^{\,n-1}\, c_2(\alpha)\, \Lambda_2(\alpha) \tag{32}$$

It is clear from this expression that the behavior of the wave function is very
different depending on whether $|X(\alpha)|$ is smaller or greater than 1 in the energy
domain of the wave function. In the first case, formula (26) shows that the
effect of traversing successive barriers is expressed in (32) by a phase shift in the
components of the column matrix $\begin{pmatrix} A_n \\ A'_n \end{pmatrix}$ with respect to $\Lambda_1(\alpha)$ and $\Lambda_2(\alpha)$. The
behavior of $\varphi_\alpha(x)$ here recalls that of a superposition of imaginary exponentials.
On the other hand, if the energy is such that $|X(\alpha)| > 1$, formula (30) indicates
that only one of the two eigenvalues (for example, $\lambda_1$) has a modulus greater than 1.
For $n$ sufficiently large, we have, as a result:

$$\begin{pmatrix} A_n \\ A'_n \end{pmatrix} \simeq \varepsilon^{\,n-1}\, e^{(n-1)\rho(\alpha) l}\, c_1(\alpha)\, \Lambda_1(\alpha) \tag{33}$$

$A_n$ and $A'_n$ therefore increase exponentially with $n$ [except in the special case
where $c_1(\alpha) = 0$]; the wave function $\varphi_\alpha(x)$ then increases in modulus as it
traverses the successive potential barriers, and its behavior recalls that of a
superposition of real exponentials.
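The contrast between (26) and (33) can be observed by iterating (13) directly. The following sketch again assumes delta-function barriers (an assumption not made in the text), with $F(\alpha) = 1 - i\beta$, $G(\alpha) = i\beta$, $\beta = P/\alpha$; the function names, the value of `P`, and the starting phase `chi` are hypothetical.

```python
import numpy as np

def iteration_matrix(alpha, l=1.0, P=3.0):
    """Q(alpha) = D(alpha) M(alpha) for delta barriers (assumed model)."""
    beta = P / alpha
    F, G = 1 - 1j * beta, 1j * beta
    M = np.array([[F, np.conj(G)], [G, np.conj(F)]])
    D = np.diag([np.exp(1j * alpha * l), np.exp(-1j * alpha * l)])
    return D @ M

def coefficients(alpha, n_max, chi=0.7, l=1.0, P=3.0):
    """Rows are (A_n, A'_n) for n = 1..n_max, obtained by iterating eq. (13)."""
    Q = iteration_matrix(alpha, l, P)
    v = np.array([np.exp(1j * chi / 2), np.exp(-1j * chi / 2)])  # unit-modulus start
    rows = [v]
    for _ in range(n_max - 1):
        v = Q @ v
        rows.append(v)
    return np.array(rows)

def growth_rate(alpha, n_max=30):
    """Ratio of the norms of two successive columns (A_n, A'_n) at large n."""
    norms = np.linalg.norm(coefficients(alpha, n_max), axis=1)
    return norms[-1] / norms[-2]

if __name__ == "__main__":
    print("forbidden band, growth per period:", growth_rate(1.0))
    print("allowed band, largest norm:",
          np.linalg.norm(coefficients(2.5, 200), axis=1).max())
```

In the forbidden-band case the ratio of successive norms should settle at $e^{\rho(\alpha)l}$, as (33) predicts; in the allowed-band case the norm stays bounded however many barriers are traversed.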


b. BRAGG REFLECTION ; POSSIBLE ENERGIES FOR A PARTICLE 

IN A PERIODIC POTENTIAL 

Depending on whether $\varphi_\alpha(x)$ behaves like a superposition of real or
imaginary exponentials, the resulting phenomena can reasonably be expected to be
very different.

Let us evaluate, for example, the transmission coefficient $T_N(\alpha)$ of the set
of $N$ identical barriers. For these $N$ barriers, relation (15) shows that the
matrix $M(\alpha)\,[Q(\alpha)]^{N-1}$ plays a role analogous to the one played by $M(\alpha)$ for
a single barrier. Now, according to relation (29-b) of complement N_III, the
transmission coefficient $T_N(\alpha)$ is expressed in terms of the element of this matrix
which is placed in the first row and the first column [the inverse of $T_N(\alpha)$ is
equal to the square of the modulus of this element]. What happens if the energy $E$
of the particle is chosen so as to make the eigenvalues of $Q(\alpha)$ real, that is, given
by (30)? When $N$ becomes sufficiently large, the eigenvalue $\lambda_1 = \varepsilon\, e^{\rho(\alpha) l}$ becomes
dominant, and the matrix $[Q(\alpha)]^{N-1}$ increases exponentially with $N$ [as can also
be seen from relation (33)]. Consequently, the transmission coefficient decreases
exponentially:

$$T_N(\alpha) \propto e^{-2N\rho(\alpha) l} \tag{34}$$

In this case, for large values of $N$, the set of $N$ potential barriers reflects the
particle practically without fail. This is explained by the fact that the waves
scattered by the different potential barriers interfere totally destructively for the
transmitted wave, and constructively for the reflected wave. This phenomenon can
therefore be likened to Bragg reflection. Note, moreover, that this destructive
interference for the transmitted wave can be produced even if the energy $E$ is
greater than the height of the barrier (a case where, in classical mechanics,
the particle is transmitted).

Nevertheless, if the transmission coefficient of an isolated barrier is very close
to 1, we have $|F(\alpha)| \simeq 1$ [for example, in figure 2, $|F(\alpha)| \to 1$ if $\alpha$, that is,
the energy, approaches infinity]. The point representing the complex number
$e^{i\alpha l} F(\alpha)$ is then very close to the circle of unit radius centered at O. Figure 2
shows that the regions of the energy axis where $|X(\alpha)| > 1$, that is, where total
reflection occurs, are very narrow and can practically be seen as isolated energy
values. Physically, this is explained by the fact that if the energy $E$ of the incident
particle is much larger than the amplitude of variation of the potential $V(x)$,
its momentum is well-defined, as is the associated wavelength. The Bragg condition
$l = n\,\dfrac{\lambda}{2}$ (where $n$ is an integer) then gives well-defined energy values.

If, on the other hand, the energy $E$ of the particle falls in a domain where
the eigenvalues are of modulus 1 as in (26), the elements of the matrix $[Q(\alpha)]^{N-1}$ no
longer approach infinity when $N$ does. Under these conditions, the transmission
coefficient $T_N(\alpha)$ does not approach zero when the number of barriers is increased.
We are again dealing with a purely quantum mechanical phenomenon, related
to the wavelike nature of the wave function, which enables it to propagate in the
regular periodic potential structure without being exponentially attenuated. Note
especially that the transmission coefficient $T_N(\alpha)$ is very different from the product
of the individual transmission coefficients of the barriers taken separately (this
product approaches zero when $N \to \infty$ since all the factors are smaller than 1).
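The behavior of $T_N(\alpha)$ in the two regimes can be illustrated numerically from (15). The sketch below assumes, for definiteness, delta-function barriers with $F(\alpha) = 1 - i\beta$, $G(\alpha) = i\beta$, $\beta = P/\alpha$; this model, the names `barrier_matrices` and `transmission`, and the value of `P` are illustrative assumptions and are not part of the text.

```python
import numpy as np

def barrier_matrices(alpha, l=1.0, P=3.0):
    """Return M(alpha) and Q(alpha) = D(alpha) M(alpha) for delta barriers (assumed model)."""
    beta = P / alpha
    F, G = 1 - 1j * beta, 1j * beta
    M = np.array([[F, np.conj(G)], [G, np.conj(F)]])
    D = np.diag([np.exp(1j * alpha * l), np.exp(-1j * alpha * l)])
    return M, D @ M

def transmission(alpha, N, l=1.0, P=3.0):
    """T_N(alpha): inverse squared modulus of the (1,1) element of M Q^(N-1), cf. eq. (15)."""
    M, Q = barrier_matrices(alpha, l, P)
    total = M @ np.linalg.matrix_power(Q, N - 1)
    return 1.0 / abs(total[0, 0]) ** 2

if __name__ == "__main__":
    for N in (1, 5, 10, 20):
        print(N, transmission(1.0, N), transmission(2.5, N), transmission(2.5, 1) ** N)
```

The three printed columns are $T_N$ at an energy where $|X(\alpha)| > 1$, $T_N$ at an energy where $|X(\alpha)| < 1$, and the product of $N$ single-barrier transmission coefficients at the latter energy. Only the second should remain of order 1 as $N$ grows.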

Another interesting problem, encountered particularly in solid state physics,
is that of the quantization of energy levels for a particle placed in a series of
identical and evenly spaced potential wells, that is, placed in a potential $V(x)$ having
a periodic structure. This problem will be studied in detail in § 3; however, we can
already guess the form of the spectrum of possible energies. If we assume that
the energy of the particle is such that $|X(\alpha)| > 1$, equation (33) shows that the
coefficients $A_n$ and $A'_n$ become infinite when $n \to \infty$. It is clear that this possibility
must be rejected, since it means that the wave function does not remain bounded.
The corresponding energies are therefore forbidden; hence, the name of forbidden
bands given to the energy domains for which $|X(\alpha)| > 1$. On the other hand, if
the energy of the particle is such that $|X(\alpha)| < 1$, $A_n$ and $A'_n$ remain bounded
when $n \to \infty$; the corresponding regions of the energy axis are called allowed
bands. To sum up, the energy spectrum is composed of finite intervals inside which
all the energies are acceptable, separated by regions all of whose energies are
forbidden.


3. Quantization of energy levels in a periodic potential; 
effect of boundary conditions 


Consider a particle of mass $m$ placed in the potential $V(x)$ shown in figure 3. In the
region $-\frac{l}{2} \le x \le Nl + \frac{l}{2}$, $V(x)$ has the form of a periodic function, composed of a series
of $N + 1$ successive barriers of height $V_0$, centered at $x = 0, l, 2l, \ldots, Nl$. Outside this region,
$V(x)$ undergoes arbitrary variations over distances comparable to $l$, then becomes equal to
a positive constant value $V_e$. In what follows, the region $[0, Nl]$ will be called “inside the
lattice” and the limiting regions $x \simeq -\frac{l}{2}$ and $x \simeq Nl + \frac{l}{2}$, “ends (or edges) of the lattice”.

Physically, such a function $V(x)$ can represent the potential seen by an electron in
a linear molecule or in a crystal (in a one-dimensional model). The potential wells situated
at $x = \frac{l}{2}, \frac{3l}{2}, \ldots$ then correspond to the attraction of the electron by the various ions. Far from
the crystal (or the molecule), the electron is not subject to any attractive forces, which is
why $V(x)$ rapidly becomes constant outside the region $-\frac{l}{2} \le x \le Nl + \frac{l}{2}$.

FIGURE 3

Variation with respect to $x$ of the potential seen by an electron in a “one-dimensional crystal” and
on its edges. Inside the crystal, the potential has a periodic structure; $V(x)$ is maximum between
the ions (barriers at $x = 0, l, 2l, \ldots$) and minimum at the positions of the ions (wells at $x = l/2$,
$3l/2, \ldots$). On the edges of the crystal, $V(x)$ varies in a more or less complicated way over a distance
comparable to $l$, then rapidly approaches a constant value $V_e$.


The potential $V(x)$ that we have chosen fits perfectly into the framework of complement
M_III (apart from a change in the energy origin). We already know, therefore, that the
bound states of the particle form a discrete spectrum of energies, all less than $V_e$. However, the
potential $V(x)$ picked here also presents the remarkable peculiarity of having a periodic structure
of the type of those considered in § 1 above; relying on the results of this section, we shall
show that the conclusions of complement M_III take on a special form in this case. For example,
we stressed the fact in complement M_III that it is the boundary conditions [$\varphi(x) \to 0$ when
$x \to \pm\infty$] which introduce the quantization of the energy levels. The boundary conditions
of the problem we are studying here, that is, the variation of the potential at the edges of the
lattice, might thus be expected to play a critical role in determining the possible energies.
Actually, this is not at all the case: we shall see that these energies depend practically only on
the values of $V(x)$ in the region where it is periodic, and not on the edge effects (on condition,
of course, that the number of potential wells is sufficiently large). In addition, we shall verify
the result obtained intuitively in § 2-b, showing that most of the possible energies are grouped
in allowed energy bands. Only a few stationary states, localized near the edges, depend in
a critical manner on the variation of $V(x)$ in this region and can have an energy which falls in
a forbidden band.

We shall therefore proceed essentially as in complement M_III, first examining precisely
the conditions imposed on the wave function $\varphi_\alpha(x)$ of a stationary state.


a. CONDITIONS IMPOSED ON THE WAVE FUNCTION 


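In the region where $V(x)$ is periodic, relation (3-d) gives the form of the wave function
$\varphi_\alpha(x)$; the coefficients $A_n$ and $A'_n$ are determined from (32). To write (32) more explicitly,
let us set:

$$c_1(\alpha)\,\Lambda_1(\alpha) = \begin{pmatrix} f_1(\alpha) \\ f'_1(\alpha) \end{pmatrix}, \qquad c_2(\alpha)\,\Lambda_2(\alpha) = \begin{pmatrix} f_2(\alpha) \\ f'_2(\alpha) \end{pmatrix} \tag{35}$$

We then obtain:

$$\begin{cases} A_n = f_1(\alpha)\, \lambda_1^{\,n-1} + f_2(\alpha)\, \lambda_2^{\,n-1} \\ A'_n = f'_1(\alpha)\, \lambda_1^{\,n-1} + f'_2(\alpha)\, \lambda_2^{\,n-1} \end{cases} \tag{36}$$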


Now let us examine the boundary conditions on the wave function $\varphi_\alpha(x)$. First of all,
to the left, far from the lattice, $V(x)$ is equal to $V_e$ and $\varphi_\alpha(x)$ is written in the form:

$$\varphi_\alpha(x) = B\, e^{\mu(\alpha) x} \tag{37-a}$$

with:

$$\mu(\alpha) = \sqrt{\frac{2m}{\hbar^2}\,(V_e - E)} \tag{37-b}$$

(we eliminate the solution in $e^{-\mu(\alpha) x}$, which diverges when $x \to -\infty$). The probability current
associated with the function (37) is zero (cf. complement B_III, § 1). Now, for a stationary state,
this current is independent of $x$ [cf. complement N_III, relation (12)]; it therefore remains zero
at all $x$, even inside the lattice. According to relation (14) of complement N_III, the coefficients $A_n$
and $A'_n$ therefore necessarily have the same modulus*. Thus, if we choose to express the boundary
conditions on the left as relations between the coefficients $A_1$ and $A'_1$ [that is, by writing that
the expression for $\varphi_\alpha(x)$ for $-\frac{l}{2} \le x \le \frac{l}{2}$ is the extension of the wave function (37)], we
find a relation of the form:

$$\frac{A_1}{A'_1} = e^{i\chi(\alpha)} \tag{38-a}$$

$\chi(\alpha)$ is a real function of $\alpha$ (and therefore of the energy $E$) which depends on the precise behavior
of $V(x)$ at the left-hand edge of the lattice [in what follows, we shall not need the exact expression
for this function $\chi(\alpha)$; the essential point is that the boundary conditions on the left have the
form (38-a)].

The same type of reasoning can obviously be applied on the right ($x \to +\infty$), where
the boundary conditions are written:

$$\frac{A_{N+1}}{A'_{N+1}} = e^{i\chi'(\alpha)} \tag{38-b}$$

where the real function $\chi'(\alpha)$ depends on the behavior of $V(x)$ on the right-hand edge of the
lattice.

To sum up, we can say that the quantization of the energy levels can be obtained in the
following manner:

- we start with two coefficients $A_1$ and $A'_1$ which satisfy (38-a); this ensures that the
function $\varphi_\alpha(x)$ will remain bounded when $x \to -\infty$. Since $\varphi_\alpha(x)$ is defined to within
a constant factor, we can choose, for example:

$$\begin{cases} A_1 = e^{i\chi(\alpha)/2} \\ A'_1 = e^{-i\chi(\alpha)/2} \end{cases} \tag{39}$$

- we then calculate, using (36), the coefficients $A_n$ and $A'_n$ so as to extend the wave
function chosen throughout all the crystal. Note that the condition (39) implies that $\varphi_\alpha(x)$ is
real (cf. complement N_III, § 1-b); calculation of $A_n$ and $A'_n$ must therefore yield:

$$A'_n = A_n^* \tag{40}$$

- finally, we write that the coefficients $A_{N+1}$ and $A'_{N+1}$ satisfy (38-b), a relation which
ensures that $\varphi_\alpha(x)$ will remain bounded when $x \to +\infty$. In fact, relation (40) shows that
the ratio $A_{N+1}/A'_{N+1}$ is automatically a complex number of unit modulus; condition (38-b)
therefore amounts to an equality between the phases of two complex numbers. We thus obtain
a real equation in $\alpha$, which has a certain number of real solutions giving the allowed energies.

We are going to apply this method, distinguishing between two cases: real eigenvalues
of $Q(\alpha)$ [the case where $|X(\alpha)| > 1$] and complex ones of unit modulus [the case where $|X(\alpha)| < 1$].


b. ALLOWED ENERGY BANDS: 

STATIONARY STATES OF THE PARTICLE INSIDE THE LATTICE 

First assume that the energy $E$ is in a domain where $|X(\alpha)| < 1$.


α. Form of the quantization equation

Taking (26) into account, relations (36) become:

$$\begin{cases} A_n = f_1(\alpha)\, e^{i(n-1)k(\alpha) l} + f_2(\alpha)\, e^{-i(n-1)k(\alpha) l} \\ A'_n = f'_1(\alpha)\, e^{i(n-1)k(\alpha) l} + f'_2(\alpha)\, e^{-i(n-1)k(\alpha) l} \end{cases} \tag{41}$$

Also, we have seen that the choice (39) of $A_1$ and $A'_1$ implies that $A'_n = A_n^*$ for all $n$. Now, it is
easy to show that relations (41) yield two complex conjugate numbers only if:

$$\begin{cases} f_1^*(\alpha) = f'_2(\alpha) \\ f_2^*(\alpha) = f'_1(\alpha) \end{cases} \tag{42}$$

Condition (38-b) can then be written:

$$\frac{f_1(\alpha)\, e^{2iNk(\alpha) l} + f_2(\alpha)}{f_2^*(\alpha)\, e^{2iNk(\alpha) l} + f_1^*(\alpha)} = e^{i\chi'(\alpha)} \tag{43}$$

This equation in $\alpha$ is the one which gives the quantization of the energy levels. To solve it,
let us set:

$$\Theta(\alpha) = \mathrm{Arg}\left\{ \frac{f_1^*(\alpha)\, e^{i\chi'(\alpha)/2} - f_2(\alpha)\, e^{-i\chi'(\alpha)/2}}{f_1(\alpha)\, e^{-i\chi'(\alpha)/2} - f_2^*(\alpha)\, e^{i\chi'(\alpha)/2}} \right\} \tag{44}$$

[$\Theta(\alpha)$ can, in principle, be calculated from $\chi(\alpha)$, $\chi'(\alpha)$ and the matrix $Q(\alpha)$]. Equation (43) can
then be written simply:

$$e^{2iNk(\alpha) l} = e^{i\Theta(\alpha)} \tag{45}$$

The energy levels are therefore given by:

$$k(\alpha) = \frac{\Theta(\alpha)}{2Nl} + p\,\frac{\pi}{Nl} \tag{46}$$

with:

$$p = 0, 1, 2, \ldots, (N-1) \tag{47}$$

[the other values of $p$ must be excluded since condition (25) here forces $k(\alpha)$ to vary within an
interval of width $\pi/l$]. We can already see that if $N$ is very large, we can write equation (46) in
the simplified form:

$$k(\alpha) \simeq p\,\frac{\pi}{Nl} \tag{48}$$

β. Graphical solution; locating the energy levels

If we substitute definition (24) of $k(\alpha)$ into (46), we obtain an equation in $\alpha$ which gives
the allowed energies. To solve it graphically, let us begin by tracing the curve which represents
the function $X(\alpha) = \mathrm{Re}\,[e^{i\alpha l} F(\alpha)]$. Because of the imaginary exponential $e^{i\alpha l}$, we expect this
curve to have an oscillatory behavior, of the type of that shown in figure 4-a. Since $|F(\alpha)|$ is
greater than 1 [cf. complement N_III, relation (32)], the amplitude of the oscillation is greater
than 1, so the curve intersects the two straight lines $X(\alpha) = \pm 1$ at certain values $\alpha_0, \alpha_1, \alpha_2, \ldots$
of the variable $\alpha$. We then eliminate all regions of the $\alpha$-axis, bounded by these values, where
the condition $|X(\alpha)| < 1$ is not satisfied. Using the set of arcs of curves thus obtained for $X(\alpha)$,
we must represent the function:

$$k(\alpha) = \frac{1}{l}\, \mathrm{Arc\,cos}\, X(\alpha) \tag{49}$$

FIGURE 4

Variation with respect to $\alpha$ of $X(\alpha) = \mathrm{Re}\,[F(\alpha)\, e^{i\alpha l}]$ (see fig. 2) and of $k(\alpha) = \frac{1}{l}\,\mathrm{Arc\,cos}\,[X(\alpha)]$. The
values of $\alpha$ (that is, of the energy $E$) associated with stationary states are obtained (if $N \gg 1$) by
cutting the curve which represents $k(\alpha)$ with the horizontal lines whose equations are
$y = p\pi/Nl$ ($p = 0, 1, 2, \ldots, N-1$). The allowed bands are thus revealed. Each includes $N$ levels
which are very close together (intervals $\alpha_0 \le \alpha \le \alpha_1$, etc.). The forbidden bands are represented by
the shaded areas ($\alpha_1 < \alpha < \alpha_2$, etc.).

The dashed-line curves correspond to the special case where $V(x) = 0$ (a free particle).

Taking into account the form of the Arc cosine function (cf. fig. 5), we are led to the curve
whose shape is shown in figure 4-b. Equation (46) indicates that the energy levels correspond
to the intersections of this curve with those which represent the functions $\dfrac{\Theta(\alpha)}{2Nl} + p\,\dfrac{\pi}{Nl}$, that is,
if $N \gg 1$, with the horizontal lines whose equations are $y = p\,\dfrac{\pi}{Nl}$ (with $p = 0, 1, 2, \ldots, N-1$).

We thus obtain groups of $N$ levels, associated with equidistant values of $k(\alpha)$ and
situated in the allowed bands defined by $\alpha_0 \le \alpha \le \alpha_1$, $\alpha_2 \le \alpha \le \alpha_3$, etc. Between these allowed
bands are the forbidden bands (we shall examine their properties in §c).
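The graphical construction can be mimicked numerically in the large-$N$ limit (48), where the edge term $\Theta(\alpha)/2Nl$ is dropped. The sketch below assumes delta-function barriers, for which $X(\alpha) = \cos\alpha l + (P/\alpha)\sin\alpha l$ with $P = mg/\hbar^2$. This model is an illustrative assumption, and `X`, `bisect` and `first_band_levels` are hypothetical names. For this model $X(\alpha)$ decreases monotonically from a value above 1 to $-1$ on $0 < \alpha \le \pi/l$, so each horizontal line $y = p\pi/Nl$ cuts the first arc of $k(\alpha)$ exactly once.

```python
import numpy as np

def X(alpha, l=1.0, P=3.0):
    """X(alpha) = Re[e^{i alpha l} F(alpha)] for delta barriers (assumed model)."""
    return np.cos(alpha * l) + (P / alpha) * np.sin(alpha * l)

def bisect(f, a, b, tol=1e-12):
    """Plain bisection; assumes f(a) and f(b) have opposite signs."""
    fa = f(a)
    while b - a > tol:
        mid = 0.5 * (a + b)
        fm = f(mid)
        if fa * fm <= 0:
            b = mid
        else:
            a, fa = mid, fm
    return 0.5 * (a + b)

def first_band_levels(N, l=1.0, P=3.0):
    """Solve cos[k l] = X(alpha) with k = p pi / (N l), p = 0..N-1, in the first band."""
    levels = []
    for p in range(N):
        target = np.cos(p * np.pi / N)
        levels.append(bisect(lambda a: X(a, l, P) - target, 1e-9, np.pi / l))
    return np.array(levels)

if __name__ == "__main__":
    alphas = first_band_levels(10)
    print("alpha values:", alphas)
    print("energies in units of hbar^2 / 2m:", alphas ** 2)
```

Increasing `N` adds levels inside the same interval of $\alpha$ without moving its edges, which is the numerical counterpart of the statement that each allowed band contains $N$ closely spaced levels.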



figure 5 

The Arc cosine function. 


If we consider a particular allowed band, we can locate each level according to the value
of $k(\alpha)$ which corresponds to it. This leads to choosing $k$ as the variable and considering $\alpha$ and,
consequently, $E$ as functions $\alpha(k)$ and $E(k)$ of $k$. The variation of $\alpha$ with respect to $k$ is given
directly by the curve of figure 4-b, so it suffices to evaluate the function $E = \dfrac{\hbar^2 \alpha^2}{2m}$ to obtain the
energy $E(k)$. The corresponding curve has the shape shown in figure 6.

FIGURE 6

Variation of the energy with respect to the parameter $k$. The solid lines correspond to the energies for
the first two allowed bands (the values of $k$ which give the energy levels being equidistant inside the
interval $0 \le k \le \pi/l$). The dashed lines correspond to the special case where the potential $V(x)$ is zero
(a free particle); the allowed bands are contiguous, and there are no forbidden bands.

COMMENT:

It is clear from figure 4-b that, to a given value of $k$, correspond several values
of $\alpha$ and therefore of the energy; this is why several arcs appear in figure 6. Nevertheless,
if, within a given allowed band, $X(\alpha)$ increases steadily from $-1$ to $+1$ (or decreases
steadily from $+1$ to $-1$), only one energy level corresponds to each value of $k$ for this
band, and this band includes $N$ energy levels.


γ. Discussion

The preceding calculations show how, when we go from $N = 1$ to very high values of $N$,
we move gradually from a set of discrete levels to allowed energy bands. Rigorously, these
bands are formed by discrete levels, but their separation is so small for a macroscopic lattice
that they practically constitute a continuum. When $k$ is taken as a parameter, the density of
states (the number of possible energies per unit interval of $k$) is constant and equal to $Nl/\pi$.
This property, which is very useful, explains why $k$ is generally chosen as the variable.

An important point appears in going from (46) to (48): when $N$ is large, the edge
effects of the lattice, which enter only through the intermediary of the functions $\chi(\alpha)$, $\chi'(\alpha)$ and,
in (46), $\Theta(\alpha)$, no longer play any role; only the form of the periodic potential inside the lattice
is important in determining the possible energies.

It is interesting to consider the two following limiting cases:

(i) If $V(x) = 0$ (free particle), we have:

$$\begin{cases} F(\alpha) = 1 \\ X(\alpha) = \cos \alpha l \end{cases} \tag{50}$$

and we obtain:

$$\begin{cases} \text{if } 0 \le \alpha \le \dfrac{\pi}{l}: & k(\alpha) = \alpha \\[2mm] \text{if } \dfrac{\pi}{l} \le \alpha \le \dfrac{2\pi}{l}: & k(\alpha) = \dfrac{2\pi}{l} - \alpha \\[2mm] \text{etc.} \end{cases} \tag{51}$$

(the corresponding broken line is shown in figure 4-b as a dashed line). Relation (50) shows
that the condition $|X(\alpha)| \le 1$ is always satisfied: as we know, forbidden bands do not exist for
a free particle.
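The broken line (51) follows directly from applying definition (49) to $X(\alpha) = \cos\alpha l$. A minimal sketch, in which the function name `k_free` is hypothetical:

```python
import numpy as np

def k_free(alpha, l=1.0):
    """k(alpha) = (1/l) Arc cos[cos(alpha l)]: eq. (49) applied to the free-particle X of eq. (50)."""
    return np.arccos(np.cos(alpha * l)) / l

if __name__ == "__main__":
    for alpha in (0.5, np.pi + 0.5, 2 * np.pi + 0.3):
        print(alpha, k_free(alpha))
```

Because the principal value of the Arc cosine lies between 0 and $\pi$, the straight line $k = \alpha$ is folded back into the interval $0 \le k \le \pi/l$ each time $\alpha$ crosses a multiple of $\pi/l$, which produces the successive branches of (51).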

Figure 6 thus enables us to see the effect of the potential $V(x)$ on the curve $E(k)$. When
forbidden bands appear, the curves representing the energy become deformed so as to have
horizontal tangents for $k = 0$ and $k = \pi/l$ (edges of the band). Unlike what happens for a free
particle, there exists a point of inflection for each band where the energy varies linearly with $k$.

(ii) If the transmission coefficient $T(\alpha)$ is practically zero, we have [cf. complement N_III,
equations (29) and (21)]:

$$\begin{cases} |F(\alpha)| \gg 1 \\ |G(\alpha)| \gg 1 \end{cases} \tag{52}$$

In figure 2, the point representing the complex number $e^{i\alpha l} F(\alpha)$ is very far from the origin.
We thus see in this figure that the regions of the $\alpha$-axis where $|X(\alpha)| < 1$ are extremely narrow.
The allowed bands therefore shrink if the transmission coefficient of the elementary barrier
decreases; in the limit of zero transmission, they reduce to individual levels in an isolated well.
Inversely, as soon as the tunnel effect allows the particle to pass from one well to the next one,
each of the discrete levels of the well gives rise to an energy band, whose width increases as the
transmission coefficient grows. We shall return to this property in complement F_XI.

c. FORBIDDEN BANDS: STATIONARY STATES LOCALIZED ON THE EDGES

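α. Form of the equations; energy levels

Let us now assume that $\alpha$ belongs to a domain where $|X(\alpha)| > 1$. According to (30),
relations (36) can then be written:

$$\begin{cases} A_n = \varepsilon^{\,n-1}\,[f_1(\alpha)\, e^{(n-1)\rho(\alpha) l} + f_2(\alpha)\, e^{-(n-1)\rho(\alpha) l}] \\ A'_n = \varepsilon^{\,n-1}\,[f'_1(\alpha)\, e^{(n-1)\rho(\alpha) l} + f'_2(\alpha)\, e^{-(n-1)\rho(\alpha) l}] \end{cases} \tag{53}$$

The fact that $A'_n = A_n^*$ for all $n$ means that we must have here:

$$\begin{cases} f'_1(\alpha) = f_1^*(\alpha) \\ f'_2(\alpha) = f_2^*(\alpha) \end{cases} \tag{54}$$

The quantization condition (38-b) then takes on the form:

$$\frac{A_{N+1}}{A'_{N+1}} = \frac{f_1(\alpha) + f_2(\alpha)\, e^{-2N\rho(\alpha) l}}{f_1^*(\alpha) + f_2^*(\alpha)\, e^{-2N\rho(\alpha) l}} = e^{i\chi'(\alpha)} \tag{55}$$

that is:

$$e^{-2N\rho(\alpha) l} = L(\alpha) \tag{56}$$

where the real function $L(\alpha)$ is defined by:

$$L(\alpha) = -\,\frac{f_1^*(\alpha)\, e^{i\chi'(\alpha)/2} - f_1(\alpha)\, e^{-i\chi'(\alpha)/2}}{f_2^*(\alpha)\, e^{i\chi'(\alpha)/2} - f_2(\alpha)\, e^{-i\chi'(\alpha)/2}} \tag{57}$$

Consider the case where $N \gg 1$; we then have $e^{-2N\rho(\alpha) l} \simeq 0$, and equation (56) reduces
to:

$$L(\alpha) = 0 \tag{58}$$

The energy levels situated in the forbidden bands are therefore given by the zeros of the function
$L(\alpha)$ (cf. fig. 7). $N$ enters neither into (57) nor into (58), so the number of these levels does not
depend on $N$ (unlike the number of levels situated in an allowed band). Consequently, when $N \gg 1$,
it can be said that practically all the levels are grouped in the allowed bands.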


β. Discussion

The situation here is radically different from the one encountered in §b: the number $N$,
that is, the length of the lattice, plays no role (provided, nevertheless, that it is sufficiently
large); on the other hand, definition (57) of $L(\alpha)$ shows that the functions $\chi(\alpha)$ and $\chi'(\alpha)$ play
an essential role in the problem. Since we already know that these functions depend on the
behavior of $V(x)$ on the edges of the lattice, we expect to obtain states localized in these regions.

This is indeed the case. Equations (57) and (58) offer two possibilities:

(i) if $f_1(\alpha) \ne 0$, the fact that $L(\alpha) = 0$ requires that:

$$\frac{f_1(\alpha)}{f_1^*(\alpha)} = \frac{f_1(\alpha)}{f'_1(\alpha)} = e^{i\chi'(\alpha)} \tag{59}$$

Let us return to definition (35) of $f_1(\alpha)$ and $f'_1(\alpha)$; we see that relation (59) shows that the wave
function constructed from the first eigenvector of $Q(\alpha)$ satisfies the boundary conditions on
the right. This is easy to understand: if we start at $x = 0$ with an arbitrary wave function which
satisfies the boundary conditions on the left, the matrix $\begin{pmatrix} A_1 \\ A'_1 \end{pmatrix}$ has components on the two
eigenvectors of $Q(\alpha)$; the coefficients $A_{N+1}$ and $A'_{N+1}$ are then (if $N \gg 1$) essentially given by (33),
which expresses the fact that the matrix $\begin{pmatrix} A_{N+1} \\ A'_{N+1} \end{pmatrix}$ is proportional to the column matrix of the
first eigenvector of $Q(\alpha)$.

Note that since the eigenvalue $\lambda_1(\alpha)$ has a modulus greater than 1, the wave function grows
exponentially when $x$ increases. The stationary state given by the first eigenvector of $Q(\alpha)$ is
therefore localized at the right end of the lattice.

(ii) if $f_1(\alpha) = 0$, (54) gives $f'_1(\alpha) = 0$, and definitions (35) imply that $c_1(\alpha) = 0$: the
corresponding stationary state is associated with the second eigenvector of $Q(\alpha)$. Aside from
the fact that this state is localized at the left end of the lattice, the conclusions obtained in (i)
remain valid.

FIGURE 7

Variation of $L(\alpha)$ with respect to $\alpha$ in a forbidden band. The zeros of $L(\alpha)$
give the stationary states which are localized on the edges of the lattice.

References and suggestions for further reading : 

Merzbacher (1.16), chap. 6, §7; Flügge (1.24), §§28 and 29; Landau and
Lifshitz (1.19), §104; see also solid state physics texts (section 13 of the bibliography).

A. QUANTIZATION OF THE ANGULAR MOMENTUM


In this chapter, we intend to illustrate the postulates of quantum mechanics, 
which we stated and discussed in chapter III. We shall apply them to simple 
concrete cases, in which the dimension of the state space is finite (equal to two). 
The interest of these examples is not confined to their mathematical simplicity, 
which will allow a better understanding of the postulates and their consequences. 
It is also based on their physical importance: they exhibit typically quantum 
mechanical behavior which can be verified experimentally. 

In §§A and B, we shall study the spin 1/2 case (which we shall take up again
in more detail in chapter IX). First, we shall describe (§A-1) a fundamental
experiment which revealed the quantization of a simple physical quantity, the angular
momentum. We shall see that the component along Oz of the angular momentum
(or magnetic moment) of a neutral paramagnetic atom can take on only certain
values, which belong to a discrete set. Thus, for a silver atom in its ground state,
there are only two possible values ($+\hbar/2$ and $-\hbar/2$) for the component $S_z$ of
its angular momentum: a silver atom in the ground state is said to be a spin 1/2
particle. In §A-2, we indicate how quantum mechanics describes the “spin variables”
of such a particle. In situations where one can dispense with a quantum treatment
of the “external variables” r and p, the state space of the particle (“spin state space”)
has only two dimensions. We shall then (§B) be able to illustrate and discuss the
quantum mechanical postulates in this particularly simple case. We shall first see how
to prepare silver atoms in any desired arbitrary spin state, in a real experiment.
We shall then show how the measurement of the physical values of the spin on
such silver atoms enables us to verify the quantum mechanical postulates
experimentally. By integrating the corresponding Schrödinger equation, we shall study the
evolution of a spin 1/2 particle in a uniform magnetic field (Larmor precession).
Finally, in §C, we shall begin the study of two-level systems. Although these systems 
are not generally spin 1/2 particles, their study leads to calculations which are 
very similar to those developed in §§A and B. We shall treat in detail the effect 
of an external perturbation on the stationary states of a two-level system and use 
this very simple model to point out important physical effects. 


A. SPIN 1/2 PARTICLE: 

QUANTIZATION OF THE ANGULAR MOMENTUM 

1. Experimental demonstration 

First of all, we are going to describe and analyze the Stern-Gerlach
experiment, which demonstrated the quantization of the components of an angular
momentum (sometimes called “space quantization”).

a. THE STERN-GERLACH APPARATUS

The experiment consists of studying the deflection of a beam of neutral
paramagnetic atoms (in this case, silver atoms) in a highly inhomogeneous magnetic
field. The apparatus used is shown schematically in figure 1 *.




FIGURE 1

Schematic drawing of the Stern-Gerlach experiment. Figure a shows the trajectory of a silver atom
emitted from the high-temperature furnace E. This atom is deflected by the gradient of the magnetic
field created by the electromagnet A and then condenses at N on plate P.

Figure b shows a cross section in the xOz plane of the electromagnet A; the lines of force of the
magnetic field are shown in dashed lines. $B_z$ has been assumed to be positive and $\partial B_z/\partial z$, negative.
Consequently, the trajectory of figure a corresponds to a negative component $\mathcal{M}_z$ of the magnetic
moment, that is, to a positive component of $\mathcal{S}_z$ ($\gamma$ is negative for a silver atom).


Silver atoms contained in a furnace E, which is heated to a high temperature, 
leave through a small opening and propagate in a straight line in the high vacuum 
which exists inside the whole apparatus. A collimating slit F selects those atoms 
whose velocity is parallel to a particular direction that we shall choose for the Oy 
axis. The atomic beam thus constructed traverses the gap of an electromagnet A 
before condensing on a plate P. 

Let us describe the characteristics of the magnetic field $\mathbf{B}$ produced by the
electromagnet A. This magnetic field has a plane of symmetry (which we shall
designate by yOz) which contains the initial direction Oy of the atomic beam.
In the air-gap, it is the same at all points situated on any given line parallel to Oy
(the edges of the electromagnet are parallel to Oy, and we neglect edge effects).
$\mathbf{B}$ has no component along Oy. Its largest component is along Oz; it varies strongly
with z: in figure 1-b, the field lines are much closer together close to the north pole
than close to the south pole of the magnet. Of course, since the magnetic field has
a conserved flux ($\operatorname{div} \mathbf{B} = 0$), it must also have a component along Ox which varies
with the distance x from the plane of symmetry.


* We shall indicate here only the most important characteristics of this equipment. A more
detailed description of the experimental technique can be found in a book on atomic physics.

b. CLASSICAL CALCULATION OF THE DEFLECTION* 

Note, first, that the silver atoms, being neutral, are not subject to the
Lorentz force. On the other hand, they possess a permanent magnetic moment $\boldsymbol{\mathcal{M}}$
(they are paramagnetic atoms); the resulting forces are derived from the potential
energy:

$$W = -\boldsymbol{\mathcal{M}} \cdot \mathbf{B}$$    (A-1)

The existence, for an atom, of an electronic magnetic moment $\boldsymbol{\mathcal{M}}$ and an
angular momentum $\boldsymbol{\mathcal{S}}$ is due to two causes: the motion of the electrons about
the nucleus (the corresponding rotation of the charges being responsible for the
appearance of an orbital magnetic moment) and the intrinsic angular momentum,
or spin (cf. chapter IX), of the electrons, with which is associated a spin magnetic
moment. It can be shown (as we shall assume here without proof) that, for a given
atomic level, $\boldsymbol{\mathcal{M}}$ and $\boldsymbol{\mathcal{S}}$ are proportional**:

$$\boldsymbol{\mathcal{M}} = \gamma\, \boldsymbol{\mathcal{S}}$$    (A-2)

The proportionality constant $\gamma$ is called the gyromagnetic ratio of the level under
consideration.

Before the atoms traverse the electromagnet, the magnetic moments of the
silver atoms which constitute the atomic beam are oriented randomly (isotropically).
Let us study the action of the magnetic field on one of these atoms, whose magnetic
moment $\boldsymbol{\mathcal{M}}$ has a given direction at the entrance of the air-gap. From expression
(A-1) for the potential energy, it is easy to deduce that the resultant of the
forces exerted on the atom is:

$$\mathbf{F} = \boldsymbol{\nabla}(\boldsymbol{\mathcal{M}} \cdot \mathbf{B})$$    (A-3)

(this resultant would be equal to zero if the field $\mathbf{B}$ were uniform), and that their
total moment relative to the position of the atom is:

$$\boldsymbol{\Gamma} = \boldsymbol{\mathcal{M}} \times \mathbf{B}$$    (A-4)


* We shall confine ourselves here to outlining this calculation; for more details, we refer the 
reader to a book on atomic physics. 

** In the case of silver atoms in the ground state (like those of the beam), the angular momentum
$\boldsymbol{\mathcal{S}}$ is simply equal to the spin of the outer electron, which is therefore solely responsible for the
existence of the magnetic moment $\boldsymbol{\mathcal{M}}$. This is because the outer electron has a zero orbital angular momentum,
and the resultant orbital and spin angular momenta of the inner electrons are also zero. Moreover, the
experimental conditions realized in practice are such that effects linked to the spin of the nucleus are
negligible. This is why the silver atom in the ground state, like the electron, has a spin 1/2.






CHAPTER IV SPIN 1/2 AND TWO-LEVEL SYSTEMS 


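The angular momentum theorem can be written:

$$\frac{d\boldsymbol{\mathcal{S}}}{dt} = \boldsymbol{\Gamma}$$    (A-5)

that is:

$$\frac{d\boldsymbol{\mathcal{S}}}{dt} = \gamma\, \boldsymbol{\mathcal{S}} \times \mathbf{B}$$    (A-6)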

The atom thus behaves like a gyroscope (fig. 2): $d\boldsymbol{\mathcal{S}}/dt$ is perpendicular to $\boldsymbol{\mathcal{S}}$, and
the angular momentum turns about the magnetic field, the angle $\theta$ between $\boldsymbol{\mathcal{S}}$ and $\mathbf{B}$
remaining constant. The rotational angular velocity is equal to the product of the
gyromagnetic ratio $\gamma$ and the modulus of the magnetic field. The components of $\boldsymbol{\mathcal{M}}$
which are perpendicular to the magnetic field therefore oscillate about zero, the
component parallel to $\mathbf{B}$ remaining constant.



FIGURE 2

The silver atom possesses a magnetic moment $\boldsymbol{\mathcal{M}}$ and an angular
momentum $\boldsymbol{\mathcal{S}}$ which are proportional. Consequently, the
effect of a uniform magnetic field $\mathbf{B}$ is to cause $\boldsymbol{\mathcal{M}}$ to turn
about $\mathbf{B}$ with a constant angular velocity (Larmor precession).
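
The gyroscopic motion described by (A-6) can be checked numerically. The following is only a minimal sketch, not part of the original treatment: it integrates $d\boldsymbol{\mathcal{S}}/dt = \gamma\, \boldsymbol{\mathcal{S}} \times \mathbf{B}$ with a fourth-order Runge-Kutta scheme for a uniform field along Oz. The function names, the arbitrary units ($\gamma = -1$, $B = 2$) and the step count are assumptions chosen for illustration.

```python
import math

def cross(a, b):
    """Vector product of two 3-component tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def rk4_step(s, gamma, b, dt):
    """One Runge-Kutta step for dS/dt = gamma * S x B (equation A-6)."""
    def rate(v):
        return tuple(gamma * c for c in cross(v, b))
    k1 = rate(s)
    k2 = rate(tuple(s[i] + 0.5 * dt * k1[i] for i in range(3)))
    k3 = rate(tuple(s[i] + 0.5 * dt * k2[i] for i in range(3)))
    k4 = rate(tuple(s[i] + dt * k3[i] for i in range(3)))
    return tuple(s[i] + dt * (k1[i] + 2 * k2[i] + 2 * k3[i] + k4[i]) / 6
                 for i in range(3))

def precess(s0, gamma, b, t_total, n_steps=2000):
    """Integrate the motion of S over a duration t_total."""
    s = s0
    dt = t_total / n_steps
    for _ in range(n_steps):
        s = rk4_step(s, gamma, b, dt)
    return s

# Illustrative values in arbitrary units (assumptions, not data from the text).
gamma = -1.0                  # negative, as for the silver atom
b_field = (0.0, 0.0, 2.0)     # uniform field along Oz
theta = 0.7                   # initial angle between S and B
s0 = (math.sin(theta), 0.0, math.cos(theta))

period = 2 * math.pi / (abs(gamma) * 2.0)
s_quarter = precess(s0, gamma, b_field, period / 4)
s_full = precess(s0, gamma, b_field, period)
```

In this sketch the component of $\boldsymbol{\mathcal{S}}$ along the field stays fixed while the transverse components rotate at the angular velocity $|\gamma| B$, so that their average over one period vanishes.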


To calculate the force $\mathbf{F}$ [formula (A-3)], we can, to a very good approximation,
neglect in W the terms proportional to $\mathcal{M}_x$ and $\mathcal{M}_y$ and take $\mathcal{M}_z$ to be
constant. This is because the frequency of oscillation due to the rotation of $\boldsymbol{\mathcal{M}}$
is so great that only the time-averaged values of $\mathcal{M}_x$ and $\mathcal{M}_y$ can play a role in W,
and these are both zero. Consequently, it is as if the atom were submitted to the
sole force:

$$\mathbf{F}' = \boldsymbol{\nabla}(\mathcal{M}_z B_z) = \mathcal{M}_z\, \boldsymbol{\nabla} B_z$$    (A-7)

In addition, the components of $\boldsymbol{\nabla} B_z$ along Ox and Oy are zero: $\partial B_z/\partial y = 0$
because the magnetic field is independent of y (§a above), and $\partial B_z/\partial x = 0$ at all
points of the plane of symmetry yOz. The force on the atom is therefore parallel
to Oz and proportional to $\mathcal{M}_z$. Since it is this force which produces the deflection HN
of the atom (fig. 1), HN is proportional to $\mathcal{M}_z$ (and hence, to $\mathcal{S}_z$). Consequently,
measuring HN is equivalent to measuring $\mathcal{M}_z$ or $\mathcal{S}_z$.

Since, at the entrance to the air-gap, the moments of the various atoms of the
beam are distributed isotropically (all values of $\mathcal{M}_z$ included between $|\boldsymbol{\mathcal{M}}|$ and
$-|\boldsymbol{\mathcal{M}}|$ are found), we expect the beam to form a single pattern, symmetrical with
respect to H, on the plate P. The upper bound $N_1$ and the lower bound $N_2$ of this
pattern correspond in principle to the maximum value $|\boldsymbol{\mathcal{M}}|$ and minimum
value $-|\boldsymbol{\mathcal{M}}|$ of $\mathcal{M}_z$. In fact, the dispersion of the velocities and the finite width
of the slit F cause the atoms having a given value of $\mathcal{M}_z$ to condense, not at the
same point, but in a spot centered about the deflection corresponding to the
average velocity.
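
To get a feeling for the size of the deflection produced by the force (A-7), one can evaluate it for a constant acceleration along Oz during the transit through the magnet. This is a rough sketch only: the gradient, mass and velocity are the orders of magnitude quoted in this section, while the value of $\mathcal{M}_z$ (one Bohr magneton) and the magnet length (10 cm) are assumptions introduced here for illustration and do not come from the text.

```python
# Orders of magnitude quoted in this section.
GRADIENT = 1.0e3       # T/m, i.e. 10^5 gauss/cm
M_SILVER = 1.8e-25     # kg, mass of a silver atom
VELOCITY = 500.0       # m/s, typical velocity along Oy

# Assumptions made for this illustration only.
MU_Z = 9.274e-24       # J/T, one Bohr magneton taken as the value of M_z
MAGNET_LENGTH = 0.10   # m, hypothetical length of the air-gap along Oy

def deflection_inside_magnet(mu_z, gradient, mass, velocity, length):
    """Displacement along Oz under the constant force F'_z = mu_z * dBz/dz."""
    force = mu_z * gradient
    transit_time = length / velocity
    return 0.5 * (force / mass) * transit_time ** 2

z_up = deflection_inside_magnet(MU_Z, GRADIENT, M_SILVER, VELOCITY, MAGNET_LENGTH)
z_down = deflection_inside_magnet(-MU_Z, GRADIENT, M_SILVER, VELOCITY, MAGNET_LENGTH)
```

With these assumed values the displacement is of the order of a millimeter in each direction, and it changes sign with $\mathcal{M}_z$, which is the proportionality between HN and $\mathcal{M}_z$ used above.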

c. RESULTS AND CONCLUSIONS 

The results of the experiment (performed for the first time in 1922 by Stern
and Gerlach) are in complete contradiction with the preceding predictions.

We do not observe a single spot centered at H, but two spots (fig. 3) centered
at the points $N_1$ and $N_2$, symmetrical with respect to H (the width of these two
spots corresponds to the effect of the dispersion of the velocities and of the width
of the slit F). The predictions of classical mechanics are therefore shown to be
invalidated by the experiment.



FIGURE 3

Spots observed on the plate P in the Stern-Gerlach experiment. The magnetic
moments $\boldsymbol{\mathcal{M}}$ of the atoms emitted from the furnace E are distributed randomly
in all directions of space, so classical mechanics predicts that a measurement
of $\mathcal{M}_z$ can yield with equal probability all values included between
$+|\boldsymbol{\mathcal{M}}|$ and $-|\boldsymbol{\mathcal{M}}|$. One should therefore observe only one spot (dashed
lines in the figure). In reality, the result of the experiment is completely different:
two spots, centered at $N_1$ and $N_2$, are observed. This means that a
measurement of $\mathcal{M}_z$ can yield only two possible results (quantization of the
measurement result).


Now let us see how the preceding results can be interpreted. Of the physical 
quantities associated with a silver atom, some correspond to its external degrees 
of freedom (that is, are functions of its position r and its linear momentum p), 
and others, to its internal degrees of freedom (also called spin degrees of freedom) 
$\boldsymbol{\mathcal{M}}$ or $\boldsymbol{\mathcal{S}}$.

Let us first show that, under these experimental conditions, it is not necessary
to treat the external degrees of freedom quantum mechanically. To do this, we
shall verify that it is possible, in order to describe the motion of the silver atoms,
to construct wave packets whose width $\Delta z$ and momentum dispersion $\Delta p_z$ are
completely negligible. $\Delta z$ and $\Delta p_z$ must satisfy the uncertainty relation:

$$\Delta z \cdot \Delta p_z \gtrsim \hbar$$    (A-8)

Numerically, the mass M of a silver atom is equal to $1.8 \times 10^{-25}$ kg. $\Delta z$ and the
velocity uncertainty $\Delta v_z = \Delta p_z/M$ must be such that:

$$\Delta z \cdot \Delta v_z \gtrsim \frac{\hbar}{M} \simeq 10^{-9}\ \text{M.K.S.A.}$$    (A-9)

Now what are the lengths and velocities involved in the problem? The width of the
slit F is equal to about 0.1 mm and the separation $N_1 N_2$ of the two spots, several
millimeters. The distance over which the magnetic field varies appreciably can be
deduced from the values of the field in the middle of the air-gap ($B \simeq 10^4$ gauss)
and its gradient ($\partial B/\partial z \simeq 10^5$ gauss/cm), which yields $B/(\partial B/\partial z) \simeq 1$ mm. In addition,
the velocity of the silver atoms leaving a furnace at an absolute temperature of
1 000 K is of the order of 500 m/s. However well-defined the beam is, the dispersion
of the velocities along Oz is not much less than several meters per second. It is
then easy to find uncertainties $\Delta z$ and $\Delta v_z$ which, while satisfying (A-9), are
negligible on the scale of the experiment being considered. As far as the external
variables $\mathbf{r}$ and $\mathbf{p}$ of each atom are concerned, it is therefore not necessary to
resort to quantum mechanics in this case. It is possible to reason in terms of
quasi-pointlike wave packets moving along classical trajectories. Consequently, it is
correct to claim that measurement of the deflection HN constitutes a measurement
of $\mathcal{M}_z$ or $\mathcal{S}_z$.
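
The numerical argument around (A-9) can be made explicit with a few lines of arithmetic. The sketch below uses the value of $\hbar$ from the table of constants and the mass quoted above; the particular choice of $\Delta z$ and $\Delta v_z$ (one micron and one millimeter per second) is a hypothetical example chosen here, not a value given in the text.

```python
HBAR = 1.054589e-34    # J s
M_SILVER = 1.8e-25     # kg, mass of a silver atom

# Right-hand side of (A-9), in m^2/s.
bound = HBAR / M_SILVER

# Scales of the experiment quoted in the text.
SLIT_WIDTH = 1.0e-4        # m (0.1 mm)
VELOCITY_SPREAD = 1.0      # m/s (order of magnitude, "several meters per second")

# Hypothetical wave packet: 1 micron wide, velocity spread of 1 mm/s.
delta_z = 1.0e-6
delta_v = 1.0e-3
product = delta_z * delta_v

satisfies_uncertainty = product >= bound
negligible = delta_z < SLIT_WIDTH / 10 and delta_v < VELOCITY_SPREAD / 10
```

Here `bound` comes out close to $6 \times 10^{-10}$ m²/s, so a wave packet a hundred times narrower than the slit, with a velocity spread a thousand times smaller than that of the beam, is still compatible with (A-9).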

The results of the experiment thus lead us necessarily to the following
conclusion: if we measure the component $\mathcal{S}_z$ of the intrinsic angular momentum
of a silver atom in its ground state, we can find only one or the other of two values
corresponding to the deflections $HN_1$ and $HN_2$. We are therefore obliged to
reject the classical image of a vector $\boldsymbol{\mathcal{S}}$ whose angle $\theta$ with the magnetic field can
take on any value: $\mathcal{S}_z$ is a quantized physical quantity whose discrete spectrum
includes only two eigenvalues. When we study the quantum theory of angular
momentum (chap. VI), we shall see that these eigenvalues are $+\hbar/2$ and $-\hbar/2$;
we shall assume this here and say that the spin of the silver atom in its ground state
is 1/2.


2. Theoretical description 

We are now going to show how quantum mechanics describes the degrees 
of freedom of a silver atom, that is, of a spin 1/2 particle. 

We do not yet possess all the necessary elements for the presentation of 
a deductive and rigorous theory of the spin 1/2 particle. Such a study will be 
developed in chapter IX, in the framework of the general theory of angular 
momentum. We shall therefore be forced here to assume without proof a small 
number of results which will be proved later, in chapter IX. Such a point of view 
is justified by the fact that the essential goal of the present chapter is to show the 
reader how to handle the quantum mechanical formalism in a simple and concrete 
case, and not to stress the angular momentum aspect of the spin 1/2. The idea is to 
give precise examples of kets and observables, to show how physical predictions 
can be extracted from them and how to distinguish clearly between the various 
stages of an experiment (preparation, evolution, measurement). 

We saw in chapter III that with every measurable physical quantity must be
associated, in quantum mechanics, an observable, that is, a Hermitian operator
whose eigenvectors can form a basis in the state space. We must therefore define
the state space and the observables corresponding to the components of $\boldsymbol{\mathcal{S}}$
($\mathcal{S}_x$, $\mathcal{S}_y$, $\mathcal{S}_z$ and, more generally, $\mathcal{S}_u = \boldsymbol{\mathcal{S}} \cdot \mathbf{u}$, where $\mathbf{u}$ is an arbitrary unit vector),
which we know from § 1 to be measurable.


a. THE OBSERVABLE $S_z$ AND THE SPIN STATE SPACE


With $\mathcal{S}_z$ we must associate an observable $S_z$ which has, according to the
results of the experiment described in § 1 above, two eigenvalues, $+\hbar/2$ and
$-\hbar/2$. We shall assume (see chap. IX) that these two eigenvalues are not degenerate,
and we shall denote by $|+\rangle$ and $|-\rangle$ the corresponding orthonormal eigenvectors:

$$S_z\,|+\rangle = +\frac{\hbar}{2}\,|+\rangle \qquad S_z\,|-\rangle = -\frac{\hbar}{2}\,|-\rangle$$    (A-10)

$$\langle +|+\rangle = \langle -|-\rangle = 1 \qquad \langle +|-\rangle = 0$$    (A-11)

$S_z$ alone therefore forms a C.S.C.O., and the spin state space is the two-dimensional
space $\mathcal{E}_s$ spanned by its eigenvectors $|+\rangle$ and $|-\rangle$. The fact that these
eigenvectors constitute a basis of $\mathcal{E}_s$ is expressed by the closure relation:

$$|+\rangle\langle +| + |-\rangle\langle -| = \mathbb{1}$$    (A-12)

The most general (normalized) vector of $\mathcal{E}_s$ is a linear superposition of $|+\rangle$
and $|-\rangle$:

$$|\psi\rangle = \alpha\,|+\rangle + \beta\,|-\rangle$$    (A-13)

with:

$$|\alpha|^2 + |\beta|^2 = 1$$    (A-14)

In the $\{\,|+\rangle, |-\rangle\,\}$ basis, the matrix representing $S_z$ is obviously diagonal
and is written:

$$(S_z) = \frac{\hbar}{2}\begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}$$    (A-15)


b. THE OTHER SPIN OBSERVABLES 

With the $\mathcal{S}_x$ and $\mathcal{S}_y$ components of $\boldsymbol{\mathcal{S}}$ will be associated the observables $S_x$
and $S_y$. The operators $S_x$ and $S_y$ must be represented in the $\{\,|+\rangle, |-\rangle\,\}$ basis
by 2 x 2 Hermitian matrices.

We shall see in chapter VI that in quantum mechanics, the three components
of an angular momentum do not commute with each other but satisfy well-defined
commutation relations. This will enable us to show that, in the case of
a spin 1/2, with which we are concerned here, the matrices representing $S_x$ and $S_y$
in the basis of the eigenvectors $|+\rangle$ and $|-\rangle$ of $S_z$ are the following:

$$(S_x) = \frac{\hbar}{2}\begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}$$    (A-16)

$$(S_y) = \frac{\hbar}{2}\begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}$$    (A-17)


For the moment, we shall assume this result. 

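As for the $\mathcal{S}_u$ component of $\boldsymbol{\mathcal{S}}$ along the unit vector $\mathbf{u}$, characterized by the
polar angles $\theta$ and $\varphi$ (fig. 4), it is written:

$$\mathcal{S}_u = \boldsymbol{\mathcal{S}} \cdot \mathbf{u} = \mathcal{S}_x \sin\theta\cos\varphi + \mathcal{S}_y \sin\theta\sin\varphi + \mathcal{S}_z \cos\theta$$    (A-18)


FIGURE 4

Definition of the polar angles $\theta$ and $\varphi$ characterizing a unit
vector $\mathbf{u}$.


Using (A-15), (A-16) and (A-17), we easily find the matrix which represents the
corresponding observable $S_u = \mathbf{S} \cdot \mathbf{u}$ in the $\{\,|+\rangle, |-\rangle\,\}$ basis:

$$(S_u) = (S_x)\sin\theta\cos\varphi + (S_y)\sin\theta\sin\varphi + (S_z)\cos\theta = \frac{\hbar}{2}\begin{pmatrix} \cos\theta & \sin\theta\, e^{-i\varphi} \\ \sin\theta\, e^{i\varphi} & -\cos\theta \end{pmatrix}$$    (A-19)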

In what follows, we shall need to know the eigenvalues and eigenvectors of
the observables $S_x$, $S_y$ and $S_u$. The calculations which enable us to obtain them
from the matrices (A-16), (A-17) and (A-19) do not present any difficulty. We shall
simply present the results here.

The $S_x$, $S_y$ and $S_u$ operators have the same eigenvalues, $+\hbar/2$ and $-\hbar/2$,
as $S_z$. This result could have been expected, since it is always possible to rotate
the Stern-Gerlach apparatus as a whole so as to make the axis defined by the
magnetic field parallel either to Ox, to Oy, or to $\mathbf{u}$. Since all directions of space have
the same properties, the phenomena observed on the plate of the apparatus must
be unchanged under such rotations: the measurement of $\mathcal{S}_x$, $\mathcal{S}_y$ or $\mathcal{S}_u$ can therefore
yield only one of two results: $+\hbar/2$ or $-\hbar/2$.

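As for the eigenvectors of $S_x$, $S_y$ and $S_u$, we shall denote them respectively
by $|\pm\rangle_x$, $|\pm\rangle_y$ and $|\pm\rangle_u$ (the sign in the ket is that of the corresponding eigenvalue).
Their expansions on the basis of eigenvectors $|\pm\rangle$ of $S_z$ are written:

$$|\pm\rangle_x = \frac{1}{\sqrt{2}}\left[\,|+\rangle \pm |-\rangle\,\right]$$    (A-20)

$$|\pm\rangle_y = \frac{1}{\sqrt{2}}\left[\,|+\rangle \pm i\,|-\rangle\,\right]$$    (A-21)

$$|+\rangle_u = \cos\frac{\theta}{2}\, e^{-i\varphi/2}\,|+\rangle + \sin\frac{\theta}{2}\, e^{i\varphi/2}\,|-\rangle$$    (A-22-a)

$$|-\rangle_u = -\sin\frac{\theta}{2}\, e^{-i\varphi/2}\,|+\rangle + \cos\frac{\theta}{2}\, e^{i\varphi/2}\,|-\rangle$$    (A-22-b)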
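
The results (A-22-a) and (A-22-b) are stated without proof; they can be checked by applying the matrix (A-19) to the two column vectors. The following is a minimal sketch of such a check, working in units of $\hbar/2$ for the matrix; the function names and the test angles are illustrative choices, not notation from the text.

```python
import cmath
import math

def s_u_matrix(theta, phi):
    """Matrix (A-19) of S_u in the {|+>, |->} basis, in units of hbar/2."""
    return [[math.cos(theta), math.sin(theta) * cmath.exp(-1j * phi)],
            [math.sin(theta) * cmath.exp(1j * phi), -math.cos(theta)]]

def ket_plus_u(theta, phi):
    """Components of |+>_u on |+> and |->, formula (A-22-a)."""
    return [math.cos(theta / 2) * cmath.exp(-1j * phi / 2),
            math.sin(theta / 2) * cmath.exp(1j * phi / 2)]

def ket_minus_u(theta, phi):
    """Components of |->_u on |+> and |->, formula (A-22-b)."""
    return [-math.sin(theta / 2) * cmath.exp(-1j * phi / 2),
            math.cos(theta / 2) * cmath.exp(1j * phi / 2)]

def apply(matrix, ket):
    """Product of a 2 x 2 matrix by a column vector."""
    return [sum(matrix[i][j] * ket[j] for j in range(2)) for i in range(2)]

def inner(bra_ket, ket):
    """Scalar product <bra_ket|ket> (the first argument is conjugated)."""
    return sum(b.conjugate() * k for b, k in zip(bra_ket, ket))

theta, phi = 1.1, 0.4          # arbitrary test angles
plus = ket_plus_u(theta, phi)
minus = ket_minus_u(theta, phi)
s_u = s_u_matrix(theta, phi)

image_plus = apply(s_u, plus)    # expected: +1 times plus
image_minus = apply(s_u, minus)  # expected: -1 times minus
```

Setting $\theta = \pi/2$, $\varphi = 0$ in `ket_plus_u` gives back (A-20); setting $\theta = \pi/2$, $\varphi = \pi/2$ gives (A-21) up to a global phase factor.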


B. ILLUSTRATION OF THE POSTULATES 
IN THE CASE OF A SPIN 1/2 


Using the formalism which we have just described, we are now going to apply 
the postulates of quantum mechanics to a certain number of experiments on silver 
atoms which can actually be performed with the Stern-Gerlach apparatus. We shall 
thus be able to discuss the consequences of these postulates in a concrete case. 


1. Actual preparation of the various spin states 

In order to make predictions about the result of a measurement, we must 
know the state of the system (here, the spin of a silver atom) immediately before 
the measurement. We are going to see how to prepare a beam of silver atoms so 
that they are all in a given spin state. 


a. PREPARATION OF THE STATES $|+\rangle$ AND $|-\rangle$

Let us assume that we pierce a hole in the plate P of the apparatus represented
in figure 1-a, at the position of the spot centered at $N_1$ (fig. 3). The atoms which
are deflected downward continue to condense about $N_2$, while some of those which
are deflected upward pass through the plate P (fig. 5). Each of the atoms of the
beam which propagates to the right of the plate is a physical system on which we
have just performed a measurement of the observable $S_z$, the result being $+\hbar/2$.
According to the fifth postulate of chapter III, this atom is in the eigenstate which
corresponds to this result, that is, in the state $|+\rangle$ (since $S_z$ alone constitutes a
C.S.C.O., the measurement result suffices to determine the state of the system
after this measurement). The device in figure 5 thus produces a beam of atoms
which are all in the spin state $|+\rangle$. This device acts like an “atomic polarizer”
since it acts the same way on atoms as an ordinary polarizer does on photons.

Of course, if we pierced the plate around $N_2$ and not around $N_1$, we would
obtain a beam all of whose atoms would be in the spin state $|-\rangle$.




FIGURE 5

When we pierce a hole in the plate P at the position of the spot $N_1$, the atoms which
pass through this hole are all in the spin state $|+\rangle$. The Stern-Gerlach apparatus is then acting like a
polarizer.


b. PREPARATION OF THE STATES $|\pm\rangle_x$, $|\pm\rangle_y$, $|\pm\rangle_u$

The observable $S_x$ also constitutes a C.S.C.O. since none of its eigenvalues
is degenerate. To prepare one of its eigenstates, we must simply select, after a
measurement of $S_x$, the atoms for which this measurement has yielded the
corresponding eigenvalue. In practice, if we rotate the apparatus of figure 5
through an angle of $+\pi/2$ about Oy, we obtain a beam of atoms whose spin state
is $|+\rangle_x$ (fig. 6).


FIGURE 6

When the apparatus of figure 5 is rotated through 90° about Oy, we obtain a polarizer which prepares
atoms in the spin state $|+\rangle_x$.


This method can immediately be generalized: by placing the Stern-Gerlach
apparatus so that the axis of the magnetic field is parallel to an arbitrary unit
vector $\mathbf{u}$, and piercing the plate either at $N_1$ or at $N_2$, we can prepare silver atoms
in the spin state $|+\rangle_u$ or $|-\rangle_u$*.


c. PREPARATION OF THE MOST GENERAL STATE 


We indicated above that the most general (normalized) ket of the spin state
space is of the form:

$$|\psi\rangle = \alpha\,|+\rangle + \beta\,|-\rangle$$    (B-1)

with:

$$|\alpha|^2 + |\beta|^2 = 1$$    (B-2)

Is it possible to prepare atoms whose spin state is described by the corresponding
ket $|\psi\rangle$?

We are going to show that there exists, for all $|\psi\rangle$, a unit vector $\mathbf{u}$ such
that $|\psi\rangle$ is collinear with the ket $|+\rangle_u$. We therefore choose two complex
numbers $\alpha$ and $\beta$ which satisfy relation (B-2) but which are arbitrary in every other
respect. Taking (B-2) into account, we find that there necessarily exists an angle $\theta$
such that:

$$\cos\frac{\theta}{2} = |\alpha| \qquad \sin\frac{\theta}{2} = |\beta|$$    (B-3)

If, in addition, we impose:

$$0 \le \theta \le \pi$$    (B-4)

the equation $\tan\dfrac{\theta}{2} = \left|\dfrac{\beta}{\alpha}\right|$ determines $\theta$ uniquely. We already know that only the
difference of the phases of $\alpha$ and $\beta$ enters into the physical predictions. Let us
therefore set:

$$\varphi = \operatorname{Arg}\beta - \operatorname{Arg}\alpha$$    (B-5)

$$\chi = \operatorname{Arg}\beta + \operatorname{Arg}\alpha$$    (B-6)

We thus have:

$$\operatorname{Arg}\beta = \frac{1}{2}\chi + \frac{1}{2}\varphi \qquad \operatorname{Arg}\alpha = \frac{1}{2}\chi - \frac{1}{2}\varphi$$    (B-7)


* The direction of the atomic beam is no longer necessarily Oy, but this is not important in what
concerns us here.




With this notation, the ket $|\psi\rangle$ can be written:

$$|\psi\rangle = e^{i\chi/2}\left[\cos\frac{\theta}{2}\, e^{-i\varphi/2}\,|+\rangle + \sin\frac{\theta}{2}\, e^{i\varphi/2}\,|-\rangle\right]$$    (B-8)

If we compare this expression with formula (A-22-a), we see that $|\psi\rangle$ differs from
the ket $|+\rangle_u$ (which corresponds to the unit vector $\mathbf{u}$ characterized by $\theta$ and $\varphi$)
only by the phase factor $e^{i\chi/2}$, which has no physical significance.

Consequently, to prepare silver atoms in the state $|\psi\rangle$, it suffices to place
the Stern-Gerlach apparatus (with its plate pierced at $N_1$) so that its axis is
directed along the vector $\mathbf{u}$ whose polar angles are determined from $\alpha$ and $\beta$ by (B-3)
and (B-5).
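
The recipe given by (B-3) to (B-8) translates directly into a short computation. The sketch below, whose function names and sample coefficients are illustrative assumptions, extracts $\theta$, $\varphi$ and $\chi$ from a normalized pair $(\alpha, \beta)$ and then rebuilds the ket from (B-8) to confirm that nothing has been lost.

```python
import cmath
import math

def polarizer_angles(alpha, beta):
    """Angles theta, phi, chi defined by (B-3), (B-4), (B-5) and (B-6).

    alpha and beta are assumed to satisfy |alpha|^2 + |beta|^2 = 1.
    """
    theta = 2 * math.atan2(abs(beta), abs(alpha))   # lies in [0, pi]
    phi = cmath.phase(beta) - cmath.phase(alpha)
    chi = cmath.phase(beta) + cmath.phase(alpha)
    return theta, phi, chi

def ket_from_angles(theta, phi, chi):
    """Components (alpha, beta) rebuilt from formula (B-8)."""
    global_phase = cmath.exp(1j * chi / 2)
    alpha = global_phase * math.cos(theta / 2) * cmath.exp(-1j * phi / 2)
    beta = global_phase * math.sin(theta / 2) * cmath.exp(1j * phi / 2)
    return alpha, beta

# Sample normalized state (0.6^2 + 0.8^2 = 1), chosen for illustration.
alpha = 0.6 * cmath.exp(0.3j)
beta = 0.8 * cmath.exp(-1.2j)

theta, phi, chi = polarizer_angles(alpha, beta)
alpha_back, beta_back = ket_from_angles(theta, phi, chi)
```

Only `theta` and `phi` fix the orientation of the apparatus; `chi` enters solely through the global phase factor of (B-8).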


2. Spin measurements 


We saw in §A that a Stern-Gerlach apparatus enables us to measure the
component of the angular momentum $\boldsymbol{\mathcal{S}}$ of silver atoms along a given axis. We
have just pointed out, in § B-1, that an apparatus of the same type can be used to
prepare an atomic beam in a given spin state. Consequently, if we place two
Stern-Gerlach magnets one after the other, we can verify experimentally the
predictions of the postulates. The first apparatus acts like a “polarizer”: the
beam which comes out of it is composed of a large number of silver atoms all
in the same spin state. This beam then enters the second apparatus, which is used
to measure a specified component of the angular momentum $\boldsymbol{\mathcal{S}}$; this is, as it were,
the “analyzer” (note the analogy with the optical experiment described in §A-3 of
chapter I). We shall assume in this section that the spin state of the atoms of the
beam does not evolve between the time they leave the “polarizer” and the time
they enter the “analyzer”, that is, between the preparation and the measurement.
It would be easy to forgo this hypothesis, by using the Schrödinger equation to
determine the spin evolution between the moment of preparation and the moment
of measurement.


a. FIRST EXPERIMENT 

Let us choose the axes of the two apparatuses parallel to Oz (fig. 7). The
first one prepares the atoms in the state $|+\rangle$ and the second one measures $\mathcal{S}_z$.
What is observed on the plate of the second apparatus?

Since the state of the system under study is an eigenstate of the observable $S_z$
which we want to measure, the postulates indicate that the measurement result
is certain: we find, without fail, the corresponding eigenvalue ($+\hbar/2$). Consequently,
all the atoms of the beam must condense into a single spot on the plate of the
second apparatus, at the position corresponding to $+\hbar/2$.

This is indeed what is observed experimentally: all the atoms strike the
second plate in the vicinity of $N_1$, none hitting near $N_2$.


FIGURE 7

The first apparatus (a “source” composed of the furnace $E_1$ and the slit $F_1$, plus a “polarizer”
formed by the magnet $A_1$ and the pierced plate $P_1$) prepares the atoms in the state $|+\rangle$. The second
one (an “analyzer” composed of the magnet $A_2$ and the plate $P_2$) measures the component $\mathcal{S}_z$. The
result obtained is certain ($+\hbar/2$).


b. SECOND EXPERIMENT 

Now let us place the axis of the first apparatus along the unit vector $\mathbf{u}$, with
polar angles $\theta$, $\varphi = 0$ ($\mathbf{u}$ is therefore contained in the xOz plane). The axis of the
second apparatus remains parallel to Oz (fig. 8). According to (A-22-a), the spin
state of the atoms when they leave the “polarizer” is:

$$|\psi\rangle = \cos\frac{\theta}{2}\,|+\rangle + \sin\frac{\theta}{2}\,|-\rangle$$    (B-9)


FIGURE 8

The first apparatus prepares the spins in the state $|+\rangle_u$ ($\mathbf{u}$ is the unit vector of the xOz plane which
makes an angle $\theta$ with Oz). The second one measures the $\mathcal{S}_z$ component. The possible results are
$+\hbar/2$ (probability $\cos^2\theta/2$) and $-\hbar/2$ (probability $\sin^2\theta/2$).


The “analyzer” measures $\mathcal{S}_z$ on these atoms. What are the results?

This time, we find that certain atoms condense at $N_1$ and others, at $N_2$, although
they have all been prepared in the same way: there is an indeterminacy in the
behavior of each of the atoms taken individually. The postulate of spectral
decomposition merely enables us to predict the probability of each atom’s appearance
at $N_1$ or $N_2$. Since (B-9) gives the expansion of the spin state of an atom in terms
of the eigenstates of the observable being measured, we can calculate directly that
these probabilities are, respectively, $\cos^2\theta/2$ and $\sin^2\theta/2$. Thus, when enough atoms
have condensed on the plate, we observe that the intensity of the spots at $N_1$ and $N_2$
corresponds to numbers of atoms which are proportional, respectively, to $\cos^2\theta/2$
and $\sin^2\theta/2$.

comment : 

For any value of the angle $\theta$ (except exactly 0 or $\pi$), it is therefore always
possible to find the two results $+\hbar/2$ and $-\hbar/2$ in a measurement of $\mathcal{S}_z$.
This prediction may seem to a certain extent paradoxical. For example, if $\theta$
is very small, the spin at the exit of the first apparatus points in a direction
which is practically Oz, and yet one can find $-\hbar/2$ as well as $+\hbar/2$ in a
measurement of $\mathcal{S}_z$.
Nevertheless, the smaller $\theta$, the smaller the probability of finding $-\hbar/2$.
Moreover, we shall see later [formula (B-11)] that the mean value of the
results which would be obtained in a large number of identical experiments
is $\langle S_z \rangle = \dfrac{\hbar}{2}\cos\theta$, which corresponds to the classical result.
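
The statistical character of the second experiment can be imitated with a pseudo-random simulation. The sketch below is purely illustrative: each atom is sent to $N_1$ with probability $\cos^2\theta/2$ and to $N_2$ otherwise, using Python's `random` module with a fixed seed; the angle, the number of atoms and the function name are assumptions made for the example.

```python
import math
import random

def simulate_sz_measurements(theta, n_atoms, seed=0):
    """Simulate n_atoms measurements of S_z on atoms prepared in state (B-9).

    Returns the number of atoms found at N1 (+hbar/2), at N2 (-hbar/2),
    and the mean of the results in units of hbar.
    """
    rng = random.Random(seed)
    p_plus = math.cos(theta / 2) ** 2
    n_plus = sum(1 for _ in range(n_atoms) if rng.random() < p_plus)
    n_minus = n_atoms - n_plus
    mean_sz = 0.5 * (n_plus - n_minus) / n_atoms
    return n_plus, n_minus, mean_sz

theta = 0.3            # small angle: the spin points almost along Oz
n_atoms = 100_000
n_plus, n_minus, mean_sz = simulate_sz_measurements(theta, n_atoms)
```

Even for this small angle a few percent of the simulated atoms land at $N_2$, while the mean of the results stays close to $\frac{1}{2}\cos\theta$ in units of $\hbar$.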


c. THIRD EXPERIMENT


Let us take a “polarizer” positioned as in §2-b, so as to prepare atoms in
the state (B-9), and let us rotate the “analyzer” until its axis is directed
along Ox, so that it measures the $\mathcal{S}_x$ component of the angular momentum.

To calculate the predictions of the postulates in this case, we must expand
the state (B-9) in terms of the eigenstates of the observable $S_x$ [formula (A-20)]. We
easily find:

$${}_x\langle +|\psi\rangle = \frac{1}{\sqrt{2}}\left(\cos\frac{\theta}{2} + \sin\frac{\theta}{2}\right) = \cos\left(\frac{\pi}{4} - \frac{\theta}{2}\right)$$

$${}_x\langle -|\psi\rangle = \frac{1}{\sqrt{2}}\left(\cos\frac{\theta}{2} - \sin\frac{\theta}{2}\right) = \sin\left(\frac{\pi}{4} - \frac{\theta}{2}\right)$$    (B-10)

The probability of finding the eigenvalue $+\hbar/2$ of $S_x$ is therefore $\cos^2\left(\dfrac{\pi}{4} - \dfrac{\theta}{2}\right)$,
and that of finding $-\hbar/2$, $\sin^2\left(\dfrac{\pi}{4} - \dfrac{\theta}{2}\right)$.

It is possible to verify these predictions by measuring the intensity of the
two spots on the plate situated at the exit of the second Stern-Gerlach apparatus.


comment :

The fact that it is $\left(\dfrac{\pi}{2} - \theta\right)$ that enters in here is not at all surprising:
in § 2-b, the angle between the axes of the two apparatuses was $\theta$; it
became $\left(\dfrac{\pi}{2} - \theta\right)$ after rotation of the second apparatus.
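
The expansion (B-10) is simple enough to be verified numerically. The following sketch, with illustrative function names and test angle, projects the state (B-9) onto the kets $|\pm\rangle_x$ of (A-20) and compares the resulting probabilities with the closed forms quoted above.

```python
import math

def amplitudes_on_x_basis(theta):
    """Amplitudes x<+|psi> and x<-|psi> for the state (B-9)."""
    a_plus = math.cos(theta / 2)     # component of |psi> on |+>
    a_minus = math.sin(theta / 2)    # component of |psi> on |->
    amp_plus = (a_plus + a_minus) / math.sqrt(2)
    amp_minus = (a_plus - a_minus) / math.sqrt(2)
    return amp_plus, amp_minus

theta = 0.9   # arbitrary test angle
amp_plus, amp_minus = amplitudes_on_x_basis(theta)
prob_plus = amp_plus ** 2
prob_minus = amp_minus ** 2

# Closed forms of (B-10), written with the half-angle of (pi/2 - theta).
expected_plus = math.cos(math.pi / 4 - theta / 2) ** 2
expected_minus = math.sin(math.pi / 4 - theta / 2) ** 2
```

For $\theta = \pi/2$ the polarizer and the analyzer are parallel, and the same function returns a probability equal to 1 for the result $+\hbar/2$.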

d. MEAN VALUES


In the situation of § 2-b, we find experimentally that, of a great number $\mathcal{N}$
of atoms, $\mathcal{N}\cos^2\theta/2$ arrive at $N_1$ and $\mathcal{N}\sin^2\theta/2$, at $N_2$. The measurement of $\mathcal{S}_z$
therefore yields $+\hbar/2$ for each of the first group and $-\hbar/2$ for each of the second.
If we calculate the mean value of these results, we obtain:

$$\langle S_z \rangle = \frac{1}{\mathcal{N}}\left[\frac{\hbar}{2}\,\mathcal{N}\cos^2\frac{\theta}{2} - \frac{\hbar}{2}\,\mathcal{N}\sin^2\frac{\theta}{2}\right] = \frac{\hbar}{2}\cos\theta$$    (B-11)

It is easy to verify from formulas (B-9) and (A-10) that this is indeed the value of
the matrix element $\langle\psi|\,S_z\,|\psi\rangle$.

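Similarly, the average of the measurement results obtained in the experiment
of § 2-c is equal to:

$$\frac{1}{\mathcal{N}}\left[\frac{\hbar}{2}\,\mathcal{N}\cos^2\left(\frac{\pi}{4} - \frac{\theta}{2}\right) - \frac{\hbar}{2}\,\mathcal{N}\sin^2\left(\frac{\pi}{4} - \frac{\theta}{2}\right)\right] = \frac{\hbar}{2}\sin\theta$$    (B-12)

To calculate the matrix element $\langle\psi|\,S_x\,|\psi\rangle$, we can use the matrix (A-16) which
represents $S_x$ in the $\{\,|+\rangle, |-\rangle\,\}$ basis. In this same basis, the ket $|\psi\rangle$ is represented
by the column vector $\begin{pmatrix}\cos\theta/2 \\ \sin\theta/2\end{pmatrix}$, and the bra $\langle\psi|$ by the corresponding
row vector. We therefore have:

$$\langle\psi|\,S_x\,|\psi\rangle = \frac{\hbar}{2}\begin{pmatrix}\cos\theta/2 & \sin\theta/2\end{pmatrix}\begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}\begin{pmatrix}\cos\theta/2 \\ \sin\theta/2\end{pmatrix} = \frac{\hbar}{2}\sin\theta$$    (B-13)

The mean value of $\mathcal{S}_x$ is indeed equal to the matrix element, in the state $|\psi\rangle$,
of the associated observable $S_x$.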

It is interesting to note that if we were dealing with a classical angular
momentum of modulus $\hbar/2$ directed along the axis of the “polarizer”, its components
along Ox and Oz would be precisely $\dfrac{\hbar}{2}\sin\theta$ and $\dfrac{\hbar}{2}\cos\theta$. More generally, if we
calculate [using the same technique as in (B-13)] the mean values of $S_x$, $S_y$ and $S_z$
in the state $|+\rangle_u$ [formula (A-22-a)], we find:

$${}_u\langle +|\,S_x\,|+\rangle_u = \frac{\hbar}{2}\sin\theta\cos\varphi$$

$${}_u\langle +|\,S_y\,|+\rangle_u = \frac{\hbar}{2}\sin\theta\sin\varphi$$

$${}_u\langle +|\,S_z\,|+\rangle_u = \frac{\hbar}{2}\cos\theta$$    (B-14)

These mean values are equal to the components of a classical angular momentum
of modulus $\hbar/2$ oriented along the vector $\mathbf{u}$ whose polar angles are $\theta$ and $\varphi$. Therefore,
we can also establish here a relation between classical mechanics and quantum
mechanics through the mean values. However, we must not lose sight of the fact
that a measurement of $\mathcal{S}_x$, for example, on a given atom will never yield $\dfrac{\hbar}{2}\sin\theta\cos\varphi$:
the only results which can be found are $+\hbar/2$ and $-\hbar/2$. Only in taking the
average of values obtained in a large number of identical measurements (same
state of the system, here $|+\rangle_u$, and same observable measured, here $S_x$) do we
obtain $\dfrac{\hbar}{2}\sin\theta\cos\varphi$.
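
The three mean values (B-14) follow from the matrices (A-15), (A-16), (A-17) and the ket (A-22-a) by the same row-matrix-column product as in (B-13). The sketch below carries out this product numerically, in units of $\hbar$; the function names and the test angles are illustrative assumptions.

```python
import cmath
import math

# Matrices (A-16), (A-17) and (A-15) in units of hbar.
SX = [[0, 0.5], [0.5, 0]]
SY = [[0, -0.5j], [0.5j, 0]]
SZ = [[0.5, 0], [0, -0.5]]

def ket_plus_u(theta, phi):
    """Components of |+>_u on |+> and |->, formula (A-22-a)."""
    return [math.cos(theta / 2) * cmath.exp(-1j * phi / 2),
            math.sin(theta / 2) * cmath.exp(1j * phi / 2)]

def expectation(op, ket):
    """Matrix element <ket| op |ket> for a 2 x 2 Hermitian matrix op."""
    op_ket = [sum(op[i][j] * ket[j] for j in range(2)) for i in range(2)]
    value = sum(ket[i].conjugate() * op_ket[i] for i in range(2))
    return value.real   # the imaginary part vanishes for a Hermitian matrix

theta, phi = 1.2, 2.5   # arbitrary polar angles of u
state = ket_plus_u(theta, phi)

mean_sx = expectation(SX, state)
mean_sy = expectation(SY, state)
mean_sz = expectation(SZ, state)
```

The three numbers obtained are the Cartesian components of a vector of length 1/2 pointing along $\mathbf{u}$, although each individual measurement can only give $\pm 1/2$ in these units.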

comment: 

It is useful to consider again at this stage the problem of external degrees of 
freedom (position, momentum). 

When a silver atom enters the second Stern-Gerlach apparatus in the spin
state $|\psi\rangle$ given by (B-9), we have just seen that it is impossible to predict with certainty
whether it will condense at $N_1$ or $N_2$. It seems difficult to reconcile this indeterminacy
with the idea of a perfectly well-determined classical trajectory, given the initial state
of the system.

In fact, this is not a real paradox. To say that the external degrees of freedom can
be treated classically means only that it is possible to form wave packets which are much
smaller than all the dimensions of the problem. It does not necessarily mean, as we shall
see, that the particle itself follows a classical trajectory.

Let us first consider a silver atom which enters the apparatus in the initial spin
state $|+\rangle$. The wave function which describes the external degrees of freedom of this
particle is a wave packet whose spread is very small and whose center follows the
classical trajectory of figure 9-a. Similarly, if the silver atom enters with the spin state $|-\rangle$,
the center of the wave packet associated with it follows the classical trajectory of figure 9-b.


FIGURE 9

When the spin is in the state $|+\rangle$ (fig. a) or $|-\rangle$ (fig. b), the center of the wave packet follows a
well-defined trajectory which can be calculated classically. When the spin state is a linear superposition of
$|+\rangle$ and $|-\rangle$, the wave packet splits into two parts and it is no longer possible to say that the atom
follows a classical trajectory (despite the fact that the spread of each of the packets is much smaller
than the characteristic dimensions of the problem).


If we now consider an atom which enters with the spin state $|\psi\rangle$ of formula (B-9),
the corresponding initial state is a perfectly well-defined linear superposition of the two
preceding initial states. Since the Schrödinger equation is linear, the wave function of the
particle at a subsequent instant (fig. 9-c) is a linear superposition of the two wave packets
of figures 9-a and 9-b. The particle therefore has a certain probability amplitude of being
in one or the other of these two wave packets. We see that it does not follow a classical
trajectory at all, unlike what happens to the centers of the two wave packets. Upon
arrival on the screen, the wave function has non-zero values in two different regions,
each very localized, around the points $N_1$ and $N_2$. The particle can therefore appear
either near $N_1$ or near $N_2$, and we cannot predict with certainty at which of these two
points it will arrive.

Note that the two wave packets of figure 9-c do not represent two different
particles; they represent only one particle, whose wave function has two parts, each of
which is very localized about a different point. The two wave packets, moreover,
have a well-defined phase relation because they arise from the same initial wave packet,
split into two under the influence of the gradient of $\mathbf{B}$. We could recombine them to form
one wave packet again by removing the screen (that is, by not performing the measurement)
and by submitting them to a new field gradient, whose sign would be the opposite
of the first one.


3. Evolution of a spin 1/2 particle 
in a uniform magnetic field 

a. THE INTERACTION HAMILTONIAN AND THE SCHRODINGER EQUATION 

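Consider a silver atom in a uniform magnetic field $\mathbf{B}_0$, and choose the Oz axis
along $\mathbf{B}_0$. The classical potential energy of the magnetic moment
$\boldsymbol{\mathcal{M}} = \gamma \boldsymbol{\mathcal{S}}$ of this atom is then:

$W = -\boldsymbol{\mathcal{M}} \cdot \mathbf{B}_0 = -\mathcal{M}_z B_0 = -\gamma B_0 \mathcal{S}_z$    (B-15)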

where $B_0$ is the modulus of the magnetic field. Let us set:

$\omega_0 = -\gamma B_0$    (B-16)

It is easy to see that $\omega_0$ has the dimensions of the inverse of a time, that is, of an
angular velocity.

Since we are quantizing only the internal degrees of freedom of the particle,
$\mathcal{S}_z$ must be replaced by the operator $S_z$, and the classical energy (B-15) becomes
an operator: it is the Hamiltonian $H$ which describes the evolution of the spin
of the atom in the field $\mathbf{B}_0$:

$H = \omega_0 S_z$    (B-17)

Since this operator is time-independent, solving the corresponding Schrödinger
equation amounts to solving the eigenvalue equation of $H$. We immediately see
that the eigenvectors of $H$ are those of $S_z$:

$H |+\rangle = +\frac{\hbar\omega_0}{2} |+\rangle$
$H |-\rangle = -\frac{\hbar\omega_0}{2} |-\rangle$    (B-18)





CHAPTER IV SPIN 1/2 AND TWO-LEVEL SYSTEMS 


There are therefore two energy levels, $E_+ = +\hbar\omega_0/2$ and $E_- = -\hbar\omega_0/2$ (fig. 10).
Their separation $\hbar\omega_0$ is proportional to the magnetic field; they define a single
"Bohr frequency":

$\nu_{+-} = \frac{1}{h}(E_+ - E_-) = \frac{\omega_0}{2\pi}$    (B-19)


FIGURE 10

Energy levels of a spin 1/2, of gyromagnetic ratio $\gamma$, placed in a magnetic field
$\mathbf{B}_0$ parallel to Oz; $\omega_0$ is defined by $\omega_0 = -\gamma B_0$.


COMMENTS : 

(i) If the field $\mathbf{B}_0$ is parallel to the unit vector $\mathbf{u}$ whose polar angles are $\theta$ and $\varphi$,
relation (B-17) must be replaced by:

$H = \omega_0 S_u$    (B-20)

where $S_u = \mathbf{S} \cdot \mathbf{u}$ is the component of $\mathbf{S}$ along $\mathbf{u}$.

(ii) For the silver atom, $\gamma$ is negative; $\omega_0$ is therefore positive, according to (B-16).
This explains the arrangement of the levels in figure 10. 


b. LARMOR PRECESSION 

Let us assume that, at time $t = 0$, the spin is in the state:

$|\psi(0)\rangle = \cos\frac{\theta}{2}\, e^{-i\varphi/2} |+\rangle + \sin\frac{\theta}{2}\, e^{i\varphi/2} |-\rangle$    (B-21)

(we showed in § B-1-c that any state could be put in this form). To calculate the
state $|\psi(t)\rangle$ at an arbitrary instant $t > 0$, we apply the rule (D-54) given in
chapter III. In expression (B-21), $|\psi(0)\rangle$ is already expanded in terms of the
eigenstates of the Hamiltonian, and we therefore obtain:

$|\psi(t)\rangle = \cos\frac{\theta}{2}\, e^{-i\varphi/2} e^{-iE_+ t/\hbar} |+\rangle + \sin\frac{\theta}{2}\, e^{i\varphi/2} e^{-iE_- t/\hbar} |-\rangle$    (B-22)

or, using the values of $E_+$ and $E_-$:

$|\psi(t)\rangle = \cos\frac{\theta}{2}\, e^{-i(\varphi + \omega_0 t)/2} |+\rangle + \sin\frac{\theta}{2}\, e^{i(\varphi + \omega_0 t)/2} |-\rangle$    (B-23)

The presence of the magnetic field $\mathbf{B}_0$ therefore introduces a phase shift, proportional
to the time, between the coefficients of the kets $|+\rangle$ and $|-\rangle$.

Comparing expression (B-23) for $|\psi(t)\rangle$ with that for the eigenket $|+\rangle_u$
of the observable $\mathbf{S} \cdot \mathbf{u}$ [formula (A-22-a)], we see that the direction $\mathbf{u}(t)$ along
which the spin component is $+\hbar/2$ with certainty is defined by the polar angles:

$\theta(t) = \theta$
$\varphi(t) = \varphi + \omega_0 t$    (B-24)

The angle between $\mathbf{u}(t)$ and Oz (the direction of the magnetic field $\mathbf{B}_0$) therefore
remains constant, but $\mathbf{u}(t)$ revolves about Oz at an angular velocity of $\omega_0$ (proportional
to the magnetic field). Thus, we find in quantum mechanics the phenomenon
which we described for a classical magnetic moment in § A-1-b, and which bears
the name of Larmor precession.

From expression (B-17) for the Hamiltonian, it is obvious that the observable
$S_z$ is a constant of the motion. It can be verified from (B-23) that the probabilities
of obtaining $+\hbar/2$ or $-\hbar/2$ in a measurement of this observable are
time-independent. Since the modulus of $e^{\pm i(\varphi + \omega_0 t)/2}$ is equal to 1, these probabilities
are equal, respectively, to $\cos^2(\theta/2)$ and $\sin^2(\theta/2)$. The mean value of $S_z$ is also
time-independent:

$\langle\psi(t)| S_z |\psi(t)\rangle = \frac{\hbar}{2} \cos\theta$    (B-25)

On the other hand, $S_x$ and $S_y$ do not commute with $H$ [it is easy to show
this by using the matrices which represent $S_x$, $S_y$ and $S_z$ given in (A-15), (A-16)
and (A-17)]. Thus, formulas (B-14) here become:

$\langle\psi(t)| S_x |\psi(t)\rangle = \frac{\hbar}{2} \sin\theta \cos(\varphi + \omega_0 t)$
$\langle\psi(t)| S_y |\psi(t)\rangle = \frac{\hbar}{2} \sin\theta \sin(\varphi + \omega_0 t)$    (B-26)

In these expressions, we again find the single Bohr frequency $\omega_0/2\pi$ of the system.
Moreover, the mean values of $S_x$, $S_y$ and $S_z$ behave like the components of a
classical angular momentum of modulus $\hbar/2$ undergoing Larmor precession.
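
The precession of the mean values can be checked numerically. The following is a minimal sketch, not part of the original text: it builds the two components of the state (B-23), evaluates the mean values of the spin components directly from them, and compares with the closed forms (B-25) and (B-26). The function names, the choice of units ($\hbar = 1$) and the sample values of $\theta$, $\varphi$ and $\omega_0$ are assumptions made for illustration.

```python
import cmath
import math

HBAR = 1.0  # assumption: units in which hbar = 1


def state(theta, phi, omega0, t):
    """Components (c_plus, c_minus) of |psi(t)> as written in (B-23)."""
    half_phase = (phi + omega0 * t) / 2
    c_plus = math.cos(theta / 2) * cmath.exp(-1j * half_phase)
    c_minus = math.sin(theta / 2) * cmath.exp(1j * half_phase)
    return c_plus, c_minus


def spin_means(c_plus, c_minus):
    """Mean values of S_x, S_y, S_z in the state c_plus|+> + c_minus|->."""
    # <S_x> = hbar Re(c_plus* c_minus), <S_y> = hbar Im(c_plus* c_minus)
    coherence = c_plus.conjugate() * c_minus
    s_x = HBAR * coherence.real
    s_y = HBAR * coherence.imag
    s_z = HBAR / 2 * (abs(c_plus) ** 2 - abs(c_minus) ** 2)
    return s_x, s_y, s_z


theta, phi, omega0 = 1.0, 0.3, 2.0  # arbitrary illustrative values
for t in (0.0, 0.5, 1.7):
    computed = spin_means(*state(theta, phi, omega0, t))
    closed_form = (
        HBAR / 2 * math.sin(theta) * math.cos(phi + omega0 * t),  # (B-26)
        HBAR / 2 * math.sin(theta) * math.sin(phi + omega0 * t),  # (B-26)
        HBAR / 2 * math.cos(theta),                               # (B-25)
    )
    print(t, computed, closed_form)
```

In this sketch the $z$ component stays fixed while the $x$ and $y$ components rotate at the angular velocity $\omega_0$, which is the behavior described above.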


C. GENERAL STUDY OF TWO-LEVEL SYSTEMS 

The simplicity of the calculations presented in §B derives from the fact 
that the state space has only two dimensions. 

There exist numerous other cases in physics which, to a first approximation, 
can be treated just as simply. Consider, for example, a physical system having two 
states whose energies are close together and very different from those of all other 
states of the system. Assume that we want to evaluate the effect of an external 
perturbation (or of internal interactions previously neglected) on these two states. 
When the intensity of the perturbation is sufficiently weak, it can be 
shown (cf. chap. XI) that its effect on the two states can be calculated, to a first 
approximation, by ignoring all the other energy levels of the system. All the 
calculations can then be performed in a two-dimensional subspace of the state
space.

In this section, we shall study certain general properties of two-level
systems (which are not necessarily spin 1/2 particles). Such a study is interesting 
because it enables us, using a mathematically simple model, to bring out some 
general and important physical ideas (quantum resonance, oscillation between two 
levels, etc...). 


1. Outline of the problem 


a. NOTATION 


Consider a physical system whose state space is two-dimensional (as we
have already pointed out, this is usually an approximation: under certain conditions,
we can confine ourselves to a two-dimensional subspace of the state space).
For a basis, we choose the system of the two eigenstates $|\varphi_1\rangle$ and $|\varphi_2\rangle$ of the
Hamiltonian $H_0$ whose eigenvalues are, respectively, $E_1$ and $E_2$:

$H_0 |\varphi_1\rangle = E_1 |\varphi_1\rangle$
$H_0 |\varphi_2\rangle = E_2 |\varphi_2\rangle$    (C-1)

This basis is orthonormal:

$\langle\varphi_i|\varphi_j\rangle = \delta_{ij}$ ;  $i, j = 1, 2$    (C-2)

Assume that we want to take into account an external perturbation or 
interactions internal to the system, initially neglected in H 0 . The Hamiltonian 
becomes: 

H = H 0 + W (C-3) 


The eigenstates and eigenvalues of $H$ will be denoted by $|\psi_\pm\rangle$ and $E_\pm$:

$H |\psi_+\rangle = E_+ |\psi_+\rangle$
$H |\psi_-\rangle = E_- |\psi_-\rangle$    (C-4)

$H_0$ is often called the unperturbed Hamiltonian and $W$, the perturbation
or coupling. We shall assume here that $W$ is time-independent. In the $\{ |\varphi_1\rangle, |\varphi_2\rangle \}$
basis of eigenstates of $H_0$ (called unperturbed states), $W$ is represented by a
Hermitian matrix:

$(W) = \begin{pmatrix} W_{11} & W_{12} \\ W_{21} & W_{22} \end{pmatrix}$    (C-5)

$W_{11}$ and $W_{22}$ are real. Moreover:

$W_{12} = W_{21}^*$    (C-6)


In the absence of coupling, $E_1$ and $E_2$ are the possible energies of the system,
and the states $|\varphi_1\rangle$ and $|\varphi_2\rangle$ are stationary states (if the system is placed in one
of these two states, it remains there indefinitely). The problem consists of evaluating
the modifications which appear when the coupling $W$ is introduced.

b. CONSEQUENCES OF THE COUPLING

α. $E_1$ and $E_2$ are no longer the possible energies of the system

A measurement of the energy of the system can yield only one of the two
eigenvalues $E_+$ and $E_-$ of $H$, which generally differ from $E_1$ and $E_2$.

The first problem which arises therefore consists of calculating $E_+$ and $E_-$
in terms of $E_1$, $E_2$ and the matrix elements $W_{ij}$ of $W$. This amounts to studying
the effect of the coupling on the position of the energy levels.

β. $|\varphi_1\rangle$ and $|\varphi_2\rangle$ are no longer stationary states

Since $|\varphi_1\rangle$ and $|\varphi_2\rangle$ are not generally eigenstates of the total Hamiltonian $H$,
they are no longer stationary states. If, for example, the system at time $t = 0$
is in the state $|\varphi_1\rangle$, there is a certain probability $\mathcal{P}_{12}(t)$ of finding it in the
state $|\varphi_2\rangle$ at time $t$: $W$ therefore induces transitions between the two unperturbed
states. Hence the name "coupling" (between $|\varphi_1\rangle$ and $|\varphi_2\rangle$) given to $W$.

This dynamic aspect of the effect of W constitutes the second problem with 
which we shall be concerned. 


COMMENT:

In complement C_IV, the two problems we have just cited are considered by
introducing the concept of a fictitious spin. It can indeed be shown that the Hamiltonian
$H$ to be diagonalized has the same form as that of a spin 1/2 placed in a static
magnetic field $\mathbf{B}$, whose components $B_x$, $B_y$ and $B_z$ are simply expressed in terms of $E_1$, $E_2$
and the matrix elements $W_{ij}$. In other words, with every two-level system (not necessarily
a spin 1/2) can be associated a spin 1/2 (called a fictitious spin) placed in a static
field $\mathbf{B}$ and described by a Hamiltonian of identical form. All the results related to
two-level systems which we are going to establish in this section can be interpreted in
a simple geometric way in terms of magnetic moment, Larmor precession, and the
various concepts introduced in §§ A and B of this chapter in connection with spin 1/2
particles. This geometrical interpretation is developed in complement C_IV.


2. Static aspect: effect of coupling 

on the stationary states of the system 


a. EXPRESSIONS FOR THE EIGENSTATES AND EIGENVALUES OF H

In the $\{ |\varphi_1\rangle, |\varphi_2\rangle \}$ basis, the matrix representing $H$ is written:

$(H) = \begin{pmatrix} E_1 + W_{11} & W_{12} \\ W_{21} & E_2 + W_{22} \end{pmatrix}$    (C-7)


The diagonalization of matrix (C-7) presents no problems (it is performed
in detail in complement B_IV). We find the eigenvalues:

$E_+ = \frac{1}{2}(E_1 + W_{11} + E_2 + W_{22}) + \frac{1}{2}\sqrt{(E_1 + W_{11} - E_2 - W_{22})^2 + 4|W_{12}|^2}$
$E_- = \frac{1}{2}(E_1 + W_{11} + E_2 + W_{22}) - \frac{1}{2}\sqrt{(E_1 + W_{11} - E_2 - W_{22})^2 + 4|W_{12}|^2}$    (C-8)

(it can be verified that if $W = 0$, $E_+$ and $E_-$ are identical to $E_1$ and $E_2$ *). The
eigenvectors associated with $E_+$ and $E_-$ are written:

$|\psi_+\rangle = \cos\frac{\theta}{2}\, e^{-i\varphi/2} |\varphi_1\rangle + \sin\frac{\theta}{2}\, e^{i\varphi/2} |\varphi_2\rangle$    (C-9-a)
$|\psi_-\rangle = -\sin\frac{\theta}{2}\, e^{-i\varphi/2} |\varphi_1\rangle + \cos\frac{\theta}{2}\, e^{i\varphi/2} |\varphi_2\rangle$    (C-9-b)

where the angles $\theta$ and $\varphi$ are defined by:

$\tan\theta = \frac{2|W_{12}|}{E_1 + W_{11} - E_2 - W_{22}}$  with $0 \le \theta < \pi$    (C-10)
$W_{21} = |W_{21}|\, e^{i\varphi}$    (C-11)
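
Formulas (C-8) to (C-11) lend themselves to a direct numerical check. The sketch below is an illustration added here, not part of the original text: it computes $E_\pm$, $\theta$ and $\varphi$ from given matrix elements, builds the vectors (C-9), and measures how well they satisfy the eigenvalue equations (C-4). The function names and the sample matrix elements are hypothetical, and a non-zero $W_{12}$ is assumed so that $\varphi$ is well defined.

```python
import cmath
import math


def diagonalize(e1, e2, w11, w22, w12):
    """Return (E_plus, E_minus, psi_plus, psi_minus) from (C-8) to (C-11)."""
    w21 = w12.conjugate()
    diff = e1 + w11 - e2 - w22
    mean = (e1 + w11 + e2 + w22) / 2
    root = math.sqrt(diff ** 2 + 4 * abs(w12) ** 2)
    # atan2 with a non-negative first argument returns an angle in [0, pi]
    theta = math.atan2(2 * abs(w12), diff)
    phi = cmath.phase(w21)
    minus_half = cmath.exp(-1j * phi / 2)
    plus_half = cmath.exp(1j * phi / 2)
    psi_plus = (math.cos(theta / 2) * minus_half, math.sin(theta / 2) * plus_half)
    psi_minus = (-math.sin(theta / 2) * minus_half, math.cos(theta / 2) * plus_half)
    return mean + root / 2, mean - root / 2, psi_plus, psi_minus


def residual(e1, e2, w11, w22, w12, energy, vec):
    """Largest component of (H - energy) applied to vec, with H as in (C-7)."""
    w21 = w12.conjugate()
    top = (e1 + w11 - energy) * vec[0] + w12 * vec[1]
    bottom = w21 * vec[0] + (e2 + w22 - energy) * vec[1]
    return max(abs(top), abs(bottom))


params = (1.0, 0.4, 0.1, -0.2, 0.3 - 0.4j)  # hypothetical E1, E2, W11, W22, W12
e_plus, e_minus, psi_plus, psi_minus = diagonalize(*params)
print(e_plus, e_minus)
print(residual(*params, e_plus, psi_plus), residual(*params, e_minus, psi_minus))
```

Both residuals should vanish to rounding error, which is a quick way to catch a sign or phase slip when transcribing (C-9).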


b. DISCUSSION 


α. Graphical representation of the effect of coupling

All the interesting effects which we shall discuss later arise from the fact
that the perturbation $W$ possesses non-diagonal matrix elements $W_{12} = W_{21}^*$
(if $W_{12} = 0$, the eigenstates of $H$ are the same as those of $H_0$, the new eigenvalues
being simply $E_1 + W_{11}$ and $E_2 + W_{22}$). To simplify the discussion, we shall
therefore assume from now on that the matrix $(W)$ is purely non-diagonal, that
is, that $W_{11} = W_{22} = 0$ **. Formulas (C-8) and (C-10) then become:

$E_+ = \frac{1}{2}(E_1 + E_2) + \frac{1}{2}\sqrt{(E_1 - E_2)^2 + 4|W_{12}|^2}$
$E_- = \frac{1}{2}(E_1 + E_2) - \frac{1}{2}\sqrt{(E_1 - E_2)^2 + 4|W_{12}|^2}$    (C-12)

$\tan\theta = \frac{2|W_{12}|}{E_1 - E_2}$ ,  $0 \le \theta < \pi$    (C-13)


We now intend to study the effect of the coupling $W$ on the energies $E_+$
and $E_-$ in terms of the values of $E_1$ and $E_2$. Assume that $W_{12}$ is fixed and introduce
the two parameters:

$E_m = \frac{1}{2}(E_1 + E_2)$
$\Delta = \frac{1}{2}(E_1 - E_2)$    (C-14)

* If $E_1 > E_2$, $E_+$ approaches $E_1$ and $E_-$ approaches $E_2$ when $W$ approaches zero. On the other
hand, if $E_1 < E_2$, $E_+$ approaches $E_2$ and $E_-$ approaches $E_1$.

** If $W_{11}$ and $W_{22}$ are non-zero, we simply set: $\tilde{E}_1 = E_1 + W_{11}$, $\tilde{E}_2 = E_2 + W_{22}$. All the
results obtained in this section then remain valid if we replace $E_1$ and $E_2$ by $\tilde{E}_1$ and $\tilde{E}_2$.

We see immediately from (C-12) that the variation of $E_+$ and $E_-$ with respect
to $E_m$ is extremely simple: changing $E_m$ reduces to shifting the origin along the
energy axis. Moreover, it can be verified from (C-9), (C-10) and (C-11) that the
vectors $|\psi_+\rangle$ and $|\psi_-\rangle$ do not depend on $E_m$. We are therefore concerned only
with the influence of the parameter $\Delta$. Let us show on the same graph, in terms
of $\Delta$, the four energies $E_1$, $E_2$, $E_+$ and $E_-$. We thus obtain for $E_1$ and $E_2$ two
straight lines of slope $+1$ and $-1$ (shown in dashed lines in figure 11). Substituting
(C-14) into (C-12), we find:

$E_+ = E_m + \sqrt{\Delta^2 + |W_{12}|^2}$    (C-15)
$E_- = E_m - \sqrt{\Delta^2 + |W_{12}|^2}$    (C-16)

When $\Delta$ varies, $E_+$ and $E_-$ describe the two branches of a hyperbola which is
symmetrical with respect to the coordinate axes and whose asymptotes are the
two straight lines associated with the unperturbed levels; the minimum separation
between the two branches is $2|W_{12}|$ (solid lines in figure 11)*.



FIGURE 11

Variation of the energies $E_+$ and $E_-$ with respect to the energy difference $\Delta = (E_1 - E_2)/2$. In the
absence of coupling, the levels cross at the origin (dashed straight lines). Under the effect of the
non-diagonal coupling $W$, the two perturbed levels "repel each other" and we obtain an "anti-crossing":
the curves giving $E_+$ and $E_-$ in terms of $\Delta$ are branches of a hyperbola (solid lines in the figure) whose
asymptotes are the unperturbed levels.


β. Effect of the coupling on the position of the energy levels

In the absence of coupling, the energies $E_1$ and $E_2$ of the two levels "cross"
at $\Delta = 0$. It is clear from figure 11 that under the effect of coupling, the two
levels "repel each other", that is, the energy values move further away from each
other. The diagram in solid lines in figure 11 is often called, for this reason, an
anti-crossing diagram.

* It is clear from figure 11 why, when $W \to 0$:

$E_+ \to E_1$, $E_- \to E_2$ if $E_1 > E_2$
$E_+ \to E_2$, $E_- \to E_1$ if $E_1 < E_2$

Also, we see that, for any $\Delta$, we always have:

$|E_+ - E_-| > |E_1 - E_2|$


This is a result which appears rather often in other domains of physics (for
example, in electrical circuit theory): the coupling separates the normal frequencies.
Near the asymptotes, that is, for $|\Delta| \gg |W_{12}|$, formulas (C-15) and (C-16) can
be written in the form of a limited power series expansion in $|W_{12}/\Delta|$:

$E_+ = E_m + \Delta \left( 1 + \frac{1}{2} \left| \frac{W_{12}}{\Delta} \right|^2 + \dots \right)$
$E_- = E_m - \Delta \left( 1 + \frac{1}{2} \left| \frac{W_{12}}{\Delta} \right|^2 + \dots \right)$    (C-17)


On the other hand, at the center of the hyperbola, for $E_2 = E_1$ ($\Delta = 0$), formulas
(C-15) and (C-16) yield:

$E_+ = E_m + |W_{12}|$
$E_- = E_m - |W_{12}|$    (C-18)

Therefore, the effect of the coupling is much more important when the two
unperturbed levels have the same energy. The effect is then of first order, as can
be seen from (C-18), while it is of second order when $\Delta \gg |W_{12}|$ [formulas (C-17)].
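
The contrast between the first-order effect at the crossing and the second-order effect far from it can be made concrete with a few numbers. The following sketch is an added illustration: it evaluates the exact energies (C-15) and (C-16), the truncated expansion (C-17), and the level separation for several values of $\Delta$. The function names and the numerical values are assumptions, and the expansion is only used for $\Delta > 0$.

```python
import math


def perturbed_energies(e_m, delta, w_abs):
    """Exact energies (E_plus, E_minus) from (C-15) and (C-16)."""
    root = math.sqrt(delta ** 2 + w_abs ** 2)
    return e_m + root, e_m - root


def second_order_energies(e_m, delta, w_abs):
    """Expansion (C-17) truncated after the second-order term (delta > 0)."""
    shift = delta * (1 + 0.5 * (w_abs / delta) ** 2)
    return e_m + shift, e_m - shift


e_m, w_abs = 0.0, 1.0  # hypothetical values, in arbitrary energy units

# At the crossing (delta = 0) the levels are split by 2|W12|, as in (C-18).
e_plus, e_minus = perturbed_energies(e_m, 0.0, w_abs)
print("gap at delta = 0:", e_plus - e_minus)

# Far from the crossing, the shift of each level is close to |W12|^2 / (2 delta).
for delta in (5.0, 10.0, 50.0):
    exact = perturbed_energies(e_m, delta, w_abs)[0]
    approx = second_order_energies(e_m, delta, w_abs)[0]
    print(delta, exact - (e_m + delta), approx - (e_m + delta))
```

The printed shifts shrink roughly as $1/\Delta$, whereas at $\Delta = 0$ each level moves by the full $|W_{12}|$.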


γ. Effect of the coupling on the eigenstates

When (C-14) is used, formula (C-13) becomes:

$\tan\theta = \frac{|W_{12}|}{\Delta}$    (C-19)

It follows that, when $\Delta \ll |W_{12}|$ (strong coupling), $\theta \simeq \pi/2$; on the other hand,
when $\Delta \gg |W_{12}|$ (weak coupling), $\theta \simeq 0$ (assuming $\Delta \ge 0$).

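At the center of the hyperbola, when $E_2 = E_1$ ($\Delta = 0$), we have:

$|\psi_+\rangle = \frac{1}{\sqrt{2}} \left[ e^{-i\varphi/2} |\varphi_1\rangle + e^{i\varphi/2} |\varphi_2\rangle \right]$
$|\psi_-\rangle = \frac{1}{\sqrt{2}} \left[ -e^{-i\varphi/2} |\varphi_1\rangle + e^{i\varphi/2} |\varphi_2\rangle \right]$    (C-20)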


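while near the asymptotes (that is, for $\Delta \gg |W_{12}|$), we have, to first order
in $|W_{12}|/\Delta$:

$|\psi_+\rangle = e^{-i\varphi/2} \left[ |\varphi_1\rangle + e^{i\varphi} \frac{|W_{12}|}{2\Delta} |\varphi_2\rangle + \dots \right]$
$|\psi_-\rangle = e^{i\varphi/2} \left[ |\varphi_2\rangle - e^{-i\varphi} \frac{|W_{12}|}{2\Delta} |\varphi_1\rangle + \dots \right]$    (C-21)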


In other words, for a weak coupling ($E_1 - E_2 \gg |W_{12}|$), the perturbed states
differ very slightly from the unperturbed states. We see from (C-21) that to
within a global phase factor $e^{-i\varphi/2}$, $|\psi_+\rangle$ is equal to the state $|\varphi_1\rangle$ slightly
"contaminated" by a small contribution from the state $|\varphi_2\rangle$. On the other hand,
for a strong coupling ($E_1 - E_2 \ll |W_{12}|$), formulas (C-20) indicate that the
states $|\psi_+\rangle$ and $|\psi_-\rangle$ are very different from the states $|\varphi_1\rangle$ and $|\varphi_2\rangle$, since
they are linear superpositions of them with coefficients of the same modulus.

Thus, like the energies, the eigenstates undergo significant modifications 
in the neighborhood of the point where the two unperturbed states cross. 


c. IMPORTANT APPLICATION : 

THE PHENOMENA OF QUANTUM RESONANCE 

When $E_1 = E_2 = E_m$, the corresponding energy of $H_0$ is two-fold degenerate.
As we have just seen, the coupling $W_{12}$ lifts this degeneracy and, in
particular, gives rise to a level whose energy is lowered by $|W_{12}|$. In other words,
if the ground state of a physical system is two-fold degenerate (and sufficiently 
far from all the other levels), any (purely non-diagonal) coupling between the 
two corresponding states lowers the energy of the ground state of the system, 
which thus becomes more stable. 

As a first example of this phenomenon, we shall cite the resonance stabilization
of the benzene molecule C6H6. Experiments show that the six carbon
atoms are situated at the vertices of a regular hexagon, and we would expect the
ground state to include three double bonds between neighboring carbon atoms.
Figures 12-a and 12-b represent two possible dispositions of these bonds. The
nuclei are assumed here to be fixed because of their high masses. Thus, the electronic
states $|\varphi_1\rangle$ and $|\varphi_2\rangle$, associated with figures 12-a and 12-b respectively,
are different. If the structure of figure 12-a were the only one possible, the
ground state of the electronic system would have an energy of $E_m = \langle\varphi_1| H |\varphi_1\rangle$,
where $H$ is the Hamiltonian of the electrons in the potential created by the nuclei.
But the bonds can also be placed as shown in figure 12-b. By symmetry, we
obviously have $\langle\varphi_2| H |\varphi_2\rangle = \langle\varphi_1| H |\varphi_1\rangle$, and we could conclude that the
ground state of the molecule is doubly degenerate. However, the non-diagonal
matrix element $\langle\varphi_2| H |\varphi_1\rangle$ of the Hamiltonian $H$ is not zero. This coupling
between the states $|\varphi_1\rangle$ and $|\varphi_2\rangle$ gives rise to two distinct levels, one of which
has an energy lower than $E_m$. The benzene molecule is therefore more stable
than we would have expected. Moreover, in its true ground state, the configuration
of the molecule cannot be represented either by figure 12-a or by figure 12-b: this
state is a linear superposition of $|\varphi_1\rangle$ and $|\varphi_2\rangle$ [the coefficients of this
superposition having, as in (C-20), the same modulus]. This is what is symbolized by the
double arrow of figure 12, commonly used by chemists.

FIGURE 12

Two possible configurations (a and b) of the double bonds in a benzene molecule.

Another example is that of the (ionized) molecule $H_2^+$, composed of two
protons $p_1$ and $p_2$ and one electron. The two protons, because of their large
masses, can be considered to be fixed. Let us call $R$ the distance between them
and $|\varphi_1\rangle$ and $|\varphi_2\rangle$ the states where the electron is localized around $p_1$ or
around $p_2$, its wave function being that of the hydrogen atom it would form
with $p_1$ or $p_2$ (fig. 13). As above, the diagonal elements $\langle\varphi_1| H |\varphi_1\rangle$ and
$\langle\varphi_2| H |\varphi_2\rangle$ of the Hamiltonian are equal because of symmetry; we shall denote
them by $E_m(R)$. The two states $|\varphi_1\rangle$ and $|\varphi_2\rangle$ are not, however, stationary
states, since the matrix element $\langle\varphi_1| H |\varphi_2\rangle$ is not zero. Here again, we obtain
an energy level lower than $E_m(R)$ and, in the ground state, the wave function
of the electron is a linear combination of those of figures 13-a and 13-b. The
electron is thus no longer localized about one of the two protons alone, and it is
this delocalization which, by lowering its potential energy, is responsible for the
chemical bond*.

FIGURE 13

In the $H_2^+$ ion, the electron could logically be localized either around the proton $p_1$ (fig. a) or around
the proton $p_2$ (fig. b). In the ground state of the ion, the wave function of the electron is a linear
superposition of the wave functions associated with figures a and b. Its probability of presence is symmetrical
with respect to the plane bisecting $p_1 p_2$.


3. Dynamical aspect: oscillation of the system 
between the two unperturbed states 

a. EVOLUTION OF THE STATE VECTOR 

Let:

$|\psi(t)\rangle = a_1(t) |\varphi_1\rangle + a_2(t) |\varphi_2\rangle$    (C-22)

be the state vector of the system at the instant $t$. The evolution of $|\psi(t)\rangle$ in the
presence of the coupling $W$ is given by the Schrödinger equation:

$i\hbar \frac{d}{dt} |\psi(t)\rangle = (H_0 + W) |\psi(t)\rangle$    (C-23)


* A more elaborate study of the ionized molecule $H_2^+$ will be presented in complement G_XI.




Let us project this equation onto the basis vectors $|\varphi_1\rangle$ and $|\varphi_2\rangle$. We
obtain, using (C-5) (where we have set $W_{11} = W_{22} = 0$) and (C-22):

$i\hbar \frac{d}{dt} a_1(t) = E_1 a_1(t) + W_{12} a_2(t)$
$i\hbar \frac{d}{dt} a_2(t) = W_{21} a_1(t) + E_2 a_2(t)$    (C-24)

If $W_{21} \neq 0$, these equations constitute a linear system of homogeneous coupled
differential equations. The classical method of solving such a system reduces,
in fact, to the application of rule (D-54) of chapter III: look for the eigenvectors
$|\psi_+\rangle$ (eigenvalue $E_+$) and $|\psi_-\rangle$ (eigenvalue $E_-$) of the operator
$H = H_0 + W$ [whose matrix elements are the coefficients of equations (C-24)],
and decompose $|\psi(0)\rangle$ in terms of $|\psi_+\rangle$ and $|\psi_-\rangle$:

$|\psi(0)\rangle = \lambda |\psi_+\rangle + \mu |\psi_-\rangle$    (C-25)

(where $\lambda$ and $\mu$ are fixed by the initial conditions). We then have:

$|\psi(t)\rangle = \lambda\, e^{-iE_+ t/\hbar} |\psi_+\rangle + \mu\, e^{-iE_- t/\hbar} |\psi_-\rangle$    (C-26)

[which enables us to obtain $a_1(t)$ and $a_2(t)$ by projecting $|\psi(t)\rangle$ onto $|\varphi_1\rangle$
and $|\varphi_2\rangle$].

It can be shown that a system whose state vector is the vector $|\psi(t)\rangle$ given
in (C-26) oscillates between the two unperturbed states $|\varphi_1\rangle$ and $|\varphi_2\rangle$. To see
this, we shall assume that the system at time $t = 0$ is in the state $|\varphi_1\rangle$:

$|\psi(0)\rangle = |\varphi_1\rangle$    (C-27)

and calculate the probability $\mathcal{P}_{12}(t)$ of finding it in the state $|\varphi_2\rangle$ at time $t$.


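b. CALCULATION OF $\mathcal{P}_{12}(t)$; RABI'S FORMULA

As in (C-25), let us therefore expand the ket $|\psi(0)\rangle$ given in (C-27)
on the $\{ |\psi_+\rangle, |\psi_-\rangle \}$ basis. Inverting formulas (C-9), we obtain:

$|\psi(0)\rangle = |\varphi_1\rangle = e^{i\varphi/2} \left[ \cos\frac{\theta}{2} |\psi_+\rangle - \sin\frac{\theta}{2} |\psi_-\rangle \right]$    (C-28)

from which we deduce, using (C-26):

$|\psi(t)\rangle = e^{i\varphi/2} \left[ \cos\frac{\theta}{2}\, e^{-iE_+ t/\hbar} |\psi_+\rangle - \sin\frac{\theta}{2}\, e^{-iE_- t/\hbar} |\psi_-\rangle \right]$    (C-29)

The probability amplitude of finding the system at time $t$ in the state $|\varphi_2\rangle$
is then written:

$\langle\varphi_2|\psi(t)\rangle = e^{i\varphi/2} \left[ \cos\frac{\theta}{2}\, e^{-iE_+ t/\hbar} \langle\varphi_2|\psi_+\rangle - \sin\frac{\theta}{2}\, e^{-iE_- t/\hbar} \langle\varphi_2|\psi_-\rangle \right]$
$= e^{i\varphi} \sin\frac{\theta}{2} \cos\frac{\theta}{2} \left[ e^{-iE_+ t/\hbar} - e^{-iE_- t/\hbar} \right]$    (C-30)

which enables us to calculate $\mathcal{P}_{12}(t) = |\langle\varphi_2|\psi(t)\rangle|^2$. We thus find:

$\mathcal{P}_{12}(t) = \frac{1}{2} \sin^2\theta \left[ 1 - \cos\left( \frac{E_+ - E_-}{\hbar} t \right) \right] = \sin^2\theta\, \sin^2\left( \frac{E_+ - E_-}{2\hbar} t \right)$    (C-31)

or, using expressions (C-12) and (C-13):

$\mathcal{P}_{12}(t) = \frac{4|W_{12}|^2}{4|W_{12}|^2 + (E_1 - E_2)^2}\, \sin^2\left[ \sqrt{4|W_{12}|^2 + (E_1 - E_2)^2}\; \frac{t}{2\hbar} \right]$    (C-32)

Formula (C-32) is sometimes called Rabi's formula.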
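
Rabi's formula can be compared with a direct numerical integration of the coupled equations (C-24). The sketch below is an added illustration, not the authors' method: it integrates (C-24) with a standard fourth-order Runge-Kutta scheme, starting from the initial condition (C-27), and compares $|a_2(t)|^2$ with (C-32). The function names, the step count, the units ($\hbar = 1$) and the numerical values of $E_1$, $E_2$ and $W_{12}$ are assumptions; the closed form also assumes that $W_{12}$ and $E_1 - E_2$ are not both zero.

```python
import math

HBAR = 1.0  # assumption: units in which hbar = 1


def rabi_probability(e1, e2, w12, t):
    """P_12(t) from Rabi's formula (C-32)."""
    gap_sq = 4 * abs(w12) ** 2 + (e1 - e2) ** 2
    amplitude = 4 * abs(w12) ** 2 / gap_sq
    return amplitude * math.sin(math.sqrt(gap_sq) * t / (2 * HBAR)) ** 2


def integrated_probability(e1, e2, w12, t_final, steps=5000):
    """|a_2(t_final)|^2 from a Runge-Kutta integration of (C-24)."""
    w21 = w12.conjugate()

    def rhs(a1, a2):
        # (C-24) divided by i*hbar
        return (-1j / HBAR * (e1 * a1 + w12 * a2),
                -1j / HBAR * (w21 * a1 + e2 * a2))

    a1, a2 = 1 + 0j, 0j  # initial condition (C-27)
    dt = t_final / steps
    for _ in range(steps):
        k1 = rhs(a1, a2)
        k2 = rhs(a1 + dt / 2 * k1[0], a2 + dt / 2 * k1[1])
        k3 = rhs(a1 + dt / 2 * k2[0], a2 + dt / 2 * k2[1])
        k4 = rhs(a1 + dt * k3[0], a2 + dt * k3[1])
        a1 += dt / 6 * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0])
        a2 += dt / 6 * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1])
    return abs(a2) ** 2


e1, e2, w12 = 1.0, 0.2, 0.3 + 0.1j  # hypothetical values
for t in (1.0, 2.5, 5.0):
    print(t, rabi_probability(e1, e2, w12, t), integrated_probability(e1, e2, w12, t))
```

The two columns should agree closely, and only $|W_{12}|$ enters the result: the phase $\varphi$ of the coupling drops out of $\mathcal{P}_{12}(t)$, as (C-30) already suggests.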


c. DISCUSSION 

We observe from (C-31) that $\mathcal{P}_{12}(t)$ oscillates over time with a frequency
of $(E_+ - E_-)/h$, which is simply the unique Bohr frequency of the system.
$\mathcal{P}_{12}(t)$ varies between zero and a maximum value which, according to (C-31), is
equal to $\sin^2\theta$ and is attained for all values of $t$ such that $t = (2k + 1)\pi\hbar/(E_+ - E_-)$,
with $k = 0, 1, 2, \dots$ (fig. 14).

FIGURE 14

Variation with respect to time of the probability $\mathcal{P}_{12}(t)$ of finding the system in the state
$|\varphi_2\rangle$ when it was initially in the state $|\varphi_1\rangle$. The first maximum occurs at
$t = \pi\hbar/(E_+ - E_-)$. When the states $|\varphi_1\rangle$ and $|\varphi_2\rangle$ have the same unperturbed energy,
the probability $\mathcal{P}_{12}(t)$ can attain the value 1.


The oscillation frequency $(E_+ - E_-)/h$, as well as the maximum value $\sin^2\theta$
of $\mathcal{P}_{12}(t)$, are functions of $|W_{12}|$ and $E_1 - E_2$, whose main features we are now
going to describe.

When $E_1 = E_2$, $(E_+ - E_-)/h$ is equal to $2|W_{12}|/h$, and $\sin^2\theta$ takes on its
greatest possible value, that is, 1: at certain times, $t = (2k + 1)\pi\hbar/2|W_{12}|$, the
system (which started from the state $|\varphi_1\rangle$) is in the state $|\varphi_2\rangle$. Therefore, any
coupling between two states of equal energy causes the system to oscillate completely
from one state to the other with a frequency proportional to the coupling*.


* The same phenomenon is found in other domains of physics. Consider, for example, two identical
pendulums (1) and (2), suspended from the same support and having the same frequency. Let us
assume that at time $t = 0$, pendulum (1) is set in motion. Because of the coupling assured by the
common support, we know (cf. complement H_V) that after a certain time (the stronger the coupling, the
shorter the time), we shall arrive at the complementary situation, in which only pendulum (2) is oscillating
with the initial amplitude of pendulum (1), and so on.

When $E_1 - E_2$ increases, $(E_+ - E_-)/h$ does also, while $\sin^2\theta$ decreases. For
a weak coupling ($E_1 - E_2 \gg |W_{12}|$), $E_+ - E_-$ differs very little from $E_1 - E_2$,
and $\sin^2\theta$ becomes very small. This last result is not surprising since, in the case
of a weak coupling, the state $|\varphi_1\rangle$ is very close to the stationary state
$|\psi_+\rangle$ [cf. formulas (C-21)]: the system, having started in the state $|\varphi_1\rangle$,
therefore evolves very little over time.


d. EXAMPLE OF OSCILLATION BETWEEN TWO STATES 

Let us return to the example of the $H_2^+$ molecule. We shall assume that, at
a certain time, the electron is localized about proton $p_1$: it is, for example,
in the state shown in figure 13-a. According to the results of the preceding
section, we know that it will oscillate between the two protons with a frequency
equal to the Bohr frequency associated with the two stationary states $|\psi_+\rangle$
and $|\psi_-\rangle$ of the molecule. To this oscillation of the electron between the two
states represented in figures 13-a and 13-b corresponds an oscillation of the mean value
of the electric dipole moment of the molecule (the dipole moment is non-zero when
the electron is localized about one of the two protons and changes sign depending
on whether the proton involved is $p_1$ or $p_2$). Thus we see concretely how, when
the molecule is not in a stationary state, an oscillating electric dipole moment can
appear. Such an oscillating dipole can exchange energy with an electromagnetic
wave of the same frequency. Consequently, this frequency must appear in the
absorption and emission spectrum of the $H_2^+$ ion.

Other examples of oscillations between two states are discussed in
complements F_IV, G_IV and H_IV.


References and suggestions for further reading : 

The Stern-Gerlach experiment : original article (3.8); Cagnac and Pebay-Peyroula 
(11.2)* chap. X; Eisberg and Resnick (1.3), §8-3; Bohm (5.1), §§22.5 and 22.6; 
Frisch (3.13). 

Two-level systems: Feynman III (1.2), chaps. 6, 10 and 11; Valentin (16.1),
Annexe XII; Allen and Eberly (15.8), particularly chap. 3.

COMPLEMENTS OF CHAPTER IV

A_IV: THE PAULI MATRICES
B_IV: DIAGONALIZATION OF A 2 x 2 HERMITIAN MATRIX
    A_IV, B_IV: technical study of 2 x 2 matrices; simple, and important for solving numerous
    quantum mechanical problems.

C_IV: FICTITIOUS SPIN 1/2 ASSOCIATED WITH A TWO-LEVEL SYSTEM
    C_IV: establishes the close relation which exists between §§ B and C of chapter IV; supplies a
    simple geometrical interpretation of the properties of two-level systems (easy, but not
    indispensable for what follows).

D_IV: SYSTEM OF TWO SPIN 1/2 PARTICLES
    D_IV: simple illustration of the tensor product and the postulates of quantum mechanics (can be
    considered to be a set of worked exercises).

E_IV: SPIN 1/2 DENSITY MATRIX
    E_IV: illustration, in the case of spin 1/2 particles, of concepts introduced in complement E_III.

F_IV: SPIN 1/2 PARTICLE IN A STATIC MAGNETIC FIELD AND A ROTATING FIELD: MAGNETIC RESONANCE
    F_IV: study of a very important physical phenomenon with many applications: magnetic
    resonance; can be studied later.

G_IV: A SIMPLE MODEL OF THE AMMONIA MOLECULE
    G_IV: example of a physical system whose study can be reduced, in a first approximation,
    to that of a two-level system; moderately difficult.

H_IV: COUPLING BETWEEN A STABLE STATE AND AN UNSTABLE STATE
    H_IV: study of the influence of coupling between two levels with different lifetimes; easy, but
    requires the concepts introduced in complement K_III.

J_IV: EXERCISES


Complement A_IV
THE PAULI MATRICES


1. Definition; eigenvalues and eigenvectors 

2. Simple properties 

3. A convenient basis of the 2x2 matrix space 


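In § A-2 of chapter IV, we introduced the matrices which represent the three
components $S_x$, $S_y$ and $S_z$ of a spin $\mathbf{S}$ in the $\{ |+\rangle, |-\rangle \}$ basis (eigenvectors
of $S_z$). In quantum mechanics, it is often convenient to introduce the dimensionless
operator $\boldsymbol{\sigma}$, proportional to $\mathbf{S}$ and given by:

$\mathbf{S} = \frac{\hbar}{2} \boldsymbol{\sigma}$    (1)

The matrices which represent the three components of $\boldsymbol{\sigma}$ in the $\{ |+\rangle, |-\rangle \}$
basis are called the "Pauli matrices".


1. Definition; eigenvalues and eigenvectors

Let us go back to equations (A-15), (A-16) and (A-17) of chapter IV.
Using (1), we see that the definition of the Pauli matrices is:

$\sigma_x = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}$ ,  $\sigma_y = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}$ ,  $\sigma_z = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}$    (2)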

These are Hermitian matrices, all three of which have the same
characteristic equation:

$\lambda^2 - 1 = 0$    (3)

The eigenvalues of $\sigma_x$, $\sigma_y$ and $\sigma_z$ are therefore:

$\lambda = \pm 1$    (4)

which is consistent with the fact that those of $S_x$, $S_y$ and $S_z$ are $\pm\hbar/2$.

It is easy to obtain, from definition (2), the eigenvectors of $\sigma_x$, $\sigma_y$ and $\sigma_z$.
They are the same, respectively, as those of $S_x$, $S_y$ and $S_z$, already introduced
in § A-2 of chapter IV:

$\sigma_x |\pm\rangle_x = \pm |\pm\rangle_x$
$\sigma_y |\pm\rangle_y = \pm |\pm\rangle_y$
$\sigma_z |\pm\rangle = \pm |\pm\rangle$    (5)

with:

$|\pm\rangle_x = \frac{1}{\sqrt{2}} \left[ |+\rangle \pm |-\rangle \right]$
$|\pm\rangle_y = \frac{1}{\sqrt{2}} \left[ |+\rangle \pm i |-\rangle \right]$    (6)


2. Simple properties 

It is easy to see from definition (2) that the Pauli matrices verify the
relations:

$\mathrm{Det}(\sigma_j) = -1$ ,  $j = x, y$ or $z$    (7)

$\mathrm{Tr}(\sigma_j) = 0$    (8)

$\sigma_x^2 = \sigma_y^2 = \sigma_z^2 = I$  (where $I$ is the 2 x 2 unit matrix)    (9)

$\sigma_x \sigma_y = -\sigma_y \sigma_x = i\sigma_z$    (10)

as well as the equations which can be deduced from (10) by cyclic permutation
of $x$, $y$ and $z$.

Equations (9) and (10) are sometimes condensed into the form:

$\sigma_j \sigma_k = i \sum_l \varepsilon_{jkl}\, \sigma_l + \delta_{jk}\, I$    (11)

where $\varepsilon_{jkl}$ is antisymmetric with respect to the interchange of any two of its indices.
It is equal to:

$\varepsilon_{jkl} = 0$ if the indices $j, k, l$ are not all different
$\varepsilon_{jkl} = 1$ if $j, k, l$ is an even permutation of $x, y, z$
$\varepsilon_{jkl} = -1$ if $j, k, l$ is an odd permutation of $x, y, z$    (12)

From (10), we immediately conclude:

$[\sigma_x, \sigma_y] = 2i\sigma_z$    (13)

(and the relations obtained by cyclic permutation). This yields:

$[S_x, S_y] = i\hbar S_z$
$[S_y, S_z] = i\hbar S_x$
$[S_z, S_x] = i\hbar S_y$    (14)

We shall see later (cf. chap. VI) that equations (14) are characteristic of an
angular momentum.

We also see from (10) that:

$\sigma_x \sigma_y + \sigma_y \sigma_x = 0$    (15)

(the $\sigma_j$ matrices are said to anticommute with each other) and that, taking (9)
into account:

$\sigma_x \sigma_y \sigma_z = iI$    (16)



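Finally, let us mention an identity which is sometimes useful in quantum
mechanics. If $\mathbf{A}$ and $\mathbf{B}$ denote two vectors whose components are numbers (or
operators which commute with all operators acting in the two-dimensional spin
state space):

$(\boldsymbol{\sigma} \cdot \mathbf{A})(\boldsymbol{\sigma} \cdot \mathbf{B}) = \mathbf{A} \cdot \mathbf{B}\, I + i \boldsymbol{\sigma} \cdot (\mathbf{A} \times \mathbf{B})$    (17)

We see that, using formula (11) and the fact that $\mathbf{A}$ and $\mathbf{B}$ commute with $\boldsymbol{\sigma}$, we can
write:

$(\boldsymbol{\sigma} \cdot \mathbf{A})(\boldsymbol{\sigma} \cdot \mathbf{B}) = \sum_{j,k} \sigma_j A_j \sigma_k B_k$
$= \sum_{j,k} A_j B_k \left[ i \sum_l \varepsilon_{jkl}\, \sigma_l + \delta_{jk}\, I \right]$
$= \sum_l i\sigma_l \left[ \sum_{j,k} \varepsilon_{jkl} A_j B_k \right] + \sum_j A_j B_j\, I$

In the second term, we recognize the scalar product $\mathbf{A} \cdot \mathbf{B}$. In addition, it is easy
to see from (12) that $\sum_{j,k} \varepsilon_{jkl} A_j B_k$ is the $l$th component of the vector product
$\mathbf{A} \times \mathbf{B}$. This proves (17). Note that if $\mathbf{A}$ and $\mathbf{B}$ do not commute, they must
appear in the same order on both sides of the identity.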
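
The algebraic relations of this section are easy to confirm by explicit 2 x 2 matrix multiplication. The following sketch is an added illustration: it encodes the Pauli matrices as nested tuples, checks relations (9), (10), (15) and (16), and tests identity (17) for two vectors with ordinary numerical components. The helper names and the sample vectors are hypothetical.

```python
I2 = ((1, 0), (0, 1))
SX = ((0, 1), (1, 0))
SY = ((0, -1j), (1j, 0))
SZ = ((1, 0), (0, -1))


def matmul(a, b):
    """Product of two 2 x 2 matrices stored as nested tuples."""
    return tuple(
        tuple(sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )


def combo(coeffs, mats):
    """Linear combination sum_n coeffs[n] * mats[n] of 2 x 2 matrices."""
    return tuple(
        tuple(sum(c * m[i][j] for c, m in zip(coeffs, mats)) for j in range(2))
        for i in range(2)
    )


def close(a, b, tol=1e-12):
    return all(abs(a[i][j] - b[i][j]) < tol for i in range(2) for j in range(2))


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def sigma_dot(v):
    """The matrix sigma . v for a vector v with numerical components."""
    return combo(v, (SX, SY, SZ))


print(all(close(matmul(s, s), I2) for s in (SX, SY, SZ)))          # (9)
print(close(matmul(SX, SY), combo((1j,), (SZ,))))                  # (10)
print(close(combo((1, 1), (matmul(SX, SY), matmul(SY, SX))),
            ((0, 0), (0, 0))))                                     # (15)
print(close(matmul(matmul(SX, SY), SZ), combo((1j,), (I2,))))      # (16)

a_vec, b_vec = (1.0, -2.0, 0.5), (0.3, 0.7, -1.1)  # hypothetical vectors
lhs = matmul(sigma_dot(a_vec), sigma_dot(b_vec))
c_vec = cross(a_vec, b_vec)
rhs = combo((dot(a_vec, b_vec), 1j * c_vec[0], 1j * c_vec[1], 1j * c_vec[2]),
            (I2, SX, SY, SZ))
print(close(lhs, rhs))                                             # (17)
```

Because the components here are plain numbers, their ordering is immaterial; the remark above about non-commuting $\mathbf{A}$ and $\mathbf{B}$ is not exercised by this check.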


3. A convenient basis of the 2x2 matrix space 

Consider an arbitrary 2x2 matrix: 


m = r 11 m ' 2 


It can always be written as a linear combination of the four matrices : 

/, <t x , a r o z ( 20 ) 

since, using (2), we can immediately verify that: 


m i i + m 22 , , w,,-w 22 _ , m l2 + m 2l _ m i2 ~ m n 

--- / ~r --- rr ? + -r- fT, + l -r- 


S (21) 


Therefore, any 2x2 matrix can be put in the form:

$$M = a_0\, I + \mathbf{a} \cdot \boldsymbol{\sigma} \tag{22}$$

where the coefficients $a_0$, $a_x$, $a_y$ and $a_z$ are complex numbers.
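As an illustration, the coefficients of this expansion can be read off from the matrix elements and the matrix rebuilt from them. This is a small sketch in plain Python; the function names are hypothetical, and the Pauli matrices are assumed to be in their usual representation.

```python
def pauli_coefficients(m):
    """Coefficients (a0, ax, ay, az) such that M = a0*I + ax*sx + ay*sy + az*sz."""
    (m11, m12), (m21, m22) = m
    a0 = (m11 + m22) / 2
    ax = (m12 + m21) / 2
    ay = 1j * (m12 - m21) / 2
    az = (m11 - m22) / 2
    return a0, ax, ay, az

def rebuild(a0, ax, ay, az):
    """Assemble a0*I + ax*sigma_x + ay*sigma_y + az*sigma_z explicitly."""
    return [[a0 + az, ax - 1j * ay],
            [ax + 1j * ay, a0 - az]]

# A Hermitian example: the four coefficients come out real.
M = [[2, 1 - 3j], [1 + 3j, -1]]
coeffs = pauli_coefficients(M)
print(coeffs)
print(rebuild(*coeffs))
```

For a Hermitian input such as `M` above, the imaginary parts of all four coefficients vanish; for a general complex matrix they do not.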

Comparing (21) and (22), we see that M is Hermitian if and only if
the coefficients $a_0$ and $\mathbf{a}$ are real. These coefficients can be expressed formally in
terms of the matrix M in the following manner:

$$a_0 = \frac{1}{2}\, \mathrm{Tr}\,(M) \tag{23-a}$$

$$\mathbf{a} = \frac{1}{2}\, \mathrm{Tr}\,(M \boldsymbol{\sigma}) \tag{23-b}$$

These formulas can easily be proven from (8), (9) and (10).

Complement B_IV

DIAGONALIZATION OF A 2 x 2 HERMITIAN MATRIX 


1. Introduction 

2. Changing the eigenvalue origin 

3. Calculation of the eigenvalues and eigenvectors 

a. Angles θ and φ

b. Eigenvalues of K 

c. Eigenvalues of H 

d. Normalized eigenvectors of H 


1. Introduction 


In quantum mechanics, one must often diagonalize 2x2 matrices. When 
we need only the eigenvalues, it is very easy to solve the characteristic equation 
since it is of second degree. In principle, the calculation of the normalized 
eigenvectors is also extremely simple; however, if it is performed clumsily, it can 
lead to expressions which are unnecessarily complicated and difficult to handle. 

The goal of this complement is to present a simple method of calculation
which is applicable in all cases. After having changed the origin of the eigenvalues,
we introduce the angles θ and φ, defined in terms of the matrix elements, which
enable us to write the normalized eigenvectors in a simple, easy-to-use form.
The angles θ and φ also have an interesting physical interpretation in the
study of two-level systems, as we shall see in complement C_IV.


2. Changing the eigenvalue origin 


Consider the Hermitian matrix:

$$(H) = \begin{pmatrix} H_{11} & H_{12} \\ H_{21} & H_{22} \end{pmatrix} \tag{1}$$

$H_{11}$ and $H_{22}$ are real. Moreover:

$$H_{12} = H_{21}^{*} \tag{2}$$

The matrix (H) therefore represents, in an orthonormal basis $\{ |\varphi_1\rangle, |\varphi_2\rangle \}$,
a certain Hermitian operator H*.


* We use the letter H because the Hermitian operator which we are trying to diagonalize is often
a Hamiltonian. Nevertheless, the calculations presented in this complement can obviously be applied to
any 2x2 Hermitian matrix.

Using the half-sum and half-difference of the diagonal elements $H_{11}$ and
$H_{22}$, we can write (H) in the following way:

$$(H) = \begin{pmatrix} \frac{1}{2}(H_{11} + H_{22}) & 0 \\ 0 & \frac{1}{2}(H_{11} + H_{22}) \end{pmatrix} + \begin{pmatrix} \frac{1}{2}(H_{11} - H_{22}) & H_{12} \\ H_{21} & -\frac{1}{2}(H_{11} - H_{22}) \end{pmatrix} \tag{3}$$


It follows that the operator H itself can be decomposed into:

$$H = \frac{1}{2}(H_{11} + H_{22})\, \mathbb{1} + \frac{1}{2}(H_{11} - H_{22})\, K \tag{4}$$

where $\mathbb{1}$ is the identity operator and K is the Hermitian operator represented
in the $\{ |\varphi_1\rangle, |\varphi_2\rangle \}$ basis by the matrix:

$$(K) = \begin{pmatrix} 1 & \dfrac{2 H_{12}}{H_{11} - H_{22}} \\ \dfrac{2 H_{21}}{H_{11} - H_{22}} & -1 \end{pmatrix} \tag{5}$$

It is clear from (4) that H and K have the same eigenvectors. Let $|\psi_\pm\rangle$
be these eigenvectors, and $E_\pm$ and $\kappa_\pm$ the corresponding eigenvalues for H
and K:

$$H\, |\psi_\pm\rangle = E_\pm\, |\psi_\pm\rangle \tag{6}$$

$$K\, |\psi_\pm\rangle = \kappa_\pm\, |\psi_\pm\rangle \tag{7}$$

From (4), we immediately conclude that:

$$E_\pm = \frac{1}{2}(H_{11} + H_{22}) + \frac{1}{2}(H_{11} - H_{22})\, \kappa_\pm \tag{8}$$

Finally, the first matrix appearing on the right-hand side of (3) plays a
minor role: we could make it disappear by choosing for the eigenvalue origin
$(H_{11} + H_{22})/2$*.


3. Calculation of the eigenvalues and eigenvectors

a. ANGLES θ AND φ

Let θ and φ be the angles defined in terms of the matrix elements $H_{ij}$ by:

$$\tan\theta = \frac{2\, |H_{21}|}{H_{11} - H_{22}} \qquad \text{with } 0 \le \theta < \pi \tag{9}$$

$$H_{21} = |H_{21}|\, e^{i\varphi} \qquad \text{with } 0 \le \varphi < 2\pi \tag{10}$$

* Furthermore, this new origin is the same, whatever the basis $\{ |\varphi_1\rangle, |\varphi_2\rangle \}$ initially chosen,
since $H_{11} + H_{22} = \mathrm{Tr}\,(H)$ is invariant under a change of orthonormal bases.

φ is the argument of the complex number $H_{21}$. According to (2), we have
$|H_{12}| = |H_{21}|$ and:

$$H_{12} = |H_{12}|\, e^{-i\varphi} \tag{11}$$

If we use (9), (10) and (11), the matrix (K) becomes:

$$(K) = \begin{pmatrix} 1 & \tan\theta\; e^{-i\varphi} \\ \tan\theta\; e^{i\varphi} & -1 \end{pmatrix} \tag{12}$$


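b. EIGENVALUES OF K


The characteristic equation of the matrix (12):

$$\mathrm{Det}\,[(K) - \kappa I] = \kappa^2 - 1 - \tan^2\theta = 0 \tag{13}$$

directly yields the eigenvalues $\kappa_+$ and $\kappa_-$ of (K):

$$\kappa_+ = + \frac{1}{\cos\theta} \tag{14-a}$$

$$\kappa_- = - \frac{1}{\cos\theta} \tag{14-b}$$

We see that they are indeed real (property of a Hermitian matrix, cf. §D-2-a
of chapter II). If we want to express $1/\cos\theta$ in terms of $H_{ij}$, all we need to do
is use (9) and notice that $\cos\theta$ and $\tan\theta$ have the same sign since $0 \le \theta < \pi$:

$$\frac{1}{\cos\theta} = \frac{\sqrt{(H_{11} - H_{22})^2 + 4\, |H_{12}|^2}}{H_{11} - H_{22}} \tag{15}$$

c. EIGENVALUES OF H

Using (8), (14) and (15), we immediately obtain:

$$E_+ = \frac{1}{2}(H_{11} + H_{22}) + \frac{1}{2} \sqrt{(H_{11} - H_{22})^2 + 4\, |H_{12}|^2} \tag{16-a}$$

$$E_- = \frac{1}{2}(H_{11} + H_{22}) - \frac{1}{2} \sqrt{(H_{11} - H_{22})^2 + 4\, |H_{12}|^2} \tag{16-b}$$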

COMMENTS:

(i) As we have already said, the eigenvalues (16) can easily be obtained from
the characteristic equation of the matrix (H). If we need only the eigenvalues
of (H), it is therefore not necessary to introduce the angles θ and φ as we
have done here. On the other hand, we shall see in the following section
that this method is very useful when we need to use the normalized
eigenvectors of H.

(ii) It can be verified immediately from formulas (16) that:

$$E_+ + E_- = H_{11} + H_{22} = \mathrm{Tr}\,(H) \tag{17}$$

$$E_+ E_- = H_{11} H_{22} - |H_{12}|^2 = \mathrm{Det}\,(H) \tag{18}$$






(iii) To have $E_+ = E_-$, we must have $(H_{11} - H_{22})^2 + 4\, |H_{12}|^2 = 0$; that is,
$H_{11} = H_{22}$ and $H_{12} = H_{21} = 0$. A 2 x 2 Hermitian matrix with a degenerate
spectrum is therefore necessarily proportional to the unit matrix.
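The closed-form eigenvalues and the trace and determinant checks can be tried on a concrete matrix. The sketch below is plain Python with illustrative names; the sample values of the matrix elements are arbitrary.

```python
import math

def eigenvalues(h11, h22, h12):
    """E+ and E- of a 2x2 Hermitian matrix (h11, h22 real; h21 = conj(h12))."""
    mean = (h11 + h22) / 2
    half_gap = 0.5 * math.sqrt((h11 - h22) ** 2 + 4 * abs(h12) ** 2)
    return mean + half_gap, mean - half_gap

h11, h22, h12 = 1.0, -2.0, 2.0j
e_plus, e_minus = eigenvalues(h11, h22, h12)

trace = h11 + h22
determinant = h11 * h22 - abs(h12) ** 2

print(e_plus, e_minus)                  # 2.0 -3.0
print(e_plus + e_minus, trace)          # both -1.0
print(e_plus * e_minus, determinant)    # both -6.0
```

The degenerate case corresponds to a vanishing square root, which for real diagonal elements forces both `h11 == h22` and `h12 == 0`.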



COMMENT:

While the trigonometric functions of the angle θ can be expressed
rather simply in terms of the matrix elements $H_{ij}$ [see, for example,
formulas (9) and (15)], the same is not true of those of the angle θ/2.
Consequently, formulas (22) and (23) for the normalized eigenvectors
$|\psi_+\rangle$ and $|\psi_-\rangle$ become complicated when cos θ/2 and sin θ/2 are replaced
by their expressions in terms of $H_{ij}$; they are no longer very convenient.
It is better to use expressions (22) and (23) directly, keeping the functions
cos θ/2 and sin θ/2 during the entire calculation involving the normalized
eigenvectors of H. Furthermore, the final result of the calculation often
involves only functions of the angle θ (see, for example, the calculation of
§C-3-b of chapter IV) and, consequently, can be expressed simply in terms
of the $H_{ij}$. Expressions (22) and (23) thus enable us to carry out the
intermediate calculations elegantly, avoiding unnecessarily complicated
expressions. This is the advantage of the method presented in this complement.
Another advantage concerns the physical interpretation and will be discussed
in the next complement.
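A short numerical sketch can make the half-angle form concrete. It assumes the usual parametrization of the normalized eigenvectors, namely components (cos θ/2 e^{-iφ/2}, sin θ/2 e^{iφ/2}) for the upper eigenvalue and (-sin θ/2 e^{-iφ/2}, cos θ/2 e^{iφ/2}) for the lower one; expressions (22) and (23) are not reproduced in this passage, so this form is an assumption of the example. The use of `atan2` to pick θ is likewise a convenience of the sketch.

```python
import cmath
import math

def angles(h11, h22, h21):
    """theta from tan(theta) = 2|H21|/(H11 - H22), phi = arg(H21) in [0, 2*pi)."""
    theta = math.atan2(2 * abs(h21), h11 - h22)
    phi = cmath.phase(h21) % (2 * math.pi)
    return theta, phi

def eigenvectors(theta, phi):
    """Assumed half-angle form of the normalized eigenvectors (see lead-in)."""
    c, s = math.cos(theta / 2), math.sin(theta / 2)
    minus, plus = cmath.exp(-0.5j * phi), cmath.exp(0.5j * phi)
    return (c * minus, s * plus), (-s * minus, c * plus)

def residual(h, vec, energy):
    """Largest component of H|v> - E|v>; zero for an exact eigenvector."""
    out = [h[i][0] * vec[0] + h[i][1] * vec[1] - energy * vec[i] for i in range(2)]
    return max(abs(x) for x in out)

h11, h22, h12 = 1.0, -2.0, 2.0j
h = [[h11, h12], [h12.conjugate(), h22]]
theta, phi = angles(h11, h22, h[1][0])
psi_plus, psi_minus = eigenvectors(theta, phi)

mean = (h11 + h22) / 2
half_gap = 0.5 * math.sqrt((h11 - h22) ** 2 + 4 * abs(h12) ** 2)
res_plus = residual(h, psi_plus, mean + half_gap)
res_minus = residual(h, psi_minus, mean - half_gap)
print(res_plus, res_minus)  # both expected to vanish up to rounding
```

Only cos θ/2, sin θ/2 and the phase factors appear in the vectors; the matrix elements enter through θ and φ alone, which is the convenience the comment above describes.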


Complement C_IV

FICTITIOUS SPIN 1/2 ASSOCIATED WITH A TWO-LEVEL SYSTEM 


1. Introduction 

2. Interpretation of the Hamiltonian in terms of fictitious spin 

3. Geometrical interpretation of the various effects discussed in § C of chapter IV 

a. Fictitious magnetic fields associated with H 0 , W and H 

b. Effect of coupling on the eigenvalues and eigenvectors of the Hamiltonian 

c. Geometrical interpretation of $\mathcal{P}_{12}(t)$


1. Introduction 


Consider a two-level system whose Hamiltonian H is represented, in an
orthonormal basis $\{ |\varphi_1\rangle, |\varphi_2\rangle \}$, by the Hermitian matrix (H) [formula (1)
of complement B_IV]*. If we choose $(H_{11} + H_{22})/2$ as the new energy origin, the
matrix (H) becomes:

$$(H) = \begin{pmatrix} \frac{1}{2}(H_{11} - H_{22}) & H_{12} \\ H_{21} & -\frac{1}{2}(H_{11} - H_{22}) \end{pmatrix} \tag{1}$$


Although the two-level system under consideration is not necessarily a
spin 1/2, we can always associate with it a spin 1/2 whose Hamiltonian $\tilde{H}$
is represented by the same matrix (H) in the $\{ |+\rangle, |-\rangle \}$ basis of eigenstates
of the $S_z$ component of this spin. We shall see that (H) can then be interpreted
as describing the interaction of this “fictitious spin” with a static magnetic
field B, whose direction and modulus are very simply related to the parameters
introduced in the preceding complement in the discussion of the diagonalization
of (H). Thus it is possible to give a simple physical meaning to these parameters.

Moreover, if the Hamiltonian H is the sum H = H 0 + W of two operators, 
we shall see that we can associate with H, H 0 and W three magnetic fields, B, 
B 0 and b, such that B = B 0 + b. Introducing the coupling W is equivalent, 
in terms of fictitious spin, to adding the field b to B 0 . We shall show that 
this point of view enables us to interpret very simply the different effects 
studied in § C of chapter IV. 


2. Interpretation of the Hamiltonian 
in terms of fictitious spin 

We saw in chapter IV that the Hamiltonian $\tilde{H}$ of the coupling between
a spin 1/2 and a magnetic field B, of components $B_x$, $B_y$, $B_z$, can be written:

$$\tilde{H} = -\gamma\, \mathbf{B} \cdot \mathbf{S} = -\gamma\, (B_x S_x + B_y S_y + B_z S_z) \tag{2}$$


* We are using the same notation as in complement B_IV and chapter IV.

To calculate the matrix associated with this operator, we substitute into this relation
the matrices associated with $S_x$, $S_y$, $S_z$ [chap. IV, relations (A-15), (A-16), (A-17)].
This immediately yields:

$$(\tilde{H}) = -\frac{\gamma\hbar}{2} \begin{pmatrix} B_z & B_x - i B_y \\ B_x + i B_y & -B_z \end{pmatrix} \tag{3}$$


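Therefore, to make matrix (1) identical to $(\tilde{H})$, we must simply choose a
“fictitious field” B defined by:

$$B_x = -\frac{2}{\gamma\hbar}\, \mathrm{Re}\, H_{12}, \qquad B_y = \frac{2}{\gamma\hbar}\, \mathrm{Im}\, H_{12}, \qquad B_z = -\frac{1}{\gamma\hbar}\, (H_{11} - H_{22}) \tag{4}$$

Note that the modulus $B_\perp$ of the projection $\mathbf{B}_\perp$ of B onto the xOy plane is then
equal to:

$$B_\perp = \frac{2}{\hbar} \left| \frac{H_{12}}{\gamma} \right| \tag{5}$$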


According to formulas (9) and (10) of complement B_IV, the angles θ and φ
associated with the matrix $(H) = (\tilde{H})$ written in (3) are given by:

$$\tan\theta = \frac{|\gamma B_\perp|}{-\gamma B_z} \qquad 0 \le \theta < \pi$$

$$-\gamma\, (B_x + i B_y) = |\gamma B_\perp|\, e^{i\varphi} \qquad 0 \le \varphi < 2\pi \tag{6}$$


The gyromagnetic ratio γ is a simple calculation tool and can have an arbitrary
value. If we agree to choose γ negative, relations (6) show that the angles θ and φ
associated with the matrix (H) are simply the polar angles of the direction of
the field B (if we had chosen γ positive, they would be those of the opposite
direction).
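The statement about polar angles can be checked on an example. The sketch below works in units where ħ = 1 and takes γ = -1 (both are choices made for the example); the field components are obtained by identifying, element by element, the traceless two-level matrix with the matrix of -γ B.S. Function names are illustrative.

```python
import cmath
import math

HBAR = 1.0    # units chosen for the example
GAMMA = -1.0  # arbitrary negative gyromagnetic ratio, as suggested in the text

def fictitious_field(h11, h22, h12, gamma=GAMMA, hbar=HBAR):
    """Field components from -(gamma*hbar/2)*[[Bz, Bx-iBy],[Bx+iBy, -Bz]] = traceless (H)."""
    h12 = complex(h12)
    bx = -2 * h12.real / (gamma * hbar)
    by = 2 * h12.imag / (gamma * hbar)
    bz = -(h11 - h22) / (gamma * hbar)
    return bx, by, bz

def polar_angles(bx, by, bz):
    """Polar angle (from Oz) and azimuth (from Ox, in [0, 2*pi)) of the vector B."""
    theta = math.atan2(math.hypot(bx, by), bz)
    phi = math.atan2(by, bx) % (2 * math.pi)
    return theta, phi

def matrix_angles(h11, h22, h12):
    """Angles of complement B_IV: tan(theta) = 2|H21|/(H11-H22), phi = arg(H21)."""
    h21 = complex(h12).conjugate()
    theta = math.atan2(2 * abs(h21), h11 - h22)
    phi = cmath.phase(h21) % (2 * math.pi)
    return theta, phi

h11, h22, h12 = 1.0, -2.0, 1.0 + 2.0j
field = fictitious_field(h11, h22, h12)
print(field)                        # (2.0, -4.0, 3.0)
print(polar_angles(*field))
print(matrix_angles(h11, h22, h12))  # same pair of angles
```

With a positive `gamma` the computed field is reversed, so its polar angles are those of the opposite direction.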

Finally, we see that we can forget the two-level system with which we
started and consider the matrix (H) as representing, in the basis of the eigenstates
$|+\rangle$ and $|-\rangle$ of $S_z$, the Hamiltonian $\tilde{H}$ of a spin 1/2 placed in a field B whose
components are given by (4). $\tilde{H}$ can also be written:

$$\tilde{H} = \omega\, S_u \tag{7-a}$$

where $S_u$ is the operator S . u which describes the spin component along the
direction u, whose polar angles are θ and φ, and ω is the Larmor angular
velocity:

$$\omega = |\gamma|\, |\mathbf{B}| \tag{7-b}$$

The following table summarizes the various correspondences between the two-level
system and the associated fictitious spin 1/2.


Two-level system                        Fictitious spin 1/2

|φ_1>                                   |+>
|φ_2>                                   |->
|ψ_+>                                   |+>_u
|ψ_->                                   |->_u
E_+ - E_-                               ħω
Angles θ and φ introduced in B_IV       Polar angles of the fictitious field B
H_11 - H_22                             -γħ B_z
|H_21|                                  -γħ B_⊥/2


3. Geometrical interpretation of the various effects 
discussed in § C of chapter IV 


a. FICTITIOUS MAGNETIC FIELDS ASSOCIATED WITH H_0, W AND H


Assume, as in §C of chapter IV, that H appears as the sum of two
terms:

$$H = H_0 + W \tag{8}$$

In the $\{ |\varphi_1\rangle, |\varphi_2\rangle \}$ basis, the unperturbed Hamiltonian $H_0$ is represented
by a diagonal matrix which, with a suitable choice of the energy origin, is
written:

$$(H_0) = \begin{pmatrix} \dfrac{E_1 - E_2}{2} & 0 \\ 0 & -\dfrac{E_1 - E_2}{2} \end{pmatrix} \tag{9}$$

As far as the coupling W is concerned, we assume, as in §C of chapter IV,
that it is purely non-diagonal:

$$(W) = \begin{pmatrix} 0 & W_{12} \\ W_{21} & 0 \end{pmatrix} \tag{10}$$

The discussion of the preceding section then enables us to associate
with $(H_0)$ and $(W)$ two fields $\mathbf{B}_0$ and $\mathbf{b}$ such that [cf. formulas (4) and (5)]:

$$B_{0z} = \frac{E_2 - E_1}{\gamma\hbar}, \qquad B_{0x} = B_{0y} = 0 \tag{11}$$

$$b_x = -\frac{2}{\gamma\hbar}\, \mathrm{Re}\, W_{12}, \qquad b_y = \frac{2}{\gamma\hbar}\, \mathrm{Im}\, W_{12}, \qquad b_z = 0 \tag{12}$$

$\mathbf{B}_0$ is therefore parallel to Oz and proportional to $(E_1 - E_2)/2$; $\mathbf{b}$ is perpendicular
to Oz and proportional to $|W_{12}|$. Since $(H) = (H_0) + (W)$, the field B associated
with the total Hamiltonian is the vector sum of $\mathbf{B}_0$ and $\mathbf{b}$:

$$\mathbf{B} = \mathbf{B}_0 + \mathbf{b} \tag{13}$$

The three fields $\mathbf{B}_0$, $\mathbf{b}$ and $\mathbf{B}$ are shown in figure 1; the angle θ introduced
in §C-2-a of chapter IV is the angle between $\mathbf{B}_0$ and $\mathbf{B}$, since $\mathbf{B}_0$ is parallel
to Oz.



The strong coupling condition introduced in §C-2 of chapter IV
($|W_{12}| \gg |E_1 - E_2|$) is equivalent to $|\mathbf{b}| \gg |\mathbf{B}_0|$ (fig. 2-a). The weak coupling
condition ($|W_{12}| \ll |E_1 - E_2|$) is equivalent to $|\mathbf{b}| \ll |\mathbf{B}_0|$ (fig. 2-b).



FIGURE 2

Relative disposition of the fictitious fields $\mathbf{B}_0$, $\mathbf{b}$ and $\mathbf{B}$ in the case of strong coupling (fig. a) and
weak coupling (fig. b).

b. EFFECT OF COUPLING ON THE EIGENVALUES
AND EIGENVECTORS OF THE HAMILTONIAN

$E_1 - E_2$ and $E_+ - E_-$ correspond respectively to the Larmor angular
velocities $\omega_0 = |\gamma|\, |\mathbf{B}_0|$ and $\omega = |\gamma|\, |\mathbf{B}|$ in the fields $\mathbf{B}_0$ and $\mathbf{B}$. We see in figure 1
that $\mathbf{B}_0$, $\mathbf{b}$ and $\mathbf{B}$ form a right triangle whose hypotenuse is $\mathbf{B}$; we therefore
always have $|\mathbf{B}| > |\mathbf{B}_0|$, which again shows that $E_+ - E_-$ is always greater than
$|E_1 - E_2|$.

For a weak coupling (fig. 2-b), the difference between $|\mathbf{B}|$ and $|\mathbf{B}_0|$ is very
small in relative value, being of second order in $|\mathbf{b}|/|\mathbf{B}_0|$. From this we deduce
immediately that $E_+ - E_-$ and $E_1 - E_2$ differ in relative value by terms of
second order in $|W_{12}|/(E_1 - E_2)$. On the other hand, for a strong coupling
(fig. 2-a), $|\mathbf{B}|$ is much larger than $|\mathbf{B}_0|$ and practically equal to $|\mathbf{b}|$; $E_+ - E_-$ is
then much larger than $|E_1 - E_2|$ and practically proportional to $|W_{12}|$. We thus
find again all the results of §C-2 of chapter IV.

As far as the effect of the coupling on the eigenvectors is concerned, 
it can also be understood very simply from figures 1 and 2. The eigenvectors of H 
and H 0 are associated respectively with the eigenvectors of the components of S 
on the Ou and Oz axes. These two axes are practically parallel in the case of 
weak coupling (fig. 2-b) and perpendicular in the case of strong coupling 
(fig. 2-a). The eigenvectors of S u and S z , and, consequently, those of H and H 0 , 
are very close in the first case and very different in the second one. 

c. GEOMETRICAL INTERPRETATION OF $\mathcal{P}_{12}(t)$

In terms of fictitious spin, the problem considered in §C-3 of chapter IV
can be put in the following way: at time t = 0, the fictitious spin associated
with the two-level system is in the eigenstate $|+\rangle$ of $S_z$; $\mathbf{b}$ is added to $\mathbf{B}_0$; what
is the probability $\mathcal{P}_{+-}(t)$ of finding the spin in the state $|-\rangle$ at time t? With
the correspondences summarized in the table, $\mathcal{P}_{12}(t)$ must be identical to $\mathcal{P}_{+-}(t)$.



FIGURE 3

Geometrical interpretation of Rabi’s formula in terms of fictitious
spin. Under the effect of the coupling (represented
by $\mathbf{b}$), the spin, initially oriented along Oz, precesses about $\mathbf{B}$;
consequently, the probability of finding $-\hbar/2$ in a measurement
of its component $S_z$ on Oz is an oscillating function of
time.

The calculation of $\mathcal{P}_{+-}(t)$ is then very simple since the time evolution
of the spin reduces to a Larmor precession about $\mathbf{B}$ (fig. 3). During this
precession, the angle θ between the spin and the direction Ou of $\mathbf{B}$ remains
constant. At time t, the spin points in the direction On, making an angle α with
Oz; the angle formed by the (Oz, Ou) and (Ou, On) planes is equal to ωt. A
classical formula of spherical trigonometry enables us to write:

$$\cos\alpha = \cos^2\theta + \sin^2\theta\, \cos\omega t \tag{14}$$

Now, when the spin points in a direction which makes an angle of α with Oz,
the probability of finding it in the state $|-\rangle$ of $S_z$ is equal (cf. §B-2-b of
chapter IV) to $\sin^2 \alpha/2 = (1 - \cos\alpha)/2$. From this we deduce, using (14), that:

$$\mathcal{P}_{+-}(t) = \sin^2\frac{\alpha}{2} = \frac{1}{2}\, \sin^2\theta\, (1 - \cos\omega t) \tag{15}$$

This result is identical, when we replace ω by $(E_+ - E_-)/\hbar$, to formula (C-31) of
chapter IV (Rabi’s formula). We have thus given this formula a purely geometrical
interpretation.
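The geometrical result can be compared with a direct calculation of the transition probability. The sketch below takes ħ = 1 and a traceless Hamiltonian with diagonal elements ±`delta` and off-diagonal element `w` (names chosen for the example); since the square of such a matrix is a multiple of the identity, its evolution operator has a closed form, which is used here in place of a general matrix exponential.

```python
import math

def rabi_geometric(theta, omega, t):
    """Precession picture: P = (1/2) sin^2(theta) (1 - cos(omega t))."""
    return 0.5 * math.sin(theta) ** 2 * (1 - math.cos(omega * t))

def rabi_direct(delta, w, t):
    """|<phi_2| exp(-iHt) |phi_1>|^2 for H = [[delta, w], [conj(w), -delta]], hbar = 1.

    Because H*H = (delta^2 + |w|^2) * I, exp(-iHt) = cos(Wt) I - i sin(Wt) H / W
    with W = sqrt(delta^2 + |w|^2), so the (2,1) element is -i sin(Wt) conj(w) / W.
    """
    big_w = math.sqrt(delta ** 2 + abs(w) ** 2)
    amplitude = -1j * math.sin(big_w * t) * complex(w).conjugate() / big_w
    return abs(amplitude) ** 2

delta, w = 0.7, 0.4 - 0.3j
theta = math.atan2(abs(w), delta)                   # tan(theta) = 2|H21|/(H11 - H22)
omega = 2 * math.sqrt(delta ** 2 + abs(w) ** 2)     # (E+ - E-)/hbar

for t in (0.0, 0.5, 1.3, 4.0):
    print(t, rabi_geometric(theta, omega, t), rabi_direct(delta, w, t))
```

The two columns agree at every time, and the oscillation amplitude sin²θ reaches 1 only when `delta` vanishes, that is, when the fictitious field is perpendicular to Oz.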


References and suggestions for further reading: 

Abragam (14.1), chap. II, §F; Sargent et al. (15.5), §7-5; Allen (15.7), chap. 2;
see also the article by Feynman et al. (1.33).

Complement D_IV

SYSTEM OF TWO SPIN 1/2 PARTICLES 


1. Quantum mechanical description 

a. State space 

b. Complete sets of commuting observables 

c. The most general state 

2. Prediction of measurement results 

a. Measurements bearing simultaneously on the two spins 

b. Measurements bearing on one spin alone 

In this complement, we intend to use the formalism introduced in §A-2 of 
chapter IV to describe a system of two spin 1/2 particles. This case is hardly 
more complicated than that of a single spin 1/2 particle. Its interest, as far as the 
postulates are concerned, lies in the fact that none of the various spin observables 
alone constitutes a C.S.C.O. (while this is the case for one spin alone). Thus, 
we shall be able to consider measurements bearing either on one observable 
with a degenerate spectrum or simultaneously on two observables. In addition, 
this study provides a very simple illustration of the concept of a tensor product, 
introduced in §F of chapter II. We shall be concerned, as in chapter IV, only 
with the internal degrees of freedom (spin states), and we shall moreover assume 
that the two particles which constitute the system are not identical (systems of 
identical particles will be studied in a general way in chapter XIV). 


1. Quantum mechanical description 

We saw in chapter IV how to describe quantum mechanically the spin state 
of a spin 1/2 particle. Thus, all we need to do is apply the results of § F of 
chapter II in order to know how to describe systems of two spin 1/2 particles. 


a. STATE SPACE 

We shall use the indices 1 and 2 to distinguish between the two particles.
When particle (1) is alone, its spin state is defined by a ket which belongs to a
two-dimensional state space $\mathcal{E}_S(1)$. Similarly, the spin states of particle (2) alone
form a two-dimensional space $\mathcal{E}_S(2)$. We shall designate by $\mathbf{S}_1$ and $\mathbf{S}_2$ the spin
observables of particles (1) and (2) respectively. In $\mathcal{E}_S(1)$ [or $\mathcal{E}_S(2)$], we choose
as a basis the eigenkets of $S_{1z}$ (or $S_{2z}$), which we shall denote by $|1:+\rangle$ and
$|1:-\rangle$ (or $|2:+\rangle$ and $|2:-\rangle$). The most general ket of $\mathcal{E}_S(1)$ can be written:

$$|\varphi(1)\rangle = \alpha_1\, |1:+\rangle + \beta_1\, |1:-\rangle \tag{1}$$

and that of $\mathcal{E}_S(2)$:

$$|\chi(2)\rangle = \alpha_2\, |2:+\rangle + \beta_2\, |2:-\rangle \tag{2}$$

($\alpha_1$, $\beta_1$, $\alpha_2$, $\beta_2$ are arbitrary complex numbers).

When we join the two particles to make a single system, the state space $\mathcal{E}_S$ of
such a system is the tensor product of the two preceding spaces:

$$\mathcal{E}_S = \mathcal{E}_S(1) \otimes \mathcal{E}_S(2) \tag{3}$$

In the first place, this means that a basis of $\mathcal{E}_S$ can be obtained by multiplying
tensorially the two bases defined above for $\mathcal{E}_S(1)$ and $\mathcal{E}_S(2)$. We shall use
the following notation:

$$\begin{aligned} |++\rangle &= |1:+\rangle\, |2:+\rangle \\ |+-\rangle &= |1:+\rangle\, |2:-\rangle \\ |-+\rangle &= |1:-\rangle\, |2:+\rangle \\ |--\rangle &= |1:-\rangle\, |2:-\rangle \end{aligned} \tag{4}$$

In the state $|+-\rangle$, for example, the component along Oz of the spin of
particle (1) is $+\hbar/2$, with absolute certainty; that of the spin of particle (2) is
$-\hbar/2$, with absolute certainty. We shall agree here to denote by $\langle +-|$ the
conjugate bra of the ket $|+-\rangle$; the order of the symbols is therefore the
same in the ket and in the bra: the first symbol is always associated with particle (1)
and the second, with particle (2).

The space $\mathcal{E}_S$ is therefore four-dimensional. Since the $\{ |1:\pm\rangle \}$ and $\{ |2:\pm\rangle \}$
bases are orthonormal in $\mathcal{E}_S(1)$ and $\mathcal{E}_S(2)$ respectively, the basis (4) is orthonormal
in $\mathcal{E}_S$:

$$\langle \varepsilon_1' \varepsilon_2' | \varepsilon_1 \varepsilon_2 \rangle = \delta_{\varepsilon_1' \varepsilon_1}\, \delta_{\varepsilon_2' \varepsilon_2} \tag{5}$$

($\varepsilon_1$, $\varepsilon_2$, $\varepsilon_1'$, $\varepsilon_2'$ are to be replaced by + or - depending on the case; $\delta_{\varepsilon' \varepsilon}$ is equal to 1
if ε and ε' are identical and 0 if they are different). The system of vectors (4)
also satisfies a closure relation in $\mathcal{E}_S$:

$$\sum_{\varepsilon_1 \varepsilon_2} |\varepsilon_1 \varepsilon_2\rangle \langle \varepsilon_1 \varepsilon_2| = |++\rangle\langle ++| + |+-\rangle\langle +-| + |-+\rangle\langle -+| + |--\rangle\langle --| = \mathbb{1} \tag{6}$$

b. COMPLETE SETS OF COMMUTING OBSERVABLES 

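We extend into $\mathcal{E}_S$ the observables $\mathbf{S}_1$ and $\mathbf{S}_2$ which were originally defined
in $\mathcal{E}_S(1)$ and $\mathcal{E}_S(2)$ (as in chapter II, we shall continue to denote these extensions
by $\mathbf{S}_1$ and $\mathbf{S}_2$). Their action on the kets of the basis (4) is simple: the components
of $\mathbf{S}_1$, for example, act only on the part of the ket related to particle (1).
In particular, the vectors of the basis (4) are simultaneous eigenvectors of $S_{1z}$ and
$S_{2z}$:

$$\begin{aligned} S_{1z}\, |\varepsilon_1 \varepsilon_2\rangle &= \frac{\hbar}{2}\, \varepsilon_1\, |\varepsilon_1 \varepsilon_2\rangle \\ S_{2z}\, |\varepsilon_1 \varepsilon_2\rangle &= \frac{\hbar}{2}\, \varepsilon_2\, |\varepsilon_1 \varepsilon_2\rangle \end{aligned} \tag{7}$$

For the other components of $\mathbf{S}_1$ and $\mathbf{S}_2$, we apply the formulas given in §A-2 of
chapter IV. For example, we know from relation (A-16) of chapter IV how
$S_{1x}$ acts on the kets $|1:\pm\rangle$:

$$\begin{aligned} S_{1x}\, |1:+\rangle &= \frac{\hbar}{2}\, |1:-\rangle \\ S_{1x}\, |1:-\rangle &= \frac{\hbar}{2}\, |1:+\rangle \end{aligned} \tag{8}$$

From this we deduce the action of $S_{1x}$ on the kets (4):

$$\begin{aligned} S_{1x}\, |++\rangle &= \frac{\hbar}{2}\, |-+\rangle \\ S_{1x}\, |+-\rangle &= \frac{\hbar}{2}\, |--\rangle \\ S_{1x}\, |-+\rangle &= \frac{\hbar}{2}\, |++\rangle \\ S_{1x}\, |--\rangle &= \frac{\hbar}{2}\, |+-\rangle \end{aligned} \tag{9}$$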


It is then easy to verify that, although the three components of $\mathbf{S}_1$
(or of $\mathbf{S}_2$) do not commute with each other, any component of $\mathbf{S}_1$ commutes
with any component of $\mathbf{S}_2$.

In $\mathcal{E}_S(1)$, the observable $S_{1z}$ alone constituted a C.S.C.O., and the same
was true of $S_{2z}$ in $\mathcal{E}_S(2)$. In $\mathcal{E}_S$, the eigenvalues of $S_{1z}$ and $S_{2z}$ remain $\pm\hbar/2$, but
each of them is two-fold degenerate. To the eigenvalue $+\hbar/2$ of $S_{1z}$, for example,
correspond two orthogonal vectors, $|++\rangle$ and $|+-\rangle$ [formulas (7)], and all
their linear combinations. Therefore, in $\mathcal{E}_S$, neither $S_{1z}$ nor $S_{2z}$ (taken separately)
constitutes a C.S.C.O. On the other hand, the set $\{ S_{1z}, S_{2z} \}$ is a C.S.C.O.
in $\mathcal{E}_S$, as can be seen from formulas (7).
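These statements can be checked with explicit 4x4 matrices. The sketch below builds the extended operators as Kronecker products in the basis ordered |++>, |+->, |-+>, |-->, in units where ħ = 1; the helper names are illustrative, and the single-spin matrices are assumed to be the usual ones (half the Pauli matrices).

```python
def kron(a, b):
    """Kronecker product of two 2x2 matrices; rows and columns ordered (1st spin, 2nd spin)."""
    return [[a[i][j] * b[k][l] for j in range(2) for l in range(2)]
            for i in range(2) for k in range(2)]

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def commutator_norm(a, b):
    """Largest absolute entry of ab - ba."""
    ab, ba = matmul(a, b), matmul(b, a)
    n = len(a)
    return max(abs(ab[i][j] - ba[i][j]) for i in range(n) for j in range(n))

ID2 = [[1, 0], [0, 1]]
SPIN = [[[0, 0.5], [0.5, 0]],        # S_x (hbar = 1)
        [[0, -0.5j], [0.5j, 0]],     # S_y
        [[0.5, 0], [0, -0.5]]]       # S_z

S1 = [kron(s, ID2) for s in SPIN]    # components of the first spin, extended
S2 = [kron(ID2, s) for s in SPIN]    # components of the second spin, extended

# Every component of S1 commutes with every component of S2 ...
cross_commutators = [commutator_norm(a, b) for a in S1 for b in S2]
# ... but components of the same spin do not commute with each other.
same_spin = commutator_norm(S1[0], S1[1])

# S1z alone is degenerate; the pair (S1z, S2z) labels each basis vector uniquely.
s1z_values = [S1[2][n][n] for n in range(4)]
joint_labels = [(S1[2][n][n], S2[2][n][n]) for n in range(4)]
print(max(cross_commutators), same_spin)
print(s1z_values)
print(joint_labels)
```

The repeated entries in `s1z_values` show the two-fold degeneracy, while the four distinct pairs in `joint_labels` show that the two observables together remove it.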

This is obviously not the only C.S.C.O. that can be constructed. For
example, another one is $\{ S_{1z}, S_{2x} \}$. These two observables commute, as we noted
above, and each of them constitutes a C.S.C.O. in the space in which it was initially
defined. The eigenvectors which are common to $S_{1z}$ and $S_{2x}$ are obtained by
taking the tensor product of their respective eigenvectors in $\mathcal{E}_S(1)$ and $\mathcal{E}_S(2)$.
Using relation (A-20) of chapter IV, we find:

$$\begin{aligned} |1:+\rangle\, |2:+\rangle_x &= \frac{1}{\sqrt{2}}\, \big[ |++\rangle + |+-\rangle \big] \\ |1:+\rangle\, |2:-\rangle_x &= \frac{1}{\sqrt{2}}\, \big[ |++\rangle - |+-\rangle \big] \\ |1:-\rangle\, |2:+\rangle_x &= \frac{1}{\sqrt{2}}\, \big[ |-+\rangle + |--\rangle \big] \\ |1:-\rangle\, |2:-\rangle_x &= \frac{1}{\sqrt{2}}\, \big[ |-+\rangle - |--\rangle \big] \end{aligned} \tag{10}$$

c. THE MOST GENERAL STATE

The vectors (4) were obtained by multiplying tensorially a ket of $\mathcal{E}_S(1)$
and a ket of $\mathcal{E}_S(2)$. More generally, using an arbitrary ket of $\mathcal{E}_S(1)$ [such as (1)]
and an arbitrary ket of $\mathcal{E}_S(2)$ [such as (2)], we can construct a ket of $\mathcal{E}_S$:

$$|\varphi(1)\rangle\, |\chi(2)\rangle = \alpha_1 \alpha_2\, |++\rangle + \alpha_1 \beta_2\, |+-\rangle + \alpha_2 \beta_1\, |-+\rangle + \beta_1 \beta_2\, |--\rangle \tag{11}$$

The components of such a ket in the basis (4) are the products of the components
of $|\varphi(1)\rangle$ and $|\chi(2)\rangle$ in the bases of $\mathcal{E}_S(1)$ and $\mathcal{E}_S(2)$ which were used to
construct (4).

But not all the kets of $\mathcal{E}_S$ are tensor products. The most general ket of $\mathcal{E}_S$ is
an arbitrary linear combination of the basis vectors:

$$|\psi\rangle = \alpha\, |++\rangle + \beta\, |+-\rangle + \gamma\, |-+\rangle + \delta\, |--\rangle \tag{12}$$

If we want to normalize $|\psi\rangle$, we must choose:

$$|\alpha|^2 + |\beta|^2 + |\gamma|^2 + |\delta|^2 = 1 \tag{13}$$

Given $|\psi\rangle$, it is not in general possible to find two kets $|\varphi(1)\rangle$ and $|\chi(2)\rangle$
of which it is the tensor product. For (12) to be of the form (11), we must have,
in particular:

$$\frac{\alpha}{\beta} = \frac{\gamma}{\delta} \tag{14}$$

and this condition is not necessarily fulfilled.
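A quick way to test a given set of components is to compare the products of the outer and inner coefficients: for a tensor product with factors (a1, b1) and (a2, b2), the four components are a1*a2, a1*b2, b1*a2, b1*b2, so the first times the fourth equals the second times the third. The sketch below uses this cross-multiplied form (it avoids dividing by a component that may vanish); the function names and the tolerance are choices made for the example.

```python
import math

def tensor(a1, b1, a2, b2):
    """Components (on |++>, |+->, |-+>, |-->) of the product of two single-spin kets."""
    return (a1 * a2, a1 * b2, b1 * a2, b1 * b2)

def is_product_state(alpha, beta, gamma, delta, tol=1e-12):
    """True when alpha*delta equals beta*gamma within the tolerance."""
    return abs(alpha * delta - beta * gamma) <= tol

product = tensor(0.6, 0.8j, 1 / math.sqrt(2), -1 / math.sqrt(2))
entangled = (0.0, 1 / math.sqrt(2), -1 / math.sqrt(2), 0.0)

print(is_product_state(*product))    # True
print(is_product_state(*entangled))  # False
```

The second example, an antisymmetric combination of |+-> and |-+>, is normalized but cannot be factorized into one ket per particle.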


2. Prediction of the measurement results 

We are now going to envisage a certain number of measurements which 
can be performed on a system of two spin 1/2 particles and we shall calculate 
the predictions furnished by the postulates for each of them. We shall assume 
each time that the state of the system immediately before the measurement is 
described by the normalized ket (12). 


a. MEASUREMENTS BEARING SIMULTANEOUSLY ON THE TWO SPINS 

Since any component of $\mathbf{S}_1$ commutes with any component of $\mathbf{S}_2$, we can
envisage measuring them simultaneously (chap. III, §C-6-a). To calculate the
predictions related to such measurements, all we need to do is use the eigenvectors
common to the two observables.

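α. First example

First of all, let us assume that we are simultaneously measuring $S_{1z}$ and $S_{2z}$.
What are the probabilities of the various results that can be obtained?

Since the set $\{ S_{1z}, S_{2z} \}$ is a C.S.C.O., there exists only one state associated
with each measurement result. If the system is in the state (12) before the
measurement, we can therefore find:

$$\begin{aligned} +\frac{\hbar}{2} \text{ for } S_{1z} \text{ and } +\frac{\hbar}{2} \text{ for } S_{2z}, &\text{ with the probability } |\langle ++|\psi\rangle|^2 = |\alpha|^2 \\ +\frac{\hbar}{2} \text{ for } S_{1z} \text{ and } -\frac{\hbar}{2} \text{ for } S_{2z}, &\text{ with the probability } |\langle +-|\psi\rangle|^2 = |\beta|^2 \\ -\frac{\hbar}{2} \text{ for } S_{1z} \text{ and } +\frac{\hbar}{2} \text{ for } S_{2z}, &\text{ with the probability } |\langle -+|\psi\rangle|^2 = |\gamma|^2 \\ -\frac{\hbar}{2} \text{ for } S_{1z} \text{ and } -\frac{\hbar}{2} \text{ for } S_{2z}, &\text{ with the probability } |\langle --|\psi\rangle|^2 = |\delta|^2 \end{aligned} \tag{15}$$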


β. Second example

We now measure $S_{1y}$ and $S_{2z}$. What is the probability of obtaining $+\hbar/2$
for each of the two observables?

Here again, $\{ S_{1y}, S_{2z} \}$ constitutes a C.S.C.O. The eigenvector common
to $S_{1y}$ and $S_{2z}$ which corresponds to the eigenvalues $+\hbar/2$ and $+\hbar/2$ is the
tensor product of the vector $|1:+\rangle_y$ and the vector $|2:+\rangle$:

$$|1:+\rangle_y\, |2:+\rangle = \frac{1}{\sqrt{2}}\, \big[ |++\rangle + i\, |-+\rangle \big] \tag{16}$$

Applying the fourth postulate of chapter III, we find that the probability we are
looking for is:

$$\mathcal{P} = \left| \frac{1}{\sqrt{2}}\, \big[ \langle ++| - i\, \langle -+| \big]\, |\psi\rangle \right|^2 = \frac{1}{2}\, |\alpha - i\gamma|^2 \tag{17}$$

The result therefore appears in the form of a “square of a sum”*.

After the measurement, if we have actually found $+\hbar/2$ for $S_{1y}$ and $+\hbar/2$
for $S_{2z}$, the system is in the state (16).
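The role of the complex conjugation in the bra can be seen on a small example. This sketch stores a two-spin ket as a tuple of its four components on |++>, |+->, |-+>, |-->; the state used is an arbitrary normalized choice, and the function name is illustrative.

```python
import math

def prob_s1y_plus_s2z_plus(psi):
    """Probability of +hbar/2 for both S_1y and S_2z, from the common eigenvector."""
    r = 1 / math.sqrt(2)
    eigenket = (r, 0, 1j * r, 0)   # (|++> + i|-+>) / sqrt(2)
    # The bra carries the complex conjugates of the eigenket's components.
    amplitude = sum(complex(e).conjugate() * complex(c)
                    for e, c in zip(eigenket, psi))
    return abs(amplitude) ** 2

psi = (0.5, 0.5, 0.5j, 0.5)        # normalized: 4 * 0.25 = 1
alpha, gamma = psi[0], psi[2]

print(prob_s1y_plus_s2z_plus(psi))          # 0.5, up to rounding
print(0.5 * abs(alpha - 1j * gamma) ** 2)   # same value
print(0.5 * abs(alpha + 1j * gamma) ** 2)   # 0.0: what forgetting the conjugation would give
```

For this particular state the correct and the mistaken expressions differ as much as they can, 1/2 against 0.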


b. MEASUREMENTS BEARING ON ONE SPIN ALONE 

It is obviously possible to measure only one component of one of the two 
spins. In this case, since none of these components constitutes by itself a C.S.C.O., 
there exist several eigenvectors corresponding to the same measurement result, 
and the corresponding probability will be a “sum of squares”. 


* It must be remembered that the sign of i changes when we go from (16) to the conjugate bra. If
this were to be forgotten, the result obtained would be incorrect ($|\alpha + i\gamma|^2 \ne |\alpha - i\gamma|^2$ since $\alpha^* \gamma$ is not in
general real).

α. First example

We measure only $S_{1z}$. What results can be found, and with what probabilities?

The possible results are the eigenvalues $\pm\hbar/2$ of $S_{1z}$. Each of them is doubly
degenerate. In the associated eigensubspace, we choose an orthonormal basis:
we can, for example, take $\{ |++\rangle, |+-\rangle \}$ for $+\hbar/2$ and $\{ |-+\rangle, |--\rangle \}$
for $-\hbar/2$. We then obtain:

$$\begin{aligned} \mathcal{P}\left(+\frac{\hbar}{2}\right) &= |\langle ++|\psi\rangle|^2 + |\langle +-|\psi\rangle|^2 = |\alpha|^2 + |\beta|^2 \\ \mathcal{P}\left(-\frac{\hbar}{2}\right) &= |\langle -+|\psi\rangle|^2 + |\langle --|\psi\rangle|^2 = |\gamma|^2 + |\delta|^2 \end{aligned} \tag{18}$$

COMMENT:


Since we are not performing any measurement on the spin (2), the
choice of the basis in $\mathcal{E}_S(2)$ is arbitrary. We can, for example, choose as a basis
of the eigensubspace of $S_{1z}$ associated with the eigenvalue $+\hbar/2$ the vectors:

$$|1:+\rangle\, |2:\pm\rangle_x = \frac{1}{\sqrt{2}}\, \big[ |++\rangle \pm |+-\rangle \big] \tag{19}$$

which again gives us:

$$\mathcal{P}\left(+\frac{\hbar}{2}\right) = \frac{1}{2}\, |\alpha + \beta|^2 + \frac{1}{2}\, |\alpha - \beta|^2 = |\alpha|^2 + |\beta|^2 \tag{20}$$

The general proof of the fact that the probability obtained is independent
(in the case of a degenerate eigenvalue) of the choice of the basis
in the corresponding eigensubspace was given in §B-3-b-α of chapter III.


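β. Second example

Now it is $S_{2x}$ that we want to measure. What is the probability of
obtaining $-\hbar/2$?

The eigensubspace associated with the eigenvalue $-\hbar/2$ of $S_{2x}$ is two-dimensional.
We can choose for a basis in it:

$$\begin{aligned} |1:+\rangle\, |2:-\rangle_x &= \frac{1}{\sqrt{2}}\, \big[ |++\rangle - |+-\rangle \big] \\ |1:-\rangle\, |2:-\rangle_x &= \frac{1}{\sqrt{2}}\, \big[ |-+\rangle - |--\rangle \big] \end{aligned} \tag{21}$$

We then find:

$$\mathcal{P} = \left| \frac{1}{\sqrt{2}}\, \big[ \langle ++| - \langle +-| \big]\, |\psi\rangle \right|^2 + \left| \frac{1}{\sqrt{2}}\, \big[ \langle -+| - \langle --| \big]\, |\psi\rangle \right|^2 = \frac{1}{2}\, |\alpha - \beta|^2 + \frac{1}{2}\, |\gamma - \delta|^2 \tag{22}$$

In this result, each of the terms of the “sum of squares” is itself the “square of
a sum”.

If the measurement actually yields $-\hbar/2$, the state $|\psi'\rangle$ of the system
immediately after this measurement is the (normalized) projection of $|\psi\rangle$ onto
the corresponding eigensubspace. We have just calculated the components of $|\psi\rangle$
on the basis vectors (21) of this subspace: they are equal, respectively, to $\frac{1}{\sqrt{2}}(\alpha - \beta)$
and $\frac{1}{\sqrt{2}}(\gamma - \delta)$. Consequently:

$$|\psi'\rangle = \frac{1}{\sqrt{\frac{1}{2}\, |\alpha - \beta|^2 + \frac{1}{2}\, |\gamma - \delta|^2}}\, \left[ \frac{1}{2}\, (\alpha - \beta)\, \big( |++\rangle - |+-\rangle \big) + \frac{1}{2}\, (\gamma - \delta)\, \big( |-+\rangle - |--\rangle \big) \right] \tag{23}$$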
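The probability and the state after the measurement can be computed together by projecting onto the two basis vectors of the eigensubspace. A minimal sketch, with kets stored as tuples of components on |++>, |+->, |-+>, |-->; the function name and the sample state are choices made for the example.

```python
import math

def measure_s2x_minus(psi):
    """Probability of -hbar/2 for S_2x and the normalized state after that result."""
    alpha, beta, gamma, delta = psi
    r = 1 / math.sqrt(2)
    c1 = r * (alpha - beta)     # component on (|++> - |+->)/sqrt(2)
    c2 = r * (gamma - delta)    # component on (|-+> - |-->)/sqrt(2)
    prob = abs(c1) ** 2 + abs(c2) ** 2
    if prob == 0:
        return 0.0, None        # this result cannot occur
    projection = (c1 * r, -c1 * r, c2 * r, -c2 * r)
    norm = math.sqrt(prob)
    return prob, tuple(x / norm for x in projection)

psi = (0.5, -0.5, 0.5, 0.5)     # normalized example state
prob, post = measure_s2x_minus(psi)
print(prob)   # 0.5, up to rounding
print(post)   # proportional to (1, -1, 0, 0)
```

In this example the part of the state with the first spin down lies entirely in the other eigensubspace of the measured component, so it disappears from the state after the measurement.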


COMMENT:

We have considered, in this complement, only the components of $\mathbf{S}_1$
and $\mathbf{S}_2$ on the coordinate axes. It is obviously possible to measure their
components $\mathbf{S}_1 \cdot \mathbf{u}$ and $\mathbf{S}_2 \cdot \mathbf{v}$ on arbitrary unit vectors u and v. The reasoning
is the same as above.

SPIN 1/2 DENSITY MATRIX


1. Introduction 

2. Density matrix of a perfectly polarized spin (pure case) 

3. Example of a statistical mixture: unpolarized spin 

4. Spin 1/2 in thermodynamic equilibrium in a static field 

5. Expansion of the density matrix in terms of the Pauli matrices 

1. Introduction 

The aim of this complement is to illustrate the general considerations
developed in complement E_III, using a very simple physical system, that of
a spin 1/2. We are going to study the density matrices which describe a spin 1/2
in a certain number of cases: perfectly polarized spin (pure case), unpolarized
or partially polarized spin (statistical mixture). We shall thus be able to verify and
interpret the general properties stated in complement E_III. In addition, we shall
see that the expansion of the density matrix in terms of the Pauli matrices can be
expressed very simply as a function of the mean values of the various spin
components.


2. Density matrix of a perfectly polarized spin (pure case) 


Consider a spin 1/2, coming out of an “atomic polarizer” of the type described
in §B of chapter IV, which is in the eigenstate $|+\rangle_u$ (eigenvalue $+\hbar/2$) of the S . u
component of the spin (recall that the polar angles of the unit vector u are θ and φ).
The spin state is then perfectly well-known and is written [cf. formula (A-22-a)
of chapter IV]:

$$|\psi\rangle = \cos\frac{\theta}{2}\, e^{-i\varphi/2}\, |+\rangle + \sin\frac{\theta}{2}\, e^{i\varphi/2}\, |-\rangle \tag{1}$$

We saw in complement E_III that, by definition, such a situation corresponds to
a pure case. We shall say that the beam which leaves the “polarizer” is perfectly
polarized. Recall also that, for each spin, the mean value $\langle \mathbf{S} \rangle$ is equal
to $\frac{\hbar}{2}\, \mathbf{u}$ [chap. IV, relations (B-14)].

It is simple to write, in the $\{ |+\rangle, |-\rangle \}$ basis, the density matrix ρ(θ, φ)
corresponding to the state (1). We write the matrix of the projector onto this state:

$$\rho(\theta, \varphi) = \begin{pmatrix} \cos^2\dfrac{\theta}{2} & \sin\dfrac{\theta}{2}\, \cos\dfrac{\theta}{2}\, e^{-i\varphi} \\ \sin\dfrac{\theta}{2}\, \cos\dfrac{\theta}{2}\, e^{i\varphi} & \sin^2\dfrac{\theta}{2} \end{pmatrix} \tag{2}$$

This matrix is generally non-diagonal. The “populations” $\rho_{++}$ and $\rho_{--}$ have
a very simple physical significance. Their difference is equal to $\cos\theta = 2 \langle S_z \rangle / \hbar$
[cf. equations (B-14) of chapter IV], and their sum is, of course, equal to 1. The
populations are therefore related to the longitudinal polarization $\langle S_z \rangle$. Similarly, the
modulus of the “coherences” $\rho_{+-}$ and $\rho_{-+}$ is $|\rho_{+-}| = |\rho_{-+}| = \frac{1}{2} \sin\theta = \frac{1}{\hbar}\, |\langle \mathbf{S}_\perp \rangle|$
(where $\langle \mathbf{S}_\perp \rangle$ is the projection of $\langle \mathbf{S} \rangle$ onto the xOy plane). The argument of $\rho_{-+}$
is φ, that is, the angle between $\langle \mathbf{S}_\perp \rangle$ and Ox: the coherences are therefore related
to the transverse polarization $\langle \mathbf{S}_\perp \rangle$.

It can also be verified that:

$$[\rho(\theta, \varphi)]^2 = \rho(\theta, \varphi) \tag{3}$$

a relation characteristic of a pure state.
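These properties are simple to confirm numerically for a particular direction. The sketch below builds the projector from the two angles, with the matrix stored as nested lists; names and sample angles are illustrative.

```python
import cmath
import math

def rho_pure(theta, phi):
    """Density matrix of the spin state polarized along the direction (theta, phi)."""
    c, s = math.cos(theta / 2), math.sin(theta / 2)
    return [[c * c, s * c * cmath.exp(-1j * phi)],
            [s * c * cmath.exp(1j * phi), s * s]]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

theta, phi = 1.1, 2.3
rho = rho_pure(theta, phi)
rho_squared = matmul(rho, rho)

population_difference = rho[0][0] - rho[1][1]   # expected: cos(theta)
coherence_modulus = abs(rho[0][1])              # expected: sin(theta)/2
coherence_argument = cmath.phase(rho[1][0])     # expected: phi (for phi below pi)

print(population_difference, math.cos(theta))
print(coherence_modulus, 0.5 * math.sin(theta))
print(coherence_argument, phi)
print(max(abs(rho_squared[i][j] - rho[i][j]) for i in range(2) for j in range(2)))
```

The last printed number is the largest entry of the difference between the squared matrix and the matrix itself, which vanishes up to rounding for any choice of the two angles.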


3. Example of a statistical mixture: unpolarized spin 


Now let us consider the spin of a silver atom leaving a furnace, such as the
one in figure 1 of chapter IV, and which has not passed through an “atomic
polarizer” (the spin has not been prepared in a particular state). The only
information we then possess about this spin is the following: it can point in any
direction of space, and all directions are equally probable. With the notation of
complement E_III, such a situation corresponds to a statistical mixture of the
states $|+\rangle_u$ with equal weights. Formula (28) of complement E_III defines the
density matrix ρ which corresponds to this case. Nevertheless, the discrete
sum $\sum_k$ must here be replaced by an integral over all the possible directions:

$$\rho = \frac{1}{4\pi} \int d\Omega\; \rho(\theta, \varphi) = \frac{1}{4\pi} \int_0^{2\pi} d\varphi \int_0^{\pi} \sin\theta\, d\theta\; \rho(\theta, \varphi) \tag{4}$$

(the factor 1/4π ensures the normalization of the probabilities associated with the
various directions). The integrals which give the matrix elements of ρ are simple to
calculate and lead to the following result:

$$\rho = \begin{pmatrix} 1/2 & 0 \\ 0 & 1/2 \end{pmatrix} \tag{5}$$



It is easy to deduce from (5) that $\rho^2 = \rho/2$, which shows that, in the case of a
statistical mixture of states, $\rho^2$ is different from $\rho$.
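The angular average (4) can also be estimated numerically. The following sketch uses a simple midpoint rule; the grid sizes and function names are assumptions made for the illustration, and the projector is assumed to have the form given in (2).

```python
import cmath
import math


def projector(theta, phi):
    """Density matrix of a spin pointing along the direction (theta, phi)."""
    c = (math.cos(theta / 2) * cmath.exp(-1j * phi / 2),
         math.sin(theta / 2) * cmath.exp(+1j * phi / 2))
    return [[c[i] * c[j].conjugate() for j in range(2)] for i in range(2)]


def unpolarized_density_matrix(n_theta=100, n_phi=100):
    """Midpoint-rule estimate of (1/4pi) * integral of sin(theta) rho(theta, phi)."""
    rho = [[0j, 0j], [0j, 0j]]
    d_theta = math.pi / n_theta
    d_phi = 2 * math.pi / n_phi
    for a in range(n_theta):
        theta = (a + 0.5) * d_theta
        weight = math.sin(theta) * d_theta * d_phi / (4 * math.pi)
        for b in range(n_phi):
            p = projector(theta, (b + 0.5) * d_phi)
            for i in range(2):
                for j in range(2):
                    rho[i][j] += weight * p[i][j]
    return rho


rho = unpolarized_density_matrix()
# Diagonal element of rho squared, to be compared with rho[0][0] / 2
rho_squared_00 = rho[0][0] * rho[0][0] + rho[0][1] * rho[1][0]
```

Within the discretization error, the populations come out equal to 1/2 and the coherences vanish, in agreement with (5).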

In addition, if we calculate from (5) the mean values of $S_x$, $S_y$, $S_z$, we obtain:

$$\langle S_i\rangle = \mathrm{Tr}[\rho S_i] = \frac{1}{2}\mathrm{Tr}\,S_i = 0 \qquad i = x, y, z \tag{6}$$

We again find the fact that the spin is unpolarized: since all the directions are
equivalent, the mean value of the spin is zero.

Comments:

(i) It is clear from this example how the non-diagonal elements (coherences)
of $\rho$ can disappear from the summation over the various states of the
statistical mixture. As we saw in § 2, the coherences $\rho_{+-}$ and $\rho_{-+}$ are related
to the transverse polarization $\langle \mathbf{S}_\perp\rangle$ of the spin. Upon summing the
vectors $\langle \mathbf{S}_\perp\rangle$ corresponding to all (equiprobable) directions of the xOy plane,
we obviously find a null result.

(ii) The case of unpolarized spin is also very instructive, since it helps us to
understand the impossibility of describing a statistical mixture by an "average
state vector". Assume that we are trying to choose $\alpha$ and $\beta$ so that the vector:

$$|\psi\rangle = \alpha|+\rangle + \beta|-\rangle \tag{7}$$

with:

$$|\alpha|^2 + |\beta|^2 = 1 \tag{8}$$

represents an unpolarized spin, for which $\langle S_x\rangle$, $\langle S_y\rangle$ and $\langle S_z\rangle$ are zero.
A simple calculation gives:

$$\langle S_x\rangle = \frac{\hbar}{2}(\alpha^*\beta + \alpha\beta^*)$$
$$\langle S_y\rangle = \frac{\hbar}{2i}(\alpha^*\beta - \alpha\beta^*) \tag{9}$$
$$\langle S_z\rangle = \frac{\hbar}{2}(\alpha^*\alpha - \beta^*\beta)$$

If we want to make $\langle S_x\rangle$ zero, we must choose $\alpha$ and $\beta$ so as to make $\alpha^*\beta$ a
pure imaginary; similarly, $\alpha^*\beta$ must be real for $\langle S_y\rangle$ to be zero. We must
therefore have $\alpha^*\beta = 0$; that is:

either $\alpha = 0$, which implies $|\beta| = 1$ and $\langle S_z\rangle = -\hbar/2$

or $\beta = 0$, which implies $|\alpha| = 1$ and $\langle S_z\rangle = \hbar/2$

Therefore, $\langle S_z\rangle$, $\langle S_x\rangle$ and $\langle S_y\rangle$ cannot all be zero at the same time;
consequently, the state (7) cannot represent an unpolarized spin.

Furthermore, the discussion of § B-1-c of chapter IV shows that for
any $\alpha$ and $\beta$ which satisfy (8), one can always associate with them two
angles $\theta$ and $\varphi$ which fix a direction $\mathbf{u}$ such that $|\psi\rangle$ is an eigenvector of $\mathbf{S}\cdot\mathbf{u}$
with the eigenvalue $+\hbar/2$. Thus we see directly that a state such as (7) always
describes a spin which is perfectly polarized in a certain direction of space.
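A quick numerical illustration of this point: for any normalized pair $(\alpha, \beta)$, the vector $\langle \mathbf{S}\rangle$ computed from the expressions for $\langle S_x\rangle$, $\langle S_y\rangle$, $\langle S_z\rangle$ has modulus $\hbar/2$, so it can never vanish. The sketch below works in units where $\hbar = 1$; the random sampling and the names used are illustrative assumptions only.

```python
import math
import random

HBAR = 1.0  # units where hbar = 1 (assumption made for convenience)


def spin_mean_values(alpha, beta):
    """<S_x>, <S_y>, <S_z> for the state alpha|+> + beta|->."""
    cross = alpha.conjugate() * beta
    return (HBAR * cross.real,   # (hbar/2)(a*b + a b*)
            HBAR * cross.imag,   # (hbar/2i)(a*b - a b*)
            0.5 * HBAR * (abs(alpha) ** 2 - abs(beta) ** 2))


random.seed(0)
moduli = []
for _ in range(5):
    alpha = complex(random.gauss(0, 1), random.gauss(0, 1))
    beta = complex(random.gauss(0, 1), random.gauss(0, 1))
    norm = math.sqrt(abs(alpha) ** 2 + abs(beta) ** 2)
    alpha, beta = alpha / norm, beta / norm  # enforce the normalization condition
    sx, sy, sz = spin_mean_values(alpha, beta)
    moduli.append(math.sqrt(sx ** 2 + sy ** 2 + sz ** 2))
```

Every entry of `moduli` equals `HBAR / 2` up to rounding errors, whatever the random draw.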

(iii) The density matrix (5) represents a statistical mixture of the various
states $|+\rangle_u$, all the directions $\mathbf{u}$ being equiprobable (this is how we obtained
it). We could, however, imagine other statistical mixtures which would
lead to the same density matrix: for example, a statistical mixture of equal
proportions of the states $|+\rangle$ and $|-\rangle$, or a statistical mixture of equal
proportions of three states $|+\rangle_u$ such that the tips of the three corresponding
vectors $\mathbf{u}$ are the vertices of an equilateral triangle centered at O. Thus we see
that the same density matrix can be obtained in several different ways. In
fact, since all the physical predictions depend only on the density matrix,
it is impossible to distinguish physically between the various types of statistical
mixtures which lead to the same density matrix. They must be considered
to be different expressions for the same incomplete information which we
possess about the system.


4. Spin 1/2 in thermodynamic equilibrium 
in a static field 


Consider a spin 1/2 placed in a static field $\mathbf{B}_0$ parallel to Oz. We saw in § B-3-a
of chapter IV that the stationary states of this spin are the states $|+\rangle$ and $|-\rangle$,
of energies $+\hbar\omega_0/2$ and $-\hbar\omega_0/2$ (with $\omega_0 = -\gamma B_0$, where $\gamma$ is the gyromagnetic
ratio of the spin). If we know only that the system is in thermodynamic equilibrium
at the temperature T, we can assert that it has a probability $Z^{-1}e^{-\hbar\omega_0/2kT}$ of being
in the state $|+\rangle$ and $Z^{-1}e^{+\hbar\omega_0/2kT}$ of being in the state $|-\rangle$, where
$Z = e^{-\hbar\omega_0/2kT} + e^{+\hbar\omega_0/2kT}$ is a normalization factor (Z is called the "partition
function"). We have here another example of a statistical mixture, described by
the density matrix:

$$\rho = Z^{-1}\begin{pmatrix} e^{-\hbar\omega_0/2kT} & 0 \\ 0 & e^{+\hbar\omega_0/2kT} \end{pmatrix} \tag{10}$$


Once more, it is easy to verify that $\rho^2 \neq \rho$. The non-diagonal elements are zero
since all directions perpendicular to $\mathbf{B}_0$ (that is, to Oz) and fixed by the angle $\varphi$ are
equivalent.

From (10), it is easy to calculate:

$$\langle S_x\rangle = \mathrm{Tr}(\rho S_x) = 0$$
$$\langle S_y\rangle = \mathrm{Tr}(\rho S_y) = 0 \tag{11}$$
$$\langle S_z\rangle = \mathrm{Tr}(\rho S_z) = -\frac{\hbar}{2}\tanh\left(\frac{\hbar\omega_0}{2kT}\right)$$

We see that the spin acquires a polarization parallel to the field in which it is placed.
The larger $\omega_0$ (that is, $B_0$) and the lower the temperature T, the greater the
polarization. Since $|\tanh x| < 1$, this polarization is less than the value $\hbar/2$ which
corresponds to a spin which is perfectly polarized along Oz. (10) can therefore be
said to describe a spin which is "partially polarized" along Oz.
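As an illustration, the populations of (10) and the resulting polarization can be evaluated as functions of the dimensionless ratio $x = \hbar\omega_0/2kT$. The sketch below is a minimal check, with hypothetical function names, that the population difference reproduces the hyperbolic tangent and never reaches complete polarization.

```python
import math


def thermal_populations(x):
    """Populations of |+> and |-> in (10), with x = hbar * omega_0 / (2 k T)."""
    z = math.exp(-x) + math.exp(x)  # partition function
    return math.exp(-x) / z, math.exp(x) / z


def polarization(x):
    """<S_z> expressed in units of hbar / 2."""
    p_plus, p_minus = thermal_populations(x)
    return p_plus - p_minus


def purity(x):
    """Tr(rho^2); it equals 1 only for a pure state."""
    p_plus, p_minus = thermal_populations(x)
    return p_plus ** 2 + p_minus ** 2


samples = [0.1, 1.0, 3.0]  # arbitrary values of x
polarizations = [polarization(x) for x in samples]
purities = [purity(x) for x in samples]
```

For each sample, `polarization(x)` equals `-math.tanh(x)` and `purity(x)` stays strictly below 1, consistent with a partially polarized statistical mixture.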


comment : 

The magnetization $\langle M_z\rangle$ is equal to $\gamma\langle S_z\rangle$. It is possible to calculate
from (11) the paramagnetic susceptibility $\chi$ of the spin, defined by:

$$\langle M_z\rangle = \gamma\langle S_z\rangle = \chi B_0 \tag{12}$$

We find (Brillouin's formula):

$$\chi = \frac{\hbar\gamma}{2B_0}\tanh\left(\frac{\hbar\gamma B_0}{2kT}\right) \tag{13}$$


5. Expansion of the density matrix
in terms of the Pauli matrices 

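We saw in complement A_IV that the unit matrix $I$ and the Pauli matrices $\sigma_x$,
$\sigma_y$ and $\sigma_z$ form a convenient basis for expanding a 2 x 2 matrix. We therefore set,
for the density matrix $\rho$ of a spin 1/2:

$$\rho = a_0 I + \mathbf{a}\cdot\boldsymbol{\sigma} \tag{14}$$

where the coefficients $a_i$ are given by [cf. complement A_IV, relations (23)]:

$$a_0 = \frac{1}{2}\mathrm{Tr}\,\rho$$
$$a_x = \frac{1}{2}\mathrm{Tr}(\rho\sigma_x) = \frac{1}{\hbar}\mathrm{Tr}(\rho S_x)$$
$$a_y = \frac{1}{2}\mathrm{Tr}(\rho\sigma_y) = \frac{1}{\hbar}\mathrm{Tr}(\rho S_y) \tag{15}$$
$$a_z = \frac{1}{2}\mathrm{Tr}(\rho\sigma_z) = \frac{1}{\hbar}\mathrm{Tr}(\rho S_z)$$

Thus we have:

$$a_0 = \frac{1}{2}, \qquad \mathbf{a} = \frac{1}{\hbar}\langle \mathbf{S}\rangle \tag{16}$$

and $\rho$ can be written:

$$\rho = \frac{1}{2}I + \frac{1}{\hbar}\langle \mathbf{S}\rangle\cdot\boldsymbol{\sigma} \tag{17}$$

Therefore, the density matrix $\rho$ of a spin 1/2 can be expressed very simply in
terms of the mean value $\langle \mathbf{S}\rangle$ of the spin.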


comment: 

Let us square expression (17). We obtain, using identity (17) of
complement A_IV:

$$\rho^2 = \frac{1}{4}I + \frac{1}{\hbar^2}\langle \mathbf{S}\rangle^2 I + \frac{1}{\hbar}\langle \mathbf{S}\rangle\cdot\boldsymbol{\sigma} \tag{18}$$

The condition $\rho^2 = \rho$, characteristic of the pure case, is therefore equivalent,
for a spin 1/2, to the condition:

$$\langle \mathbf{S}\rangle^2 = \frac{\hbar^2}{4} \tag{19}$$

This condition is obviously not satisfied for an unpolarized spin ($\langle \mathbf{S}\rangle$ is then zero)
or for a spin in thermodynamic equilibrium (we saw in § 4 that in this
case $|\langle \mathbf{S}\rangle| < \hbar/2$). On the other hand, it can be verified, using formulas (B-14)
of chapter IV, that, for a spin in the state $|\psi\rangle$ given in (1), $\langle \mathbf{S}\rangle^2$ is indeed equal
to $\hbar^2/4$.
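The equivalence between $\rho^2 = \rho$ and $\langle \mathbf{S}\rangle^2 = \hbar^2/4$ is easy to test on a few examples. The sketch below builds $\rho$ from a given $\langle \mathbf{S}\rangle$ (expressed in units of $\hbar$) using the expansion on the Pauli matrices, and measures how far $\rho^2$ is from $\rho$; the helper names and the sample vectors are assumptions chosen for the illustration.

```python
import math


def density_matrix_from_spin(sx, sy, sz):
    """rho = I/2 + <S>.sigma / hbar, with <S> given in units of hbar."""
    return [[0.5 + sz, sx - 1j * sy],
            [sx + 1j * sy, 0.5 - sz]]


def idempotency_defect(rho):
    """Largest modulus among the matrix elements of rho^2 - rho."""
    squared = [[sum(rho[i][k] * rho[k][j] for k in range(2)) for j in range(2)]
               for i in range(2)]
    return max(abs(squared[i][j] - rho[i][j]) for i in range(2) for j in range(2))


pure_defect = idempotency_defect(density_matrix_from_spin(0.3, 0.0, 0.4))  # |<S>| = 1/2
unpolarized_defect = idempotency_defect(density_matrix_from_spin(0.0, 0.0, 0.0))
thermal_defect = idempotency_defect(
    density_matrix_from_spin(0.0, 0.0, -0.5 * math.tanh(1.0)))  # hbar*omega_0/2kT = 1
```

Only the first case, for which $|\langle \mathbf{S}\rangle| = \hbar/2$, gives a vanishing defect; the other two are statistical mixtures.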


References and suggestions for further reading: 

Abragam (14.1), chap. II, §C.


Complement F_IV

SPIN 1/2 PARTICLE IN A STATIC MAGNETIC FIELD 
AND A ROTATING FIELD: MAGNETIC RESONANCE 


1. Classical treatment; rotating reference frame 

a. Motion in a static field; Larmor precession 

b. Influence of a rotating field; resonance 

2. Quantum mechanical treatment 

a. The Schrodinger equation 

b. Changing to the rotating frame 

c. Transition probability; Rabi’s formula 

d. Case where the two sublevels are unstable 

3. Relation between the classical treatment and the quantum mechanical treatment: evolution of < M > 

4. Bloch’s equations 

a. A concrete example 

b. Solution in the case of a rotating field 

In chapter IV, we used quantum mechanics to study the evolution of a spin 1/2 
in a static magnetic field. In this complement, we shall consider the case of a 
spin 1/2 simultaneously subjected to several magnetic fields, some of which can 
be time dependent, as is the case in magnetic resonance experiments. Before 
attacking this problem quantum mechanically, we shall briefly review several 
results obtained using classical mechanics. 


1. Classical treatment; rotating reference frame 

a. MOTION IN A STATIC FIELD ; LARMOR PRECESSION 

Consider a system of angular momentum $\mathbf{j}$ which possesses a magnetic
moment $\mathbf{m} = \gamma\mathbf{j}$ collinear with $\mathbf{j}$ (the constant $\gamma$ is the gyromagnetic ratio of the
system), placed in a static magnetic field $\mathbf{B}_0$, which exerts a torque $\mathbf{m}\times\mathbf{B}_0$ on the
system. The classical equation of motion of $\mathbf{j}$ is:

$$\frac{d\mathbf{j}}{dt} = \mathbf{m}\times\mathbf{B}_0 \tag{1}$$

or:

$$\frac{d}{dt}\mathbf{m}(t) = \gamma\,\mathbf{m}(t)\times\mathbf{B}_0 \tag{2}$$

Performing a scalar multiplication of both sides of this equation by either $\mathbf{m}(t)$
or $\mathbf{B}_0$, we obtain:

$$\frac{d}{dt}[\mathbf{m}(t)]^2 = 0 \tag{3}$$

$$\frac{d}{dt}[\mathbf{m}(t)\cdot\mathbf{B}_0] = 0 \tag{4}$$

$\mathbf{m}(t)$ therefore evolves with a constant modulus, maintaining a constant angle
with $\mathbf{B}_0$. If we project equation (2) onto the plane perpendicular to $\mathbf{B}_0$, we see that
$\mathbf{m}(t)$ rotates about $\mathbf{B}_0$ (Larmor precession) with an angular velocity of $\omega_0 = -\gamma B_0$
(the rotation is clockwise if $\gamma$ is positive).

b. INFLUENCE OF A ROTATING FIELD; RESONANCE 

Now assume that we add to the static field $\mathbf{B}_0$ a field $\mathbf{B}_1(t)$, perpendicular
to $\mathbf{B}_0$, and which is of constant modulus and rotates about $\mathbf{B}_0$ with an angular
velocity $\omega$ (cf. fig. 1). We set:

$$\omega_0 = -\gamma B_0, \qquad \omega_1 = -\gamma B_1 \tag{5}$$



FIGURE 1 

Oxyz is a fixed coordinate system.
The static magnetic field $\mathbf{B}_0$ is
directed along the Oz axis. The
OXYZ system [the OX axis is
the direction of the field $\mathbf{B}_1(t)$]
rotates about Oz with the angular
velocity $\omega$.


We shall designate by Oxyz (unit vectors $\mathbf{e}_x$, $\mathbf{e}_y$, $\mathbf{e}_z$) a fixed coordinate
system, whose Oz axis is the direction of the field $\mathbf{B}_0$, and by OXYZ (unit
vectors $\mathbf{e}_X$, $\mathbf{e}_Y$, $\mathbf{e}_Z$), the axes obtained from Oxyz by rotation through an angle $\omega t$
about Oz [OX is the direction of the rotating field $\mathbf{B}_1(t)$]. The equation of
motion of $\mathbf{m}(t)$ in the presence of the total field $\mathbf{B}(t) = \mathbf{B}_0 + \mathbf{B}_1(t)$ then becomes:

$$\frac{d}{dt}\mathbf{m}(t) = \gamma\,\mathbf{m}(t)\times[\mathbf{B}_0 + \mathbf{B}_1(t)] \tag{6}$$

To solve this equation, it is convenient to place ourselves, not in the absolute
reference frame Oxyz, but in the rotating reference frame OXYZ, with respect
to which the relative velocity of the vector $\mathbf{m}(t)$ is:

$$\left(\frac{d\mathbf{m}}{dt}\right)_{\mathrm{rel}} = \frac{d\mathbf{m}}{dt} - \omega\,\mathbf{e}_z\times\mathbf{m}(t) \tag{7}$$

Let us set:

$$\Delta\omega = \omega - \omega_0 \tag{8}$$

Substituting (6) into (7), we obtain:

$$\left(\frac{d\mathbf{m}}{dt}\right)_{\mathrm{rel}} = \mathbf{m}(t)\times[\Delta\omega\,\mathbf{e}_Z - \omega_1\,\mathbf{e}_X] \tag{9}$$

This equation is much simpler to solve than equation (6), since the coefficients of
the right-hand side are now time-independent. Moreover, its form is analogous
to that of (2): the relative motion of the vector $\mathbf{m}(t)$ is therefore a rotation about
the "effective field" $\mathbf{B}_{\mathrm{eff}}$ (which is static with respect to the rotating reference frame),
given by (cf. fig. 2):

$$\mathbf{B}_{\mathrm{eff}} = \frac{1}{\gamma}[\Delta\omega\,\mathbf{e}_Z - \omega_1\,\mathbf{e}_X] \tag{10}$$



FIGURE 2 

In the rotating reference frame OXYZ, the effective 
field B eff has a fixed direction, about which the magnetic 
moment m (t) rotates with a constant angular velocity 
(precession in the rotating reference frame). 


To obtain the absolute motion of $\mathbf{m}(t)$, we must combine this precession about $\mathbf{B}_{\mathrm{eff}}$
with a rotation about Oz of angular velocity $\omega$.

These first results already enable us to understand the essence of the magnetic
resonance phenomenon. Let us consider a magnetic moment which, at time
t = 0, is parallel to the field $\mathbf{B}_0$ (the case, for example, of a magnetic moment
in thermodynamic equilibrium at very low temperatures: it is in the lowest energy
state possible in the presence of the field $\mathbf{B}_0$). What happens when we apply a weak
rotating field $\mathbf{B}_1(t)$? If the rotation frequency $\omega/2\pi$ of this field is very different
from the natural frequency $\omega_0/2\pi$ (more precisely, if $|\Delta\omega| = |\omega - \omega_0|$ is much
larger than $|\omega_1|$), the effective field is directed practically along Oz. The precession
of $\mathbf{m}(t)$ about $\mathbf{B}_{\mathrm{eff}}$ then has a very small amplitude and hardly modifies the direction
of the magnetic moment. On the other hand, if the resonance condition $\omega \simeq \omega_0$ is
satisfied ($|\Delta\omega| \ll |\omega_1|$), the angle between the field $\mathbf{B}_{\mathrm{eff}}$ and Oz is large. The precession
of the magnetic moment then has a large amplitude and, at resonance ($\Delta\omega = 0$),
the magnetic moment can even be completely flipped.
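This qualitative discussion can be made quantitative with a few lines of code. In the rotating frame, a unit moment initially along OZ precesses about the direction of $\Delta\omega\,\mathbf{e}_Z - \omega_1\mathbf{e}_X$, and its Z component follows from the standard rotation (Rodrigues) formula. The function names below are hypothetical and the sketch is only meant as an illustration.

```python
import math


def m_z_rotating_frame(t, delta_omega, omega_1):
    """Z component at time t of a unit moment that points along OZ at t = 0.

    The moment precesses about the direction of delta_omega*e_Z - omega_1*e_X
    at the angular frequency sqrt(delta_omega**2 + omega_1**2).
    """
    big_omega = math.hypot(delta_omega, omega_1)
    # Squared cosine of the angle between the effective field and OZ
    cos_tilt_squared = (delta_omega / big_omega) ** 2
    angle = big_omega * t
    return math.cos(angle) + cos_tilt_squared * (1.0 - math.cos(angle))


def minimum_m_z(delta_omega, omega_1):
    """Lowest value of the Z component, reached after half a precession period."""
    half_period = math.pi / math.hypot(delta_omega, omega_1)
    return m_z_rotating_frame(half_period, delta_omega, omega_1)


far_from_resonance = minimum_m_z(10.0, 1.0)  # |delta_omega| much larger than |omega_1|
at_resonance = minimum_m_z(0.0, 0.01)        # very weak field, exactly resonant
```

Far from resonance the Z component barely departs from 1, whereas at resonance it reaches -1 (complete flip), however small $\omega_1$ may be.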
2. Quantum mechanical treatment


a. THE SCHRODINGER EQUATION 

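Let $|+\rangle$ and $|-\rangle$ be two eigenvectors of the projection $S_z$ of the spin onto Oz,
with respective eigenvalues $+\hbar/2$ and $-\hbar/2$. The state vector of the system can
be written:

$$|\psi(t)\rangle = a_+(t)|+\rangle + a_-(t)|-\rangle \tag{11}$$

The Hamiltonian operator H(t) of the system is*:

$$H(t) = -\mathbf{M}\cdot\mathbf{B}(t) = -\gamma\,\mathbf{S}\cdot[\mathbf{B}_0 + \mathbf{B}_1(t)] \tag{12}$$

that is, expanding the scalar product:

$$H(t) = \omega_0 S_z + \omega_1[\cos\omega t\; S_x + \sin\omega t\; S_y] \tag{13}$$

Using formulas (A-16) and (A-17) of chapter IV, we obtain the matrix which
represents H in the $\{|+\rangle, |-\rangle\}$ basis:

$$H = \frac{\hbar}{2}\begin{pmatrix} \omega_0 & \omega_1 e^{-i\omega t} \\ \omega_1 e^{i\omega t} & -\omega_0 \end{pmatrix} \tag{14}$$

Using (11) and (14), we can write the Schrödinger equation in the form:

$$i\frac{d}{dt}a_+(t) = \frac{\omega_0}{2}a_+(t) + \frac{\omega_1}{2}e^{-i\omega t}a_-(t)$$
$$i\frac{d}{dt}a_-(t) = \frac{\omega_1}{2}e^{i\omega t}a_+(t) - \frac{\omega_0}{2}a_-(t) \tag{15}$$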


b. CHANGING TO THE ROTATING FRAME 

Equations (15) constitute a linear homogeneous system with time-dependent
coefficients. It is convenient to define new functions by setting:

$$b_+(t) = e^{i\omega t/2}a_+(t)$$
$$b_-(t) = e^{-i\omega t/2}a_-(t) \tag{16}$$

Substituting (16) into (15), we obtain a system which has constant coefficients:

$$i\frac{d}{dt}b_+(t) = -\frac{\Delta\omega}{2}b_+(t) + \frac{\omega_1}{2}b_-(t)$$
$$i\frac{d}{dt}b_-(t) = \frac{\omega_1}{2}b_+(t) + \frac{\Delta\omega}{2}b_-(t) \tag{17}$$


* In expression (12), $\mathbf{M}\cdot\mathbf{B}(t)$ symbolizes the scalar product $M_x B_x(t) + M_y B_y(t) + M_z B_z(t)$,
where $M_x$, $M_y$ and $M_z$ are operators (observables of the system under study), while $B_x(t)$, $B_y(t)$ and $B_z(t)$
are numbers (since we consider the magnetic field to be a classical quantity whose value is imposed by an
external device which is independent of the system under study).

This system can also be written:

$$i\hbar\frac{d}{dt}|\tilde\psi(t)\rangle = \tilde H\,|\tilde\psi(t)\rangle \tag{18}$$

if we introduce the ket $|\tilde\psi(t)\rangle$ and the operator $\tilde H$ defined by:

$$|\tilde\psi(t)\rangle = b_+(t)|+\rangle + b_-(t)|-\rangle \tag{19}$$

$$\tilde H = \frac{\hbar}{2}\begin{pmatrix} -\Delta\omega & \omega_1 \\ \omega_1 & \Delta\omega \end{pmatrix} \tag{20}$$


Transformation (16) has led to equation (18), which is analogous to a
Schrödinger equation in which the operator $\tilde H$, given in (20), plays the role of a
time-independent Hamiltonian. $\tilde H$ describes the interaction of the spin with a fixed
field, whose components are none other than those of the effective field introduced
above in the OXYZ frame [formula (10)]. We can therefore consider that the
transformation (16) is the quantum mechanical equivalent of the change from the
fixed Oxyz frame to the rotating OXYZ frame.


This result can be proved rigorously. According to (16), we can write:

$$|\tilde\psi(t)\rangle = R(t)\,|\psi(t)\rangle \tag{21}$$

where R(t) is the unitary operator defined by:

$$R(t) = e^{i\omega t S_z/\hbar} \tag{22}$$

We shall see later (cf. complement B_VI) that R(t) describes a rotation of the coordinate system
through an angle $\omega t$ about Oz. (18) is therefore indeed the transformed Schrödinger equation
in the rotating OXYZ frame.


Equation (18) is very simple to solve. To determine $|\tilde\psi(t)\rangle$, given $|\tilde\psi(0)\rangle$,
all we need to do is expand $|\tilde\psi(0)\rangle$ on the eigenvectors of $\tilde H$ (which can be
calculated exactly) and then apply rule (D-54) of chapter III (which is possible
since $\tilde H$ is not explicitly time-dependent). We then go from $|\tilde\psi(t)\rangle$ to $|\psi(t)\rangle$ by
using formulas (16).


c. TRANSITION PROBABILITY: RABI'S FORMULA 

Consider a spin which, at time t = 0, is in the state $|+\rangle$:

$$|\psi(0)\rangle = |+\rangle \tag{23}$$

According to (16), this corresponds to:

$$|\tilde\psi(0)\rangle = |+\rangle \tag{24}$$

What is the probability $\mathcal{P}_{+-}(t)$ of finding this spin in the state $|-\rangle$ at time t?

Since $a_-(t)$ and $b_-(t)$ have the same modulus, we can write:

$$\mathcal{P}_{+-}(t) = |\langle -|\psi(t)\rangle|^2 = |a_-(t)|^2 = |b_-(t)|^2 = |\langle -|\tilde\psi(t)\rangle|^2 \tag{25}$$

We must therefore calculate $|\langle -|\tilde\psi(t)\rangle|^2$, where $|\tilde\psi(t)\rangle$ is the solution of (18)
which corresponds to the initial condition (24).

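The problem we have just posed has already been solved, in § C-3-b of
chapter IV. To use the calculations of that section, all we need to do is apply
the following correspondences:

$$|\varphi_1\rangle \to |+\rangle, \qquad |\varphi_2\rangle \to |-\rangle$$
$$E_1 \to -\frac{\hbar}{2}\Delta\omega, \qquad E_2 \to \frac{\hbar}{2}\Delta\omega, \qquad W_{12} \to \frac{\hbar}{2}\omega_1 \tag{26}$$

Rabi's formula [equation (C-32) of chapter IV] then becomes:

$$\mathcal{P}_{+-}(t) = \frac{\omega_1^2}{\omega_1^2 + (\Delta\omega)^2}\,\sin^2\left[\sqrt{\omega_1^2 + (\Delta\omega)^2}\;\frac{t}{2}\right] \tag{27}$$

The probability $\mathcal{P}_{+-}(t)$ is, of course, zero at time t = 0 and then varies sinusoidally
with respect to time between the values 0 and $\omega_1^2/[\omega_1^2 + (\Delta\omega)^2]$. Again, we have a
resonance phenomenon. For $|\Delta\omega| \gg |\omega_1|$, $\mathcal{P}_{+-}(t)$ remains almost zero (cf. fig. 3-a);
near resonance, the oscillation amplitude of $\mathcal{P}_{+-}(t)$ becomes large and, when the
condition $\Delta\omega = 0$ is exactly satisfied, we have $\mathcal{P}_{+-}(t) = 1$ at times $t = (2n + 1)\pi/|\omega_1|$,
where n is a non-negative integer (cf. fig. 3-b).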



FIGURE 3


Variation with respect to time of the transition probability between the states $|+\rangle$ and $|-\rangle$, under
the effect of a rotating magnetic field $\mathbf{B}_1(t)$. Outside resonance (fig. a), this probability remains small;
at resonance (fig. b), however small the field $\mathbf{B}_1$, there exist times when the transition probability is
equal to 1.

Thus we again find the result which we have already obtained classically:
at resonance, a very weak rotating field is able to reverse the direction of the spin.
Note, moreover, that the angular frequency of the oscillation of $\mathcal{P}_{+-}(t)$
is $\sqrt{\omega_1^2 + (\Delta\omega)^2} = |\gamma B_{\mathrm{eff}}|$. This oscillation corresponds, in the rotating frame,
to the projection onto OZ of the precession of the magnetic moment about the
effective field, sometimes called "Rabi precession" [see also the calculation
of $\mathcal{P}_{+-}(t)$ in complement C_IV, § 3-c].
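Rabi's formula can be cross-checked by integrating the rotating-frame equations for $b_+(t)$ and $b_-(t)$ directly. The sketch below uses a fourth-order Runge-Kutta scheme written by hand; the step count, parameter values and function names are assumptions made for the illustration, not part of the text.

```python
import math


def rabi_probability(t, omega_1, delta_omega):
    """Transition probability predicted by Rabi's formula."""
    big_omega_sq = omega_1 ** 2 + delta_omega ** 2
    return omega_1 ** 2 / big_omega_sq * math.sin(math.sqrt(big_omega_sq) * t / 2) ** 2


def integrate_rotating_frame(t_final, omega_1, delta_omega, steps=2000):
    """RK4 integration of the rotating-frame equations with b_+(0) = 1, b_-(0) = 0."""
    def deriv(b_plus, b_minus):
        # i db/dt = H b  is rewritten as  db/dt = -i H b
        return (-1j * (-delta_omega / 2 * b_plus + omega_1 / 2 * b_minus),
                -1j * (omega_1 / 2 * b_plus + delta_omega / 2 * b_minus))

    b_plus, b_minus = 1 + 0j, 0j
    h = t_final / steps
    for _ in range(steps):
        k1 = deriv(b_plus, b_minus)
        k2 = deriv(b_plus + h / 2 * k1[0], b_minus + h / 2 * k1[1])
        k3 = deriv(b_plus + h / 2 * k2[0], b_minus + h / 2 * k2[1])
        k4 = deriv(b_plus + h * k3[0], b_minus + h * k3[1])
        b_plus += h / 6 * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0])
        b_minus += h / 6 * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1])
    return abs(b_minus) ** 2  # probability of finding the spin in |->


numerical = integrate_rotating_frame(3.0, omega_1=1.0, delta_omega=0.5)
analytical = rabi_probability(3.0, omega_1=1.0, delta_omega=0.5)
```

The two values agree to within the integration error, and `rabi_probability(math.pi, 1.0, 0.0)` returns 1 (complete flip at resonance).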


d. CASE WHERE THE TWO LEVELS ARE UNSTABLE 


We are now going to assume that the two states $|\pm\rangle$ correspond to two sublevels of an
excited atomic level (whose angular momentum is assumed equal to 1/2). n atoms are excited
per unit time, all being raised to the state $|+\rangle$*. An atom decays, by spontaneous emission of
radiation, with a probability per unit time of $1/\tau$, which is the same for the two sublevels $|\pm\rangle$.
We know that, under these conditions, an atom which was excited at time $-t$ has a probability
$e^{-t/\tau}$ of still being excited at time t = 0 (cf. complement K_III).

We assume that the experiment is performed in the steady state: in the presence of the
fields $\mathbf{B}_0$ and $\mathbf{B}_1(t)$, the atoms are excited at a constant rate n into the state $|+\rangle$. After a time
much longer than the lifetime $\tau$, what is the number N of atoms which decay per unit time
from the state $|-\rangle$? If an atom is excited at time $-t$, the probability of finding it in the
state $|-\rangle$ at t = 0 is $e^{-t/\tau}\mathcal{P}_{+-}(t)$, where $\mathcal{P}_{+-}(t)$ is given by relation (27). The total number
of atoms in the state $|-\rangle$ is obtained by taking the sum of atoms excited at all previous times $-t$,
that is, by calculating the integral:

$$\int_0^\infty e^{-t/\tau}\,\mathcal{P}_{+-}(t)\,n\,dt \tag{28}$$


This calculation presents no difficulties. Multiplying the number of atoms thus obtained by
their probability $1/\tau$ of decay per unit time, we obtain:

$$N = \frac{n}{2}\,\frac{\omega_1^2}{(\Delta\omega)^2 + \omega_1^2 + (1/\tau)^2} \tag{29}$$

The variation of N with respect to $\Delta\omega$ corresponds to a Lorentz curve whose half-width is:

$$L = \sqrt{\omega_1^2 + (1/\tau)^2} \tag{30}$$

In the experiment described above, let us measure, for various values of the magnetic
field $B_0$ (that is, with $\omega$ assumed to be fixed, for various values of $\Delta\omega$), the number of atoms
which decay from the level $|-\rangle$. According to (29), we obtain a resonance curve which has
the shape shown in figure 4.
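The Lorentzian result can be checked by evaluating the integral over past excitation times numerically. The sketch below uses a plain trapezoidal rule truncated at 40 lifetimes; the truncation, step count, parameter values and function names are all assumptions chosen for the illustration.

```python
import math


def rabi_probability(t, omega_1, delta_omega):
    """Transition probability given by Rabi's formula."""
    big_omega_sq = omega_1 ** 2 + delta_omega ** 2
    return omega_1 ** 2 / big_omega_sq * math.sin(math.sqrt(big_omega_sq) * t / 2) ** 2


def decay_rate_closed_form(n, omega_1, delta_omega, tau):
    """Number of atoms decaying per unit time from |->, Lorentzian expression."""
    return 0.5 * n * omega_1 ** 2 / (delta_omega ** 2 + omega_1 ** 2 + 1.0 / tau ** 2)


def decay_rate_numerical(n, omega_1, delta_omega, tau, t_max_in_tau=40.0, steps=100_000):
    """Trapezoidal estimate of (1/tau) * integral of exp(-t/tau) P(t) n dt."""
    h = t_max_in_tau * tau / steps
    total = 0.0
    for k in range(steps + 1):
        t = k * h
        weight = 0.5 if k in (0, steps) else 1.0
        total += weight * math.exp(-t / tau) * rabi_probability(t, omega_1, delta_omega) * n
    return total * h / tau


closed = decay_rate_closed_form(1.0, 1.3, 0.4, 2.0)
numerical = decay_rate_numerical(1.0, 1.3, 0.4, 2.0)
```

Scanning `delta_omega` at fixed `omega_1` and `tau` then traces out the resonance curve, whose half-width at half-maximum is $\sqrt{\omega_1^2 + (1/\tau)^2}$.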

It is very interesting to obtain such a curve experimentally, since one can use it to 
determine several parameters: 

- if we know $\omega$ and measure the value of the field $B_0$ which corresponds to the
peak of the curve, we can deduce the value of the gyromagnetic ratio $\gamma$ through the relation
$\gamma = -\omega/B_0$.


* In practice, this excitation can be produced, for example, by placing the atoms in a light beam.
When the incident photons are polarized, conservation of angular momentum, in certain cases, requires
that the atoms which absorb them can attain only the state $|+\rangle$ (and not the state $|-\rangle$). Similarly, by
detecting the polarization of the photons re-emitted by the atoms, one can know whether the atoms fall
back into the ground state from the state $|+\rangle$ or the state $|-\rangle$.

FIGURE 4

Resonance curve. To observe a resonance phenomenon,
we perform an experiment in which
n atoms are excited per unit time into the
state $|+\rangle$. Under the effect of a field
rotating at the frequency $\omega/2\pi$, the atoms
undergo transitions towards the state $|-\rangle$.
In the steady state, if we measure the number N of atoms which decay per unit time from the state
$|-\rangle$, we obtain a resonant variation when we scan the static field $B_0$ about the value $-\omega/\gamma$.




- if we know $\gamma$, we can, by measuring the frequency $\omega/2\pi$ which corresponds to
resonance, measure the static magnetic field $B_0$. Various magnetometers, often of very great
precision, operate on this principle. In certain cases, one can derive interesting information
from such a measurement of the field. If, for example, the spin being considered is that of a
nucleus which belongs to a molecule or to a crystal lattice, one can find the local field seen
by the nucleus, its variation with the site occupied, etc.

- if we trace the square $L^2$ of the half-width as a function of $\omega_1^2$, we obtain a straight line
which, extrapolated to $\omega_1 = 0$, gives the lifetime $\tau$ of the excited level (cf. fig. 5).


FIGURE 5

The extrapolation to $\omega_1 = 0$ of the squared
half-width $L^2$ of the resonance curve of figure 4
gives the lifetime of the level being studied.


3. Relation between the classical treatment 

and the quantum mechanical treatment: evolution of < M > 

The results obtained in §§ 1 and 2 are very similar, although we used classical 
mechanics in one case and quantum mechanics in the other. We are now going to 
show that this similarity is not accidental. It arises from the fact that the quantum 
mechanical evolution equations of the mean value of a magnetic moment 
placed in an arbitrary magnetic field are identical to the corresponding classical 
equations. 

The mean value of the magnetic moment associated with a spin 1/2 is:

$$\langle \mathbf{M}\rangle(t) = \gamma\langle \mathbf{S}\rangle(t) \tag{31}$$

To calculate the evolution of $\langle \mathbf{M}\rangle(t)$, we use theorem (D-27) of chapter III:

$$i\hbar\frac{d}{dt}\langle \mathbf{M}\rangle(t) = \langle[\mathbf{M}, H(t)]\rangle \tag{32}$$

where H(t) is the operator:

$$H(t) = -\mathbf{M}\cdot\mathbf{B}(t) \tag{33}$$

Let us calculate for example the commutator $[M_x, H(t)]$. Using the fact that the
field components $B_y(t)$ and $B_z(t)$ are numbers (cf. note of § 2-a), we find:

$$[M_x, H(t)] = -\gamma^2[S_x,\; S_x B_x(t) + S_y B_y(t) + S_z B_z(t)]$$
$$= -\gamma^2 B_y(t)[S_x, S_y] - \gamma^2 B_z(t)[S_x, S_z] \tag{34}$$

Using relations (14) of complement A_IV, we obtain:

$$[M_x, H(t)] = i\hbar\gamma^2[B_z(t)S_y - B_y(t)S_z] \tag{35}$$

Substituting (35) into (32):

$$\frac{d}{dt}\langle M_x\rangle(t) = \gamma[B_z(t)\langle M_y\rangle(t) - B_y(t)\langle M_z\rangle(t)] \tag{36}$$

By cyclic permutation, we can calculate analogous expressions for the components
on Oy and Oz; the three equations obtained can be condensed into:

$$\frac{d}{dt}\langle \mathbf{M}\rangle(t) = \gamma\,\langle \mathbf{M}\rangle(t)\times\mathbf{B}(t) \tag{37}$$


Let us compare (37) with (6): the evolution of the mean value $\langle \mathbf{M}\rangle(t)$ obeys
the classical equations exactly, whatever the time-dependence of the magnetic
field $\mathbf{B}(t)$.
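The commutator at the heart of this derivation can be verified with explicit 2 x 2 matrices. The sketch below represents the spin components by $\hbar/2$ times the Pauli matrices, in units where $\hbar = 1$; the value of the gyromagnetic ratio, the field components and the helper names are arbitrary assumptions.

```python
HBAR = 1.0
GAMMA = 2.0  # arbitrary gyromagnetic ratio (assumption)

S_X = [[0, HBAR / 2], [HBAR / 2, 0]]
S_Y = [[0, -1j * HBAR / 2], [1j * HBAR / 2, 0]]
S_Z = [[HBAR / 2, 0], [0, -HBAR / 2]]


def combine(terms):
    """Linear combination sum(c * m) of 2 x 2 matrices given as (c, m) pairs."""
    return [[sum(c * m[i][j] for c, m in terms) for j in range(2)] for i in range(2)]


def matmul2(a, b):
    """Product of two 2 x 2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def commutator(a, b):
    return combine([(1, matmul2(a, b)), (-1, matmul2(b, a))])


b_x, b_y, b_z = 0.3, -1.1, 0.8  # field components at some instant (arbitrary numbers)
hamiltonian = combine([(-GAMMA * b_x, S_X), (-GAMMA * b_y, S_Y), (-GAMMA * b_z, S_Z)])
m_x = combine([(GAMMA, S_X)])

left = commutator(m_x, hamiltonian)
right = combine([(1j * HBAR * GAMMA ** 2 * b_z, S_Y),
                 (-1j * HBAR * GAMMA ** 2 * b_y, S_Z)])
```

The matrices `left` and `right` coincide element by element, which is the content of the commutation result used to obtain the evolution of $\langle M_x\rangle$.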


4. Bloch's equations 

In practice, in a magnetic resonance experiment, it is not the magnetic moment of a single
spin that is observed, but rather that of a great number of identical spins (as in the experiment
described in § 2-d above, where the number of atoms which decay from the state $|-\rangle$ is
detected). Moreover, one is not concerned solely with the quantity $\mathcal{P}_{+-}(t)$, calculated above.
One can also measure the global magnetization $\mathcal{M}$ of the sample under study: the sum of the
mean values of the observable $\mathbf{M}$ corresponding to each spin of the sample*. It is interesting,
therefore, to obtain the equations of motion of $\mathcal{M}$, called the Bloch equations.

In order to understand the physical significance of the various terms appearing in these 
equations, we are going to derive them for a simple concrete case. The results obtained can be 
generalized to other more complicated situations. 

a. A CONCRETE EXAMPLE 

Consider a beam of atoms issuing from an atomic polarizer of the type studied in
§ B-1-a of chapter IV. All the atoms of the beam** are in the spin state $|+\rangle$ and therefore
have their magnetic moments parallel to Oz. They enter a cell C through a small opening (fig. 6),
rebound a certain number of times from the inside walls of the cell and, after a certain time,
escape through the same opening.

* It is possible to detect, for example, the electromotive force induced in a coil by the variation
of $\mathcal{M}$ with respect to time.

** For example, silver or hydrogen atoms in the ground state. For the sake of simplicity, all
effects related to nuclear spin are neglected.

FIGURE 6


Schematic drawing of an experimental device which supplies cell C with atoms in the state $|+\rangle$.


We shall denote by n the number of polarized atoms entering the cell per unit time;
n is generally small and the atomic density inside the cell is low enough to allow atomic
interactions to be neglected. Moreover, if the inside walls of the cell are suitably coated,
collisions with the walls have little effect on the spin state of the atoms*. We shall assume
that there is a probability per unit time $1/T_R$ for the elementary magnetization introduced into
the cell by a polarized atom to disappear, either because of a depolarizing collision with the
walls or simply because the atom has left the cell. $T_R$ is called the "relaxation time". The cell
is placed in a magnetic field $\mathbf{B}(t)$ which may have a static component and a rotating component.
The problem consists of finding the equation of motion of the global magnetization $\mathcal{M}(t)$
of the atoms which are inside the cell at time t. First, let us write the exact expression for $\mathcal{M}(t)$:

$$\mathcal{M}(t) = \sum_{i=1}^{\mathcal{N}} \langle\psi^{(i)}(t)|\,\mathbf{M}\,|\psi^{(i)}(t)\rangle = \sum_{i=1}^{\mathcal{N}} \mathcal{M}^{(i)}(t) \tag{38}$$

In (38), the sum is taken over the $\mathcal{N}$ spins which are already in the cell and which, at time t, have
neither left nor undergone a depolarizing collision. $|\psi^{(i)}(t)\rangle$ is the state vector of such a spin (i)
at time t [we are not counting, in (38), the spins which have undergone a depolarizing collision
and have not yet left the cell, since their global contribution is zero: their spins point
randomly in all directions].

Between times t and t + dt, $\mathcal{M}(t)$ varies for three different reasons:

(i) A certain proportion, $dt/T_R$, of the $\mathcal{N}$ spins undergo a depolarizing collision or leave
the cell; these spins disappear from the sum (38) and $\mathcal{M}(t)$ therefore decreases by:

$$d\mathcal{M}(t) = -\frac{dt}{T_R}\,\mathcal{M}(t) \tag{39}$$

(ii) The other spins evolve freely in the field $\mathbf{B}(t)$. We saw in § 3 above that, for each
of them, the evolution of the mean value of $\mathbf{M}$:

$$\mathcal{M}^{(i)}(t) = \langle\psi^{(i)}(t)|\,\mathbf{M}\,|\psi^{(i)}(t)\rangle$$

obeys the classical law:

$$d\mathcal{M}^{(i)}(t) = \gamma\,\mathcal{M}^{(i)}(t)\times\mathbf{B}(t)\,dt \tag{40}$$

Since the right-hand side of (40) is linear with respect to $\mathcal{M}^{(i)}(t)$, the contribution of these spins
to the variation of $\mathcal{M}(t)$ is given by:

$$d\mathcal{M}(t) = \gamma\,\mathcal{M}(t)\times\mathbf{B}(t)\,dt \tag{41}$$

* For example, for hydrogen atoms bouncing off teflon walls, tens of thousands of collisions
are required for the magnetic moment of the hydrogen atom to become disoriented.

(iii) Finally, a certain number, n dt, of new spins have entered the cell. Each of them
adds to the global magnetization a contribution $\boldsymbol{\mu}_0$, equal to the mean value of $\mathbf{M}$ in the state $|+\rangle$
($\boldsymbol{\mu}_0$ is parallel to Oz and $|\boldsymbol{\mu}_0| = |\gamma|\hbar/2$). $\mathcal{M}$ therefore increases by:

$$d\mathcal{M}(t) = n\,\boldsymbol{\mu}_0\,dt \tag{42}$$

The global variation of $\mathcal{M}$ is obtained by adding (39), (41) and (42). Dividing by dt,
we obtain the equation of motion of $\mathcal{M}(t)$ (Bloch equation):

$$\frac{d}{dt}\mathcal{M}(t) = n\,\boldsymbol{\mu}_0 - \frac{1}{T_R}\mathcal{M}(t) + \gamma\,\mathcal{M}(t)\times\mathbf{B}(t) \tag{43}$$

We have derived (43) in a specific case, making certain hypotheses. However, the main
features of this equation remain valid for a great number of other experiments where the rate
of variation of $\mathcal{M}(t)$ appears in the form of a sum of three terms:

- a source term (here $n\boldsymbol{\mu}_0$) which describes the preparation of the system. It would,
in fact, be impossible to observe magnetic resonance without a preliminary polarization of the
spins, which can be achieved through selection using a magnetic field gradient (as in the example
studied here), a polarized optical excitation (as in the example studied in § 2-d above), cooling
of the sample in a strong static field, etc.

- a damping term [here $-\frac{1}{T_R}\mathcal{M}(t)$] which describes the disappearance or "relaxation"
of the global magnetization under the effect of various processes: collisions, disappearance
of atoms, change in atomic levels through spontaneous emission (as in the example studied
in § 2-d), etc.

- a term which describes the precession of $\mathcal{M}(t)$ in the field $\mathbf{B}(t)$ [last term of (43)].


b. SOLUTION IN THE CASE OF A ROTATING FIELD


When the field $\mathbf{B}(t)$ is the sum of a static field $\mathbf{B}_0$ and a rotating field $\mathbf{B}_1(t)$, such as those
considered above, equations (43) can be solved exactly. As in §§ 1 and 2, one changes to the
rotating frame OXYZ, with respect to which the relative variation of $\mathcal{M}(t)$ is:

$$\left(\frac{d\mathcal{M}}{dt}\right)_{\mathrm{rel}} = n\,\boldsymbol{\mu}_0 - \frac{1}{T_R}\mathcal{M} + \gamma\,\mathcal{M}\times\mathbf{B}_{\mathrm{eff}} \tag{44}$$

[where $\mathbf{B}_{\mathrm{eff}}$ is defined by equation (10)].

Projecting this equation onto OX, OY and OZ, we obtain a system of three linear
differential equations with constant coefficients whose stationary solution (valid after a time
much greater than $T_R$) is, $\mu_0$ denoting the component of $\boldsymbol{\mu}_0$ on Oz:

$$(\mathcal{M}_X)_S = -n\mu_0 T_R\,\frac{\omega_1\,\Delta\omega}{(\Delta\omega)^2 + \omega_1^2 + (1/T_R)^2}$$
$$(\mathcal{M}_Y)_S = -n\mu_0\,\frac{\omega_1}{(\Delta\omega)^2 + \omega_1^2 + (1/T_R)^2} \tag{45}$$
$$(\mathcal{M}_Z)_S = n\mu_0 T_R\left[1 - \frac{\omega_1^2}{(\Delta\omega)^2 + \omega_1^2 + (1/T_R)^2}\right]$$

The three components of the stationary magnetization $(\mathcal{M})_S$, when the field $B_0$ varies, have
resonant variations about the value $B_0 = -\omega/\gamma$ (cf. fig. 7). $(\mathcal{M}_Y)_S$ and $(\mathcal{M}_Z)_S$ give absorption
curves [Lorentz curves of width $2\sqrt{\omega_1^2 + (1/T_R)^2}$]. $(\mathcal{M}_X)_S$ gives a dispersion curve (of the same
width).
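The stationary solution can be cross-checked by letting the rotating-frame Bloch equation relax numerically from an empty cell. The sketch below projects the equation onto OX, OY, OZ, with $\gamma\,\mathcal{M}\times\mathbf{B}_{\mathrm{eff}} = \mathcal{M}\times(\Delta\omega\,\mathbf{e}_Z - \omega_1\mathbf{e}_X)$, and integrates it with a hand-written fourth-order Runge-Kutta scheme; the parameter values, the integration time and the function names are assumptions made for the illustration.

```python
def bloch_rhs(m, source, omega_1, delta_omega, t_r):
    """Right-hand side of the rotating-frame Bloch equation, component by component."""
    x, y, z = m
    return (-x / t_r + delta_omega * y,
            -y / t_r - delta_omega * x - omega_1 * z,
            source - z / t_r + omega_1 * y)  # source = n * mu_0, along OZ


def relax_to_steady_state(source, omega_1, delta_omega, t_r,
                          duration_in_t_r=40.0, steps=40_000):
    """RK4 integration starting from zero magnetization (empty cell)."""
    m = (0.0, 0.0, 0.0)
    h = duration_in_t_r * t_r / steps

    def shifted(base, slope, factor):
        return tuple(b + factor * s for b, s in zip(base, slope))

    for _ in range(steps):
        k1 = bloch_rhs(m, source, omega_1, delta_omega, t_r)
        k2 = bloch_rhs(shifted(m, k1, h / 2), source, omega_1, delta_omega, t_r)
        k3 = bloch_rhs(shifted(m, k2, h / 2), source, omega_1, delta_omega, t_r)
        k4 = bloch_rhs(shifted(m, k3, h), source, omega_1, delta_omega, t_r)
        m = tuple(mi + h / 6 * (a + 2 * b + 2 * c + d)
                  for mi, a, b, c, d in zip(m, k1, k2, k3, k4))
    return m


def steady_state_closed_form(source, omega_1, delta_omega, t_r):
    """Stationary magnetization (dispersion curve on OX, absorption on OY and OZ)."""
    denom = delta_omega ** 2 + omega_1 ** 2 + 1.0 / t_r ** 2
    return (-source * t_r * omega_1 * delta_omega / denom,
            -source * omega_1 / denom,
            source * t_r * (1.0 - omega_1 ** 2 / denom))


numerical = relax_to_steady_state(1.0, 1.0, 0.7, 1.0)
closed = steady_state_closed_form(1.0, 1.0, 0.7, 1.0)
```

After a time much longer than $T_R$, the integrated magnetization coincides with the closed-form stationary values; changing the sign of `delta_omega` flips the sign of the X component only, as expected for a dispersion curve.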



FIGURE 7

Variation with respect to
$\Delta\omega = \omega - \omega_0$ of the stationary
values of the components
of $\mathcal{M}$ in the rotating frame.
One obtains a dispersion curve
for $(\mathcal{M}_X)_S$ and absorption
curves for $(\mathcal{M}_Y)_S$ and $(\mathcal{M}_Z)_S$.
The three curves have the same
width, $2\sqrt{\omega_1^2 + (1/T_R)^2}$, which
increases with $\omega_1$. They have
been drawn assuming that
$\omega_1 = 1/T_R$ ("half-saturation").


The comments made at the end of § 2-d above about the experimental interest of such 
curves can be repeated here. 


References and suggestions for further reading : 

Feynman II (7.2), chap. 35; Cagnac and Pebay-Peyroula (11.2), chaps. IX §5, 
X §5, XI §§2 to 5, XIX §3; Kuhn (11.1), §VI, D. 

See the references of section 14 of the bibliography, particularly Abragam (14.1)
and Slichter (14.2).


1. Description of the model

2. Eigenfunctions and eigenvalues of the Hamiltonian 

a. Infinite potential barrier 

b. Finite potential barrier 

c. Evolution of the molecule. Inversion frequency 

3. The ammonia molecule considered as a two-level system 

a. The state space 

b. Energy levels. Removal of the degeneracy due to the transparency of the potential barrier

c. Influence of a static electric field 


1. Description of the model 

In the ammonia molecule NH$_3$, the three hydrogen atoms form the base of
a pyramid whose apex is the nitrogen atom (cf. fig. 1). We shall study this
molecule by using a simplified model with the following features: the nitrogen
atom, much heavier than its partners, is motionless; the hydrogen atoms form
a rigid equilateral triangle whose axis always passes through the nitrogen
atom. The potential energy of the system is thus a function of only
one parameter, the (algebraic) distance x between the nitrogen atom and
the plane defined by the three hydrogen atoms*. The shape of this potential
energy V(x) is given by the solid-line curve in figure 2. The symmetry of the
problem with respect to the x = 0 plane requires V(x) to be an even function of x.
The two minima of V(x) correspond to two symmetrical configurations of the
molecule in which, classically, it is stable; we shall choose the energy origin
such that its energy is then zero. The potential barrier at x = 0, of height $V_1$,
expresses the fact that, if the nitrogen atom is in the plane of the hydrogen atoms,
they repel it. Finally, the increase in V(x) when |x| is greater than b corresponds
to the chemical bonding force which ensures the cohesion of the molecule.


FIGURE 1

Schematic drawing of the ammonia molecule; x is the algebraic
distance between the plane of the hydrogen atoms and
the nitrogen atom, which is assumed to be motionless.


* In this one-dimensional model, effects linked to the rotation of the molecule are obviously not
taken into account.



FIGURE 2 

Variation with respect to x of the potential energy V(x) of the molecule. V(x) has two minima (clas¬ 
sical equilibrium positions), separated by a potential barrier due to the repulsion for small |x| between 
the nitrogen atom and the three hydrogen atoms. The “square potential” used to approximate V(x) 
is shown in dashed lines. 


This model therefore reduces the problem to a one-dimensional one in which
a fictitious particle of mass $m$ (it can be shown that the "reduced mass" $m$ of the
system is equal to $\dfrac{3 m_H m_N}{3 m_H + m_N}$) is under the influence of the potential $V(x)$. Under
these conditions, what are the energy levels predicted by quantum mechanics?
With respect to classical predictions, two major differences appear : 

(i) The Heisenberg uncertainty relation forbids the molecule to have an
energy equal to the minimum of $V(x)$ ($V_{\min} = 0$ in our case). We have already seen,
in complements C$_{I}$ and M$_{III}$, why this energy must be greater than $V_{\min}$.

(ii) Classically, the potential barrier at $x = 0$ cannot be cleared by a particle
whose energy is less than $V_1$: the nitrogen atom thus always remains on the same
side of the plane of the hydrogen atoms, and the molecule cannot invert itself.
Quantum mechanically, such a particle can cross this barrier by the tunnel
effect (cf. chap. I, § D-2-c): the inversion of the molecule is therefore always
possible. We are going to discuss the consequences of this effect.

We shall be concerned here only with a qualitative discussion of the physical
phenomena and not with an exact quantitative calculation, which would not have
much significance in this approximate model. For example, we shall try to
demonstrate the existence of an inversion frequency of the ammonia molecule,
without giving an exact or even an approximate value of this frequency. We shall
therefore simplify the problem still more by replacing the function $V(x)$ by the square
potential drawn in dashed lines in figure 2 [two infinite potential steps
at $x = \pm(b + a/2)$ and a potential barrier of height $V_0$ centered at $x = 0$ and of
width $(2b - a)$].


2. Eigenfunctions and eigenvalues of the Hamiltonian 

a. INFINITE POTENTIAL BARRIER 

Before calculating the eigenfunctions and eigenvalues of the Hamiltonian 
corresponding to the “square” potential of figure 2, we are going to assume, in 
this first stage, that the potential barrier V 0 is infinite (in which case, no tunnel 
effect is possible). This will lead us to a better understanding of the consequences 
of the tunnel effect across the finite potential barrier of figure 2. We shall therefore 
consider, first of all, a particle in a potential V(x) composed of two infinite wells 
of width a centered at x = ± b (fig. 3). If the particle is in one of these two wells, 
it obviously cannot go into the other one. 



FIGURE 3 

When the height of the potential 
barrier of figure 2 is large, we have two 
practically infinite potential wells of width 
a whose centers are separated by a distance 
of 2 b. 


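Each of the two wells of figure 3 is similar to the one studied in complement H$_{I}$,
in § 2-c-β. We can therefore use the results obtained in this complement. The
possible energies of the particle are:

$$E_n = \frac{\hbar^2 k_n^2}{2m} \tag{1}$$

with:

$$k_n = \frac{n\pi}{a} \tag{2}$$

(where $n$ is a positive integer). Each of the energy values is twofold degenerate, since
two wave functions correspond to it:

$$\varphi_1^n(x) = \begin{cases} \sqrt{\dfrac{2}{a}}\,\sin\left[k_n\left(b + \dfrac{a}{2} - x\right)\right] & \text{if } b - \dfrac{a}{2} \le x \le b + \dfrac{a}{2} \\ 0 & \text{everywhere else} \end{cases}$$

$$\varphi_2^n(x) = \begin{cases} \sqrt{\dfrac{2}{a}}\,\sin\left[k_n\left(b + \dfrac{a}{2} + x\right)\right] & \text{if } b - \dfrac{a}{2} \le -x \le b + \dfrac{a}{2} \\ 0 & \text{everywhere else} \end{cases} \tag{3}$$

In the state $|\varphi_1^n\rangle$, the particle is in the infinite well on the right; in the state $|\varphi_2^n\rangle$,
it is in the one on the left.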

Figure 4 shows the first two energy levels of the molecule, which are twofold
degenerate. The Bohr frequency $(E_2 - E_1)/h$ associated with these two levels
corresponds, as we saw in complement A$_{III}$ (§ 2-b), to the to-and-fro motion of the
particle between the two sides of the well on the right (or on the left) when its state
is a linear superposition of $|\varphi_1^1\rangle$ and $|\varphi_1^2\rangle$ (or of $|\varphi_2^1\rangle$ and $|\varphi_2^2\rangle$). Physically,
such an oscillation represents a molecular vibration of the plane of the three
hydrogen atoms about its stable equilibrium position, which corresponds to $x = +b$
(or $x = -b$). The frequency of this oscillation falls in the infrared part of the
spectrum.




FIGURE 4

First energy levels obtained in the potential wells of figure 3: $E_1 = \pi^2\hbar^2/2ma^2$ and
$E_2 = 4\pi^2\hbar^2/2ma^2$. The oscillation of the system in one of
the two wells at the Bohr frequency $\nu = (E_2 - E_1)/h$ represents the vibration of the molecule about
one of its two classical equilibrium positions.


In the rest of the calculations, it is convenient to change bases in each of
the eigensubspaces of the Hamiltonian of the particle. Since the function $V(x)$
is even, this Hamiltonian $H$ commutes with the parity operator $\Pi$ (cf. complement F$_{II}$,
§4). In this case, a basis of eigenvectors of $H$ can be found which are even or odd;
the wave functions of these vectors are the symmetrical and antisymmetrical linear
combinations:

$$\varphi_s^n(x) = \frac{1}{\sqrt{2}}\left[\varphi_1^n(x) + \varphi_2^n(x)\right]$$

$$\varphi_a^n(x) = \frac{1}{\sqrt{2}}\left[\varphi_1^n(x) - \varphi_2^n(x)\right] \tag{4}$$

In the states $|\varphi_s^n\rangle$ and $|\varphi_a^n\rangle$, the particle can be found in one or the other of
the two potential wells.

In what follows, we shall confine ourselves to the study of the ground state,
for which the wave functions $\varphi_1^1(x)$, $\varphi_2^1(x)$, $\varphi_s^1(x)$ and $\varphi_a^1(x)$ are shown in figure 5.



FIGURE 5

The states $\varphi_1^1(x)$ and $\varphi_2^1(x)$, shown in figure a, are stationary states with the same energy, respectively
localized in the right-hand well and the left-hand well of figure 3. To use the symmetry of the
problem, it is more convenient to choose as stationary states the symmetrical state $\varphi_s^1(x)$ and the
antisymmetrical state $\varphi_a^1(x)$, linear combinations of $\varphi_1^1(x)$ and $\varphi_2^1(x)$ (figure b).
b. FINITE POTENTIAL BARRIER

Let us try to find the shape of the eigenfunctions of the first energy levels 
when V 0 has a finite value (assumed, nevertheless, to be greater than the energy 
of these levels). 

Inside the two "square" potential wells (dashed lines in figure 2), $V(x) = 0$.
The wave function is therefore of the form:

$$\chi(x) = A \sin\left[k\left(b + \frac{a}{2} - x\right)\right] \quad \text{if } b - \frac{a}{2} \le x \le b + \frac{a}{2}$$

$$\chi(x) = A' \sin\left[k\left(b + \frac{a}{2} + x\right)\right] \quad \text{if } b - \frac{a}{2} \le -x \le b + \frac{a}{2} \tag{5}$$

where $k$ is related to the energy $E$ of the level by the relation:

$$E = \frac{\hbar^2 k^2}{2m} \tag{6}$$

As in the preceding paragraph, $\chi(x)$ always goes to zero at $x = \pm(b + a/2)$,
since $V(x)$ becomes infinite at these two points. On the other hand, since $V_0$ is
finite, $\chi(x)$ no longer goes to zero at $x = \pm(b - a/2)$; consequently, $k$ no longer
satisfies relation (2).

Once again, since $V(x)$ is even, we can look for eigenfunctions of the
Hamiltonian, $\chi_s(x)$ and $\chi_a(x)$, which are respectively even and odd. Let us denote
by $A_s$ and $A'_s$, $A_a$ and $A'_a$ the values of the coefficients $A$ and $A'$, introduced in (5),
which correspond to $\chi_s(x)$ and $\chi_a(x)$. We have, obviously:

$$A'_s = A_s, \qquad A'_a = -A_a \tag{7}$$

The eigenvalues associated with $\chi_s$ and $\chi_a$ will be denoted by $E_s$ and $E_a$, which
enables us, using (6), to define the corresponding values $k_s$ and $k_a$ of the
parameter $k$.

In the interval $-(b - a/2) \le x \le (b - a/2)$, the wave function is no longer
zero, as it was before, since $V_0$ is finite. It must be a linear combination, even or
odd depending on whether we are considering $\chi_s$ or $\chi_a$, of exponentials $e^{q_{s,a}x}$
and $e^{-q_{s,a}x}$; $q_s$ and $q_a$ are defined in terms of $E_{s,a}$ and $V_0$ by:

$$q_{s,a} = \sqrt{\frac{2m}{\hbar^2}\left(V_0 - E_{s,a}\right)} = \sqrt{\alpha^2 - k_{s,a}^2} \tag{8}$$

with:

$$V_0 = \frac{\hbar^2\alpha^2}{2m} \tag{9}$$

Therefore, for $-(b - a/2) \le x \le (b - a/2)$, the functions $\chi_s$ and $\chi_a$ are written:

$$\chi_s(x) = B_s \cosh(q_s x)$$

$$\chi_a(x) = B_a \sinh(q_a x) \tag{10}$$





A SIMPLE MODEL OF THE AMMONIA MOLECULE 


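Finally, we must match the eigenfunctions and their derivatives at $x = \pm(b - a/2)$.
The even solution $\chi_s(x)$ must therefore satisfy the conditions:

$$A_s \sin(k_s a) = B_s \cosh\left[q_s\left(b - \frac{a}{2}\right)\right]$$

$$-A_s k_s \cos(k_s a) = B_s q_s \sinh\left[q_s\left(b - \frac{a}{2}\right)\right] \tag{11}$$

Since $A_s$ and $B_s$ cannot be zero simultaneously, we can take the ratio of
equations (11) to obtain:

$$\tan(k_s a) = -\frac{k_s}{q_s} \coth\left[q_s\left(b - \frac{a}{2}\right)\right] \tag{12}$$

For the odd solution $\chi_a(x)$, we obtain in the same way:

$$\tan(k_a a) = -\frac{k_a}{q_a} \tanh\left[q_a\left(b - \frac{a}{2}\right)\right] \tag{13}$$

If $q_s$ and $q_a$ are replaced by their values in terms of $k_s$ and $k_a$, relations (12) and (13)
can be written:

$$\tan(k_s a) = -\frac{k_s}{\sqrt{\alpha^2 - k_s^2}} \coth\left[\sqrt{\alpha^2 - k_s^2}\left(b - \frac{a}{2}\right)\right] \tag{14}$$

and:

$$\tan(k_a a) = -\frac{k_a}{\sqrt{\alpha^2 - k_a^2}} \tanh\left[\sqrt{\alpha^2 - k_a^2}\left(b - \frac{a}{2}\right)\right] \tag{15}$$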

In theory, therefore, the problem is solved. Relations (14) and (15) express
the energy quantization since they give the possible values of $k_s$ and $k_a$ and therefore,
thanks to relation (6), the energies $E_s$ and $E_a$ (with the condition that they be less
than $V_0$). The transcendental equations (14) and (15) can be solved graphically.
A certain number of roots are found: $k_s^1, k_s^2, \ldots, k_a^1, k_a^2, \ldots$ The root $k_s^n$ is different
from $k_a^n$, since equations (14) and (15) are not the same: the energies $E_s^n$ and $E_a^n$ are
therefore different. Of course, when $V_0$ becomes very large, $k_s^n$ and $k_a^n$ both approach
the value $n\pi/a$ found in the preceding section; this can be seen by letting $\alpha$
approach infinity in equations (14) and (15), which yields $\tan(k_{s,a}\,a) = 0$, an
equation equivalent to (2). The energies $E_s^n$ and $E_a^n$ therefore approach the
value $E_n = n^2\pi^2\hbar^2/2ma^2$ calculated in the preceding section for $V_0$ approaching
infinity. Finally, it is easy to see that, the more $V_0$ exceeds $E_n$, the closer together
the two energies $E_s^n$ and $E_a^n$ will be.
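As an alternative to the graphical solution, equations (14) and (15) can also be solved numerically. The following is only a minimal sketch under stated assumptions: units with $\hbar = m = 1$, illustrative values $a = b = 1$, and a barrier high enough that $\alpha > \pi/a$. The function name `first_doublet` and the parameter values are hypothetical and do not come from the text. Each equation is multiplied by $q\cos(ka)$ so that the poles of the tangent are avoided, and the lowest root is bracketed between $\pi/2a$ and $\pi/a$.

```python
import math


def first_doublet(alpha, a=1.0, b=1.0):
    """Return (k_s, k_a), the lowest roots of equations (14) and (15).

    Assumptions: hbar = m = 1, so that V0 = alpha**2 / 2, and alpha > pi / a.
    """
    half_gap = b - a / 2.0  # half-width of the central barrier

    def f(k, even):
        q = math.sqrt(alpha**2 - k**2)
        t = math.tanh(q * half_gap)
        hyper = 1.0 / t if even else t  # coth for (14), tanh for (15)
        # Equation (14) or (15) multiplied by q * cos(k a)
        return q * math.sin(k * a) + k * math.cos(k * a) * hyper

    def bisect(even):
        lo, hi = math.pi / (2 * a), math.pi / a  # f(lo) > 0 > f(hi)
        for _ in range(200):
            mid = 0.5 * (lo + hi)
            if f(mid, even) > 0:
                lo = mid
            else:
                hi = mid
        return 0.5 * (lo + hi)

    return bisect(True), bisect(False)


if __name__ == "__main__":
    for alpha in (10.0, 14.0):
        k_s, k_a = first_doublet(alpha)
        # Energies from relation (6) with hbar = m = 1
        print(alpha, k_s**2 / 2, k_a**2 / 2, (k_a**2 - k_s**2) / 2)
```

With these illustrative parameters, the printed splitting $E_a^1 - E_s^1$ should shrink as $\alpha$ grows, in line with the remark that the two energies come closer together as $V_0$ increasingly exceeds $E_n$.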

The exact values of $E_s^n$ and $E_a^n$ are of little importance to us here. We shall
content ourselves with sketching the shape of the energy spectrum in figure 6,
which shows what happens to the energies of levels $E_1$ and $E_2$ of figure 4 when
the finite height $V_0$ of the potential barrier is taken into account. We see that the
tunnel effect across this barrier removes the degeneracy of $E_1$ and $E_2$, giving
rise to doublets, $(E_s^1, E_a^1)$ and $(E_s^2, E_a^2)$ (assuming, of course, that all these energies
are less than $V_0$). Since the $(E_s^1, E_a^1)$ doublet is the deeper one, it is clear
that $|E_a^1 - E_s^1| < |E_a^2 - E_s^2|$. Finally, the distance between the doublets is much
greater than the spacing within each doublet (experimentally, their ratio is of the
order of a thousand). These spacings enable us, moreover, to define new Bohr
frequencies:

$$\frac{\Omega_1}{2\pi} = \frac{E_a^1 - E_s^1}{h}, \qquad \frac{\Omega_2}{2\pi} = \frac{E_a^2 - E_s^2}{h}$$

whose physical significance we shall study in the next paragraph (the corresponding
transitions are represented by arrows in figure 6).






FIGURE 6

When one takes the finite height $V_0$ of the barrier into account,
one finds that the energy spectrum of figure 4 is modified: each
level splits into two distinct ones, separated by $\hbar\Omega_1$ and $\hbar\Omega_2$
respectively. The Bohr frequencies $\Omega_1/2\pi$
and $\Omega_2/2\pi$ corresponding to tunnelling from one well to the other
are the inversion frequencies of the ammonia molecule for the
first two vibration levels. The tunnel effect is more important in
the higher vibration level, so $\Omega_2 > \Omega_1$.


Finally, in figure 7, we have shown the shape of the eigenfunctions $\chi_s^1(x)$
and $\chi_a^1(x)$ which are given by equations (5), (7) and (10), once $k_s^1$ and $k_a^1$ have
been determined from (14) and (15). We see that they greatly resemble the
functions $\varphi_s^1(x)$ and $\varphi_a^1(x)$ of figure 5, the essential difference being that the wave
function is no longer zero in the interval $-(b - a/2) \le x \le (b - a/2)$. The reason
for introducing the $\varphi_s^1$ and $\varphi_a^1$ basis in the preceding paragraph can now be
understood: the eigenfunctions $\chi_s^1$ and $\chi_a^1$, in the presence of the tunnel effect,
resemble $\varphi_s^1$ and $\varphi_a^1$ much more than $\varphi_1^1$ and $\varphi_2^1$.


FIGURE 7

Wave functions associated with the levels $E_s^1$
and $E_a^1$ in figure 6. Note the analogy with the
functions in figure 5-b; however, these new
functions do not vanish on the interval
$-b + a/2 \le x \le b - a/2$.




c. EVOLUTION OF THE MOLECULE. INVERSION FREQUENCY 

Assume that at time $t = 0$, the molecule is in the state:

$$|\psi(t=0)\rangle = \frac{1}{\sqrt{2}}\left[\,|\chi_s^1\rangle + |\chi_a^1\rangle\,\right] \tag{16}$$

The state vector $|\psi(t)\rangle$ at time $t$ can be obtained by using the general formula (D-54)
of chapter III; we obtain:

$$|\psi(t)\rangle = \frac{1}{\sqrt{2}}\, e^{-i\frac{E_s^1 + E_a^1}{2\hbar}t} \left[\, e^{+i\Omega_1 t/2}\,|\chi_s^1\rangle + e^{-i\Omega_1 t/2}\,|\chi_a^1\rangle \,\right] \tag{17}$$

From this we deduce the probability density:

$$|\psi(x,t)|^2 = \frac{1}{2}\left[\chi_s^1(x)\right]^2 + \frac{1}{2}\left[\chi_a^1(x)\right]^2 + \cos(\Omega_1 t)\,\chi_s^1(x)\,\chi_a^1(x) \tag{18}$$

The variation with respect to time of this probability density is simple to
obtain graphically from the curves of figure 7. It is shown in figure 8.
For $t = 0$ (fig. 8-a), we see that the initial state chosen in (16) corresponds to
a probability density which is concentrated in the right-hand well (in the left-hand
well, the functions $\chi_s^1$ and $\chi_a^1$ are of opposite sign and very close in absolute value,
so their sum is practically zero). It can therefore be said that the particle, initially,
is practically in the right-hand well. At time $t = \pi/2\Omega_1$ (fig. 8-b), it has moved
appreciably, through the tunnel effect, into the left-hand well, is practically there
at time $t = \pi/\Omega_1$ (fig. 8-c), and then performs the process in reverse (figures 8-d
and 8-e).


FIGURE 8

Evolution of a wave packet obtained by superposing
the two stationary wave functions of figure 7.
The particle, initially, is in the right-hand well
(fig. a), tunnels into the left-hand well (fig. b) and,
after a certain time, becomes localized there (fig. c);
the process then runs in reverse (fig. d) back to the initial state (fig. e), and so on.

The fictitious particle therefore moves from one side of the potential barrier
to the other with the frequency $\Omega_1/2\pi$, which means that the plane of the hydrogen
atoms continually passes from one side of the nitrogen atom to the other. This is
why the frequency $\Omega_1/2\pi$ is called the inversion frequency of the molecule. Note that
this inversion frequency has no classical analogue, since its existence is related to the
tunnel effect of the fictitious particle across the potential barrier.

Since the nitrogen atom tends to attract the electrons of the three hydrogen
atoms, the ammonia molecule possesses an electric dipole moment which is
proportional to the mean value $\langle X \rangle$ of the position of the fictitious particle we
have studied; we see in figure 8 that this dipole moment is an oscillating function
of time. Under these conditions, the ammonia molecule is capable of emitting or
absorbing electromagnetic radiation of frequency $\Omega_1/2\pi$.

Experimentally, this is indeed observed; the value of $\Omega_1$ falls in the domain
of centimeter waves. In radioastronomy, ammonia molecules in interstellar space
have been shown to emit and absorb electromagnetic waves of this frequency.
Let us also point out that the principle of the ammonia maser is based on the
stimulated emission of these waves by the NH$_3$ molecule.

3. The ammonia molecule considered as a two-level system 

We see in figure 6 that we have a situation which is analogous to the one
mentioned in the introduction of § C of chapter IV. The system under study
possesses two levels, $E_s^1$ and $E_a^1$, which are very close to each other and very far
from all other levels $E_s^2, E_a^2, \ldots$ If we are interested only in the two levels $E_s^1$ and $E_a^1$,
we can "forget" all the others (the exact justification for such an approximation
will be given in the framework of perturbation theory in chapter XI).

We are going to return to the preceding discussion with a slightly different 
point of view and show that the general considerations of chapter IV concerning 
two-level systems can be applied to the ammonia molecule. This point of view will 
also enable us to study very simply the effect of a static external electric field 
on this molecule. 


a. THE STATE SPACE 


The state space we are going to consider is spanned by the two orthogonal
vectors $|\varphi_1^1\rangle$ and $|\varphi_2^1\rangle$, whose wave functions are given by (3). As we explained
above, we shall ignore the other states $|\varphi_1^n\rangle$ and $|\varphi_2^n\rangle$ for which $n > 1$. In the
states $|\varphi_1^1\rangle$ and $|\varphi_2^1\rangle$, the nitrogen atom is either above or below the plane of the
hydrogen atoms. We introduced in (4) a second orthonormal basis of the state
space, composed of the even and odd vectors:

$$|\varphi_s^1\rangle = \frac{1}{\sqrt{2}}\left[\,|\varphi_1^1\rangle + |\varphi_2^1\rangle\,\right]$$

$$|\varphi_a^1\rangle = \frac{1}{\sqrt{2}}\left[\,|\varphi_1^1\rangle - |\varphi_2^1\rangle\,\right] \tag{19}$$

There is the same probability in these two states of finding the nitrogen atom
above or below the plane of the hydrogen atoms.


b. ENERGY LEVELS. REMOVAL OF THE DEGENERACY 

DUE TO THE TRANSPARENCY OF THE POTENTIAL BARRIER 

When the height $V_0$ of the potential barrier is infinite, the states $|\varphi_1^1\rangle$
and $|\varphi_2^1\rangle$ have the same energy (as do the states $|\varphi_s^1\rangle$ and $|\varphi_a^1\rangle$), so that $H_0$,
the Hamiltonian of the system, is written:

$$H_0 = E_1 \times \mathbb{1} \tag{20}$$

(where $\mathbb{1}$ is the identity operator in the two-dimensional state space).

To take into account phenomenologically the fact that the barrier is not
infinite, we add to $H_0$ a perturbation $W$ which is non-diagonal in the $\{\,|\varphi_1^1\rangle, |\varphi_2^1\rangle\,\}$
basis and is represented by the matrix:

$$W = -A \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} \tag{21}$$

where $A$ is a real positive coefficient*.

If we want to find the stationary states of the molecule, we must now
diagonalize the total Hamiltonian operator $H = H_0 + W$, whose matrix is written:

$$H = \begin{pmatrix} E_1 & -A \\ -A & E_1 \end{pmatrix} \tag{22}$$

A simple calculation gives the eigenvalues and eigenvectors of $H$:

$$E_1 + A \ \text{ corresponding to the eigenket } |\varphi_a^1\rangle$$

$$E_1 - A \ \text{ corresponding to the eigenket } |\varphi_s^1\rangle \tag{23}$$

We see that, under the effect of the perturbation $W$, the two levels, which were
degenerate when $A$ was zero, now split; an energy difference, equal to $2A$, appears,
and the new eigenstates are the states $|\varphi_s^1\rangle$ and $|\varphi_a^1\rangle$. We again find the results
of § 2.

If, at time $t = 0$, the molecule is in the state $|\varphi_1^1\rangle$:

$$|\psi(t=0)\rangle = |\varphi_1^1\rangle = \frac{1}{\sqrt{2}}\left[\,|\varphi_s^1\rangle + |\varphi_a^1\rangle\,\right] \tag{24}$$

the state vector at time $t$ will be:

$$|\psi(t)\rangle = \frac{1}{\sqrt{2}}\, e^{-iE_1 t/\hbar}\left[\, e^{iAt/\hbar}\,|\varphi_s^1\rangle + e^{-iAt/\hbar}\,|\varphi_a^1\rangle \,\right]
= e^{-iE_1 t/\hbar}\left[\cos\left(\frac{At}{\hbar}\right)|\varphi_1^1\rangle + i \sin\left(\frac{At}{\hbar}\right)|\varphi_2^1\rangle\right] \tag{25}$$

In a measurement performed at time $t$, we therefore have a probability $\cos^2(At/\hbar)$
of finding the molecule in the state $|\varphi_1^1\rangle$ (the nitrogen atom above the plane of the
hydrogen atoms) and $\sin^2(At/\hbar)$ of finding it in the state $|\varphi_2^1\rangle$ (the nitrogen atom
below). Thus we again find that, under the effect of the coupling $W$, the ammonia
molecule inverts periodically.


* We are forced to assume $A > 0$ in order to obtain the relative disposition of the $E_s^1$ and $E_a^1$ levels
of figure 6 [see eigenvalues (23)].
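The periodic inversion can be checked with a few lines of code. This is a minimal sketch, assuming units in which $\hbar = 1$ by default and an arbitrary illustrative value of $A$; the function name `inversion_probabilities` is hypothetical. It expands the initial localized state on the symmetric and antisymmetric stationary states of energies $E_1 - A$ and $E_1 + A$, lets each acquire its phase factor, and projects back onto the two localized states.

```python
import cmath
import math


def inversion_probabilities(t, A, E1=0.0, hbar=1.0):
    """Probabilities of finding the molecule in the first and second localized
    states at time t, starting from the first one at t = 0, for the
    two-level matrix H = [[E1, -A], [-A, E1]]."""
    # Stationary states: symmetric (energy E1 - A), antisymmetric (E1 + A).
    c_s = cmath.exp(-1j * (E1 - A) * t / hbar) / math.sqrt(2)
    c_a = cmath.exp(-1j * (E1 + A) * t / hbar) / math.sqrt(2)
    # Back to the localized basis: localized states = (sym +/- antisym)/sqrt(2).
    amp_1 = (c_s + c_a) / math.sqrt(2)
    amp_2 = (c_s - c_a) / math.sqrt(2)
    return abs(amp_1) ** 2, abs(amp_2) ** 2


if __name__ == "__main__":
    A = 0.3  # illustrative coupling, arbitrary units
    for t in (0.0, math.pi / (4 * A), math.pi / (2 * A)):
        print(t, inversion_probabilities(t, A))
```

In this sketch the molecule is entirely in the second localized state at $t = \pi\hbar/2A$, that is, after half an inversion period.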

comment : 

The perturbation $W$ [given in (21)] describes (phenomenologically) the
fact that the potential barrier is finite. This approach is less precise than the
discussion above, since we obtain here eigenfunctions $\varphi_s^1(x)$ and $\varphi_a^1(x)$ which,
unlike $\chi_s^1$ and $\chi_a^1$, go to zero in the region $(-b + a/2) \le x \le (b - a/2)$.
This much simpler description nevertheless explains two fundamental
physical effects: the removal of the degeneracy of $E_1$ and the periodic
oscillation of the molecule between the states $|\varphi_1^1\rangle$ and $|\varphi_2^1\rangle$ (inversion).


c. INFLUENCE OF A STATIC ELECTRIC FIELD 


We saw above that, in the states $|\varphi_1^1\rangle$ and $|\varphi_2^1\rangle$, the electric dipole moment
of the molecule takes on two opposite values which we shall denote by $+\eta$ and
$-\eta$. If we call $D$ the observable associated with this physical quantity, we can
therefore assume that $D$ is represented in the $\{\,|\varphi_1^1\rangle, |\varphi_2^1\rangle\,\}$ basis by a diagonal
matrix whose eigenvalues are $+\eta$ and $-\eta$:

$$D = \begin{pmatrix} \eta & 0 \\ 0 & -\eta \end{pmatrix} \tag{26}$$

When the molecule is placed in a static electric field $\mathcal{E}$*, the interaction
energy with this field is:

$$W'(\mathcal{E}) = -\mathcal{E} D \tag{27}$$

This term of the Hamiltonian** is represented in the $\{\,|\varphi_1^1\rangle, |\varphi_2^1\rangle\,\}$ basis by the
matrix:

$$W'(\mathcal{E}) = -\eta\mathcal{E} \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} \tag{28}$$

Let us then write the matrix which represents, in the $\{\,|\varphi_1^1\rangle, |\varphi_2^1\rangle\,\}$ basis, the
total Hamiltonian operator of the molecule, $H_0 + W + W'(\mathcal{E})$:

$$H_0 + W + W'(\mathcal{E}) = \begin{pmatrix} E_1 - \eta\mathcal{E} & -A \\ -A & E_1 + \eta\mathcal{E} \end{pmatrix} \tag{29}$$


* For the sake of simplicity, we assume here that this field is parallel to the Ox axis of figure 1
(one-dimensional model).

** In $W'(\mathcal{E})$, $D$ is an observable, while $\mathcal{E}$ is a classical quantity which is externally imposed (cf. note,
page 446).
This matrix can easily be diagonalized; its eigenvalues $E_+$ and $E_-$ and its
eigenvectors $|\psi_+\rangle$ and $|\psi_-\rangle$ are given by:

$$E_+ = E_1 + \sqrt{A^2 + \eta^2\mathcal{E}^2}$$

$$E_- = E_1 - \sqrt{A^2 + \eta^2\mathcal{E}^2} \tag{30}$$

and:

$$|\psi_+\rangle = \cos\frac{\theta}{2}\,|\varphi_1^1\rangle - \sin\frac{\theta}{2}\,|\varphi_2^1\rangle$$

$$|\psi_-\rangle = \sin\frac{\theta}{2}\,|\varphi_1^1\rangle + \cos\frac{\theta}{2}\,|\varphi_2^1\rangle \tag{31}$$

where we have set:

$$\tan\theta = -\frac{A}{\eta\mathcal{E}}, \qquad 0 \le \theta < \pi \tag{32}$$

[cf. complement B$_{IV}$, relations (9), (10), (22) and (23); since the off-diagonal element $-A$ is real and negative,
the angle $\varphi$ introduced in that complement is here equal to $\pi$].

When $\mathcal{E}$ is zero, $\theta = \pi/2$, and we again obtain the results of § 3-b, since:

$$E_+(\mathcal{E} = 0) = E_1 + A$$

$$E_-(\mathcal{E} = 0) = E_1 - A \tag{33}$$

with:

$$|\psi_+(\mathcal{E} = 0)\rangle = |\varphi_a^1\rangle$$

$$|\psi_-(\mathcal{E} = 0)\rangle = |\varphi_s^1\rangle \tag{34}$$

When, for arbitrary $\mathcal{E}$, $A$ is zero (a perfectly opaque potential barrier), we
obtain:

$$E_+(A = 0) = E_1 + \eta|\mathcal{E}|$$

$$E_-(A = 0) = E_1 - \eta|\mathcal{E}| \tag{35}$$

with, if $\mathcal{E}$ is positive*:

$$|\psi_-(A = 0)\rangle = |\varphi_1^1\rangle$$

$$|\psi_+(A = 0)\rangle = -|\varphi_2^1\rangle \tag{36}$$

In this case, the energies therefore vary linearly with $\mathcal{E}$ (dashed straight lines
in figure 9). Physically, results (35) and (36) are easy to understand: when the
electric field alone acts on the molecule, it "pulls" the positively charged hydrogen
atoms above or below the nitrogen atom; this is why the stationary states
are $|\varphi_1^1\rangle$ and $|\varphi_2^1\rangle$.

When the electric field $\mathcal{E}$ and the coupling constant $A$ are both arbitrary,
the states $|\psi_+\rangle$ and $|\psi_-\rangle$ are linear superpositions of the states $|\varphi_1^1\rangle$ and $|\varphi_2^1\rangle$
(and of the states $|\varphi_s^1\rangle$ and $|\varphi_a^1\rangle$ as well), and result from a compromise between
the action of the electric field, which tends to pull the hydrogen atoms to one side
of the nitrogen atom, and that of the coupling $W$, which tends to draw the nitrogen
atom across the potential barrier. The variation of the energies $E_+$ and $E_-$ is
shown graphically in figure 9, in which we see the phenomenon of anti-crossing
(cf. chap. IV, § C-2-b) due to the coupling $W$. $E_+$ and $E_-$ correspond to the
two branches of a hyperbola whose asymptotes are the dashed lines associated
with the energies in the absence of coupling.


* If $\mathcal{E}$ is negative, the roles of $|\varphi_1^1\rangle$ and $|\varphi_2^1\rangle$ are inverted in (36).


FIGURE 9

Influence of an electric field $\mathcal{E}$ on the first two levels of the ammonia molecule (their spacing $2A$ in a
zero field is due to the tunnel effect coupling). For weak $\mathcal{E}$, the molecule acquires a dipole moment
proportional to $\mathcal{E}$, and the corresponding energy varies with $\mathcal{E}^2$. For large $\mathcal{E}$, the dipole moment
approaches a limit (corresponding to the nitrogen atom either above or below the plane of the hydrogen
atoms), and the energy becomes a linear function of $\mathcal{E}$.

Finally, we can calculate the mean values of the electric dipole moment $D$
in each of the two stationary states $|\psi_+\rangle$ and $|\psi_-\rangle$. Using (26) and (31), we find:

$$\langle\psi_+|D|\psi_+\rangle = -\langle\psi_-|D|\psi_-\rangle = \eta\cos\theta \tag{37}$$

which, according to (32), yields:

$$\langle\psi_+|D|\psi_+\rangle = -\langle\psi_-|D|\psi_-\rangle = -\frac{\eta^2\mathcal{E}}{\sqrt{A^2 + \eta^2\mathcal{E}^2}} \tag{38}$$

For $\mathcal{E} = 0$, these two mean values are zero. This corresponds to the fact that, in
the two states $|\varphi_{s,a}^1\rangle$, the particle has an equal probability of being in one or the
other of the two wells. On the other hand, when $\eta\mathcal{E} \gg A$, we again find the dipole
moment $+\eta$ (or $-\eta$) corresponding to the state $|\varphi_1^1\rangle$ (or $|\varphi_2^1\rangle$).
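The closed-form results (30), (31), (32) and (38) can be verified numerically against the matrix (29). The sketch below is illustrative only: the function name `stark_levels` and the numerical values of $A$, $\eta$ and the field are assumptions chosen for the check, not values from the text. The two-argument arctangent is used to select the branch $0 \le \theta \le \pi$ of relation (32).

```python
import math


def stark_levels(field, A, eta, E1=0.0):
    """Eigenvalues (30), eigenvectors (31) and mean dipole (37) of matrix (29).

    Returns (E_plus, E_minus, psi_plus, psi_minus, dipole_plus); eigenvectors
    are given as component pairs in the localized basis.
    """
    root = math.sqrt(A**2 + (eta * field) ** 2)
    # Branch of tan(theta) = -A / (eta * field) lying between 0 and pi
    theta = math.atan2(A, -eta * field)
    psi_plus = (math.cos(theta / 2), -math.sin(theta / 2))
    psi_minus = (math.sin(theta / 2), math.cos(theta / 2))
    dipole_plus = eta * math.cos(theta)
    return E1 + root, E1 - root, psi_plus, psi_minus, dipole_plus


if __name__ == "__main__":
    A, eta = 1.0, 0.5  # illustrative values, arbitrary units
    for field in (0.0, 0.2, 2.0, 20.0):
        E_plus, E_minus, _, _, d_plus = stark_levels(field, A, eta)
        print(field, E_plus, E_minus, d_plus)
```

Scanning the field in this way should reproduce the qualitative behavior described above: a dipole moment that starts at zero, grows in magnitude with the field, and saturates at $\eta$ when $\eta\mathcal{E} \gg A$.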

When the electric field is weak ($\eta\mathcal{E} \ll A$), formulas (38) can be written in the
form:

$$\langle\psi_+|D|\psi_+\rangle = -\langle\psi_-|D|\psi_-\rangle \simeq -\frac{\eta^2}{A}\,\mathcal{E} \tag{39}$$

We see that the molecule in the stationary state $|\psi_+\rangle$ (or $|\psi_-\rangle$) acquires an
electric dipole moment proportional to the external field $\mathcal{E}$. If we define an
electric susceptibility $\varepsilon_-$ of the molecule in the state $|\psi_-\rangle$ by the relation:

$$\langle\psi_-|D|\psi_-\rangle = \varepsilon_-\,\mathcal{E} \tag{40}$$

we find, according to (39), that:

$$\varepsilon_- = \frac{\eta^2}{A} \tag{41}$$

(the same calculations are valid for $|\psi_+\rangle$ and yield $\varepsilon_+ = -\varepsilon_-$).


COMMENT : 


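In a weak field, formulas (30) can be expanded in power series of $\eta\mathcal{E}/A$:

$$E_- = E_1 - A - \frac{1}{2}\frac{\eta^2\mathcal{E}^2}{A} + \ldots \tag{42-a}$$

$$E_+ = E_1 + A + \frac{1}{2}\frac{\eta^2\mathcal{E}^2}{A} + \ldots \tag{42-b}$$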
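For reference, the weak-field expansion follows from factoring $A$ out of the square root in (30) and using $\sqrt{1 + u} \simeq 1 + u/2 - u^2/8$ for small $u$. The short sketch below is valid only under the assumption $\eta\mathcal{E} \ll A$; the fourth-order term is shown merely to indicate the size of the neglected correction.

```latex
\sqrt{A^2 + \eta^2\mathcal{E}^2}
  = A\,\sqrt{1 + \frac{\eta^2\mathcal{E}^2}{A^2}}
  \simeq A + \frac{\eta^2\mathcal{E}^2}{2A}
       - \frac{\eta^4\mathcal{E}^4}{8A^3} + \dots
```

Substituting this into (30) with the sign appropriate to each level gives the quadratic field dependence of the two energies.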


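Let us now consider ammonia molecules moving in a region where $\mathcal{E}$ is
weak but where $\mathcal{E}^2$ has a strong gradient in the Ox direction (i.e. along the
axis of the molecules):

$$\frac{\mathrm{d}}{\mathrm{d}x}\left(\mathcal{E}^2\right) = \lambda \tag{43}$$

According to (42-a), the molecules in the state $|\psi_-\rangle$ are subjected to a force
parallel to Ox which is equal to:

$$F_- = -\frac{\mathrm{d}E_-}{\mathrm{d}x} = \frac{1}{2}\,\lambda\,\frac{\eta^2}{A} \tag{44}$$

Relation (42-b) indicates that the molecules in the state $|\psi_+\rangle$ are subjected
to an opposite force:

$$F_+ = -\frac{\mathrm{d}E_+}{\mathrm{d}x} = -F_- \tag{45}$$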


This result is the basis of the method which is used in the ammonia maser 
to sort the molecules and select those in the higher energy state. The device 
used is analogous to the Stern-Gerlach apparatus: a beam of ammonia 
molecules crosses a region where there is a strong electric field gradient; the 
molecules follow different trajectories depending on whether they are in 
one state or the other; one can, using a suitable diaphragm, isolate either 
one of the two states. 


References and suggestions for further reading: 

Feynman III (1.2), §8-6 and chap. 9; Alonso and Finn III (1.4), §2-8; article by 
Vuylsteke (1.34); Townes and Schawlow (12.10), chap. 12; see (15.11) for references to 
original articles on masers; articles by Lyons (15.14), Gordon (15.15), and Turner (12.14). 
See also Encrenaz (12.11), chap. VI. 
Complement H$_{IV}$

EFFECTS OF A COUPLING BETWEEN A STABLE STATE 
AND AN UNSTABLE STATE 


1. Introduction. Notation 

2. Influence of a weak coupling on states of different energies 

3. Influence of an arbitrary coupling on states of the same energy 


1. Introduction. Notation 


The effects of a coupling $W$ between two states $|\varphi_1\rangle$ and $|\varphi_2\rangle$ of energies $E_1$
and $E_2$ were discussed in detail in § C of chapter IV. What modifications appear
when one of the two states ($|\varphi_1\rangle$, for example) is unstable?

The concepts of an unstable state and a lifetime were introduced in
complement K$_{III}$. We shall assume, for example, that $|\varphi_1\rangle$ is an excited atomic
state. When the atom is in this state, it can fall back to a lower energy level through
spontaneous emission of one or several photons, with a probability $1/\tau_1$ per
unit time: $\tau_1$ is the lifetime of the unstable state $|\varphi_1\rangle$. On the other hand, we assume
that in the absence of the coupling $W$, the state $|\varphi_2\rangle$ is stable ($\tau_2$ is infinite).

We saw in complement K$_{III}$ that a simple way of taking the instability of
a state into account consists of adding an imaginary term to the corresponding
energy. We shall therefore replace the energy $E_1$ of the state $|\varphi_1\rangle$ by:

$$E'_1 = E_1 - i\frac{\hbar}{2}\gamma_1 \tag{1}$$

with:

$$\gamma_1 = \frac{1}{\tau_1} \tag{2}$$

(since $\tau_2$ is infinite, $\gamma_2$ is zero and $E'_2 = E_2$). In the absence of coupling, the matrix
which represents the "Hamiltonian" $H_0$ of the system can therefore be written
in the $\{\,|\varphi_1\rangle, |\varphi_2\rangle\,\}$ basis*:

$$H_0 = \begin{pmatrix} E_1 - i\dfrac{\hbar}{2}\gamma_1 & 0 \\ 0 & E_2 \end{pmatrix} \tag{3}$$


* The operator $H_0$ is not Hermitian and is therefore not really a Hamiltonian (see the comment
at the end of complement K$_{III}$).
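2. Influence of a weak coupling
on states of different energies


Now let us assume, as in § C of chapter IV, that we add to $H_0$ a perturbation
$W$, whose matrix in the $\{\,|\varphi_1\rangle, |\varphi_2\rangle\,\}$ basis is:

$$W = \begin{pmatrix} 0 & W_{12} \\ W_{21} & 0 \end{pmatrix} \tag{4}$$

What now happens to the energies and lifetimes of the levels?
Let us calculate the eigenvalues $\varepsilon'_1$ and $\varepsilon'_2$ of the matrix:

$$H = H_0 + W = \begin{pmatrix} E_1 - i\dfrac{\hbar}{2}\gamma_1 & W_{12} \\ W_{21} & E_2 \end{pmatrix} \tag{5}$$

$\varepsilon'_1$ and $\varepsilon'_2$ are the solutions of the equation in $\varepsilon$:

$$\varepsilon^2 - \varepsilon\left(E_1 + E_2 - i\frac{\hbar}{2}\gamma_1\right) + E_1 E_2 - i\frac{\hbar}{2}\gamma_1 E_2 - |W_{12}|^2 = 0 \tag{6}$$

To simplify the calculation, we shall confine ourselves to the case where the
coupling is weak $\left(|W_{12}| \ll \sqrt{(E_1 - E_2)^2 + \hbar^2\gamma_1^2/4}\,\right)$;
we then find:

$$\varepsilon'_1 \simeq E_1 - i\frac{\hbar}{2}\gamma_1 + \frac{|W_{12}|^2}{E_1 - E_2 - i\dfrac{\hbar}{2}\gamma_1}$$

$$\varepsilon'_2 \simeq E_2 + \frac{|W_{12}|^2}{E_2 - E_1 + i\dfrac{\hbar}{2}\gamma_1} \tag{7}$$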


The energies of the eigenstates in the presence of the coupling are the real
parts of $\varepsilon'_1$ and $\varepsilon'_2$; the lifetimes are inversely proportional to their imaginary parts.
We see from (7) that the coupling changes, to second order in $|W_{12}|$, both the energies
and the lifetimes. In particular, we observe that $\varepsilon'_1$ and $\varepsilon'_2$ are both complex
when $|W_{12}|$ is not zero: in the presence of the coupling, there is no longer any
stable state. We can write $\varepsilon'_2$ in the form:

$$\varepsilon'_2 = \Delta_2 - i\frac{\hbar}{2}\Gamma_2 \tag{8}$$

with:

$$\Delta_2 = E_2 + |W_{12}|^2\,\frac{E_2 - E_1}{(E_2 - E_1)^2 + \dfrac{\hbar^2}{4}\gamma_1^2} \tag{9-a}$$

$$\Gamma_2 = \gamma_1\,\frac{|W_{12}|^2}{(E_2 - E_1)^2 + \dfrac{\hbar^2}{4}\gamma_1^2} \tag{9-b}$$

The state $|\varphi_2\rangle$ therefore acquires, under the effect of the coupling, a finite
lifetime whose inverse is given in (9-b) (Bethe's formula). This result is easy to
understand physically: if the system at $t = 0$ is in the stable state $|\varphi_2\rangle$, there
is a non-zero probability at a subsequent time $t$ of finding it in the state $|\varphi_1\rangle$, in
which the system has a finite lifetime. It is sometimes said figuratively that "the
coupling brings into the stable state part of the instability of the other state".
Moreover, it can be seen from expressions (7) that, as in the case studied in § C
of chapter IV, the smaller the difference between the unperturbed energies $E_1$
and $E_2$, the more effectively the perturbation acts on the energies and lifetimes.
We shall therefore study in the next section the case where this difference is zero.
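The second-order expressions (9-a) and (9-b) can be compared with the exact roots of the characteristic equation (6). The following is a minimal sketch with hypothetical function names and illustrative parameter values ($\hbar = 1$ by default); it solves the quadratic with a complex square root and picks the root that tends to $E_2$ when the coupling vanishes.

```python
import cmath


def perturbed_stable_level(E1, E2, gamma1, W12, hbar=1.0):
    """Exact eigenvalue of matrix (5) closest to E2, returned as
    (Delta_2, Gamma_2) in the notation of relation (8)."""
    E1c = E1 - 0.5j * hbar * gamma1          # complex energy, relation (1)
    trace = E1c + E2
    det = E1c * E2 - abs(W12) ** 2
    disc = cmath.sqrt(trace**2 - 4 * det)    # discriminant of equation (6)
    roots = ((trace + disc) / 2, (trace - disc) / 2)
    eps2 = min(roots, key=lambda e: abs(e - E2))
    return eps2.real, -2 * eps2.imag / hbar


def bethe_estimate(E1, E2, gamma1, W12, hbar=1.0):
    """Second-order expressions (9-a) and (9-b)."""
    denom = (E2 - E1) ** 2 + (hbar * gamma1) ** 2 / 4
    delta2 = E2 + abs(W12) ** 2 * (E2 - E1) / denom
    gamma2 = gamma1 * abs(W12) ** 2 / denom
    return delta2, gamma2


if __name__ == "__main__":
    params = dict(E1=0.0, E2=1.0, gamma1=0.2, W12=0.01)  # illustrative values
    print(perturbed_stable_level(**params))
    print(bethe_estimate(**params))
```

For a coupling this weak the two results should agree closely; increasing `W12` toward the energy difference makes the discrepancy grow, which marks the limit of the weak-coupling assumption.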


3. Influence of an arbitrary coupling 
on states of the same energy 


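When the energies $E_1$ and $E_2$ are equal, the operator $H$ is written, if we
make its trace appear explicitly, as in § 2 of complement B$_{IV}$:

$$H = \left(E_1 - i\frac{\hbar}{4}\gamma_1\right)\mathbb{1} + K \tag{10}$$

where $\mathbb{1}$ is the identity operator and $K$ is the operator which, in the $\{\,|\varphi_1\rangle, |\varphi_2\rangle\,\}$
basis, has for its matrix:

$$(K) = \begin{pmatrix} -i\dfrac{\hbar}{4}\gamma_1 & W_{12} \\ W_{21} & i\dfrac{\hbar}{4}\gamma_1 \end{pmatrix} \tag{11}$$

The eigenvalues $k_1$ and $k_2$ of $K$ are the two solutions of the characteristic
equation:

$$k^2 = |W_{12}|^2 - \left(\frac{\hbar}{4}\gamma_1\right)^2 \tag{12}$$

They therefore have opposite values:

$$k_1 = -k_2 \tag{13}$$

which yields for the eigenvalues of $H$:

$$\varepsilon'_1 = E_1 - i\frac{\hbar}{4}\gamma_1 + k_1$$

$$\varepsilon'_2 = E_1 - i\frac{\hbar}{4}\gamma_1 - k_1 \tag{14}$$

The eigenvectors of $H$ and $K$ are the same; a simple calculation enables us
to obtain these vectors $|\psi'_1\rangle$ and $|\psi'_2\rangle$*:

$$|\psi'_1\rangle = W_{12}\,|\varphi_1\rangle + \left(k_1 + i\frac{\hbar}{4}\gamma_1\right)|\varphi_2\rangle$$

$$|\psi'_2\rangle = W_{12}\,|\varphi_1\rangle + \left(-k_1 + i\frac{\hbar}{4}\gamma_1\right)|\varphi_2\rangle \tag{15}$$


* For the calculation performed here, it is not indispensable to normalize $|\psi'_1\rangle$ and $|\psi'_2\rangle$.
Note also that since $H$ is not Hermitian, $|\psi'_1\rangle$ and $|\psi'_2\rangle$ are not orthogonal.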
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Assume that the system at time t = 0 is in the state | <p 2 > (which would 
be stable in the absence of the coupling): 


| # = 0 ) > = | (/> 2 > = -^ [| > - | >] 

Using (14), we see that, at time t , the state vector is: 

I m > = w, e " ,E ‘^ I n > - Qik,,lh I n >] 


(16) 


(17) 


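The probability $\mathcal{P}_{21}(t)$ of finding the system at time t in the state $|\varphi_1\rangle$ is:

$$\begin{aligned}
\mathcal{P}_{21}(t) &= |\langle\varphi_1|\psi(t)\rangle|^2 \\
&= \frac{1}{4|k_1|^2}\;e^{-\gamma_1t/2}\;\left|\,e^{-ik_1t/\hbar}\,\langle\varphi_1|\psi'_1\rangle - e^{ik_1t/\hbar}\,\langle\varphi_1|\psi'_2\rangle\,\right|^2 \\
&= \frac{1}{4|k_1|^2}\;e^{-\gamma_1t/2}\;|W_{12}|^2\;\left|\,e^{-ik_1t/\hbar} - e^{ik_1t/\hbar}\,\right|^2
\end{aligned} \tag{18}$$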

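We shall distinguish between several cases:

(i) When the condition:

$$|W_{12}| > \frac{\hbar}{4}\,\gamma_1 \tag{19}$$

is satisfied, we obtain directly, using (12):

$$k_1 = \sqrt{|W_{12}|^2 - \left(\frac{\hbar}{4}\,\gamma_1\right)^2} \tag{20}$$

and the eigenvalues $\varepsilon'_1$ and $\varepsilon'_2$ are given by:

$$\begin{aligned}
\varepsilon'_1 &= E_1 + \sqrt{|W_{12}|^2 - \left(\frac{\hbar}{4}\,\gamma_1\right)^2} - i\,\frac{\hbar}{4}\,\gamma_1 \\
\varepsilon'_2 &= E_1 - \sqrt{|W_{12}|^2 - \left(\frac{\hbar}{4}\,\gamma_1\right)^2} - i\,\frac{\hbar}{4}\,\gamma_1
\end{aligned} \tag{21}$$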

$\varepsilon'_1$ and $\varepsilon'_2$ have the same imaginary part, but different real parts. The states
$|\psi'_1\rangle$ and $|\psi'_2\rangle$ therefore have the same lifetime, $2\tau_1$, but different energies.
Substituting (20) into (18), we obtain:

$$\mathcal{P}_{21}(t) = \frac{|W_{12}|^2}{|W_{12}|^2 - \left(\frac{\hbar}{4}\gamma_1\right)^2}\;e^{-\gamma_1t/2}\;\sin^2\!\left[\sqrt{|W_{12}|^2 - \left(\frac{\hbar}{4}\gamma_1\right)^2}\;\frac{t}{\hbar}\right] \tag{22}$$

The form of this result recalls Rabi’s formula [cf. chap. IV, equation (C-32)].
The function $\mathcal{P}_{21}(t)$ is represented by a damped sinusoid with time constant $2\tau_1$
(fig. 1). Condition (19) thus expresses the fact that the coupling is sufficiently
strong to make the system oscillate between the states $|\varphi_1\rangle$ and $|\varphi_2\rangle$ before
the instability of the state $|\varphi_1\rangle$ can make itself felt.

FIGURE 1

Effect of a strong coupling between a stable state $|\varphi_2\rangle$ and an unstable
state $|\varphi_1\rangle$. If the system is initially in the state $|\varphi_2\rangle$, the
probability of finding it in the state $|\varphi_1\rangle$ at time t presents damped oscillations.


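(ii) If, on the other hand, the condition:

$$|W_{12}| < \frac{\hbar}{4}\,\gamma_1 \tag{23}$$

is satisfied, we then have:

$$k_1 = i\,\sqrt{\left(\frac{\hbar}{4}\,\gamma_1\right)^2 - |W_{12}|^2} \tag{24}$$

$$\begin{aligned}
\varepsilon'_1 &= E_1 - i\left[\frac{\hbar}{4}\,\gamma_1 - \sqrt{\left(\frac{\hbar}{4}\,\gamma_1\right)^2 - |W_{12}|^2}\,\right] \\
\varepsilon'_2 &= E_1 - i\left[\frac{\hbar}{4}\,\gamma_1 + \sqrt{\left(\frac{\hbar}{4}\,\gamma_1\right)^2 - |W_{12}|^2}\,\right]
\end{aligned} \tag{25}$$

The states $|\psi'_1\rangle$ and $|\psi'_2\rangle$ then have the same energy and different
lifetimes. Formula (18) becomes:

$$\mathcal{P}_{21}(t) = \frac{|W_{12}|^2}{\left(\frac{\hbar}{4}\gamma_1\right)^2 - |W_{12}|^2}\;e^{-\gamma_1t/2}\;\sinh^2\!\left[\sqrt{\left(\frac{\hbar}{4}\gamma_1\right)^2 - |W_{12}|^2}\;\frac{t}{\hbar}\right] \tag{26}$$

This time, $\mathcal{P}_{21}(t)$ is a sum of damped exponentials (fig. 2).

This result has a simple physical interpretation: condition (23) expresses
the fact that the lifetime $\tau_1$ is so short that the system is completely damped
before the coupling W has had the time to make it oscillate between the
states $|\varphi_1\rangle$ and $|\varphi_2\rangle$.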



FIGURE 2

When the coupling is weak, oscillations between the
states $|\varphi_1\rangle$ and $|\varphi_2\rangle$ do not have time to occur.

(iii) Finally, let us examine the case where we have exactly:

$$|W_{12}| = \frac{\hbar}{4}\,\gamma_1 \tag{27}$$

We see then from (14) that the states $|\psi'_1\rangle$ and $|\psi'_2\rangle$ both have the same
energy $E_1$ and the same lifetime $2\tau_1$.

Equations (22) and (26), in this case, take on indeterminate forms, which
both yield:

$$\mathcal{P}_{21}(t) = \frac{|W_{12}|^2}{\hbar^2}\;t^2\;e^{-\gamma_1t/2} \tag{28}$$


COMMENT: 

The preceding discussion is very analogous to that of the classical 
motion of a damped harmonic oscillator. Conditions (19), (23) and (27) 
correspond respectively to weak, strong and critical damping. 
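
The three regimes can be compared numerically from the single expression (18), since $k_1$ is real, imaginary or zero according to conditions (19), (23) and (27). The following is only a minimal sketch: the function name `p21`, the default $\hbar = 1$ and the numerical threshold used to detect the critical case are assumptions made for illustration, not part of the text.

```python
import cmath
import math


def p21(t, w12, gamma1, hbar=1.0):
    """Probability of finding |phi_1> at time t when the system starts in |phi_2>.

    Evaluates formula (18) with k_1 taken as a square root of the right-hand
    side of (12). The same expression covers conditions (19), (23) and (27).
    """
    k1 = cmath.sqrt(abs(w12) ** 2 - (hbar * gamma1 / 4.0) ** 2)
    damping = math.exp(-gamma1 * t / 2.0)
    if abs(k1) < 1e-12:
        # Critical case (27): sin(k1 t / hbar) / k1 tends to t / hbar, giving (28).
        return abs(w12) ** 2 * (t / hbar) ** 2 * damping
    # |e^{-i k1 t/hbar} - e^{i k1 t/hbar}|^2 / (4 |k1|^2) = |sin(k1 t/hbar) / k1|^2
    return abs(w12) ** 2 * damping * abs(cmath.sin(k1 * t / hbar) / k1) ** 2


if __name__ == "__main__":
    # Illustrative values only (hbar = gamma1 = 1): strong, weak, critical coupling.
    for label, w in (("strong", 1.0), ("weak", 0.1), ("critical", 0.25)):
        print(label, [round(p21(t, w, 1.0), 4) for t in (0.5, 1.0, 2.0, 4.0)])
```

Using a complex square root avoids writing the sine and hyperbolic-sine branches separately; only the exactly critical value needs its own limit.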


References and suggestions for further reading: 

An important application of the phenomenon discussed in this complement is 
the shortening of the lifetime of a metastable state due to an electric field. See: Lamb 
and Retherford (3.11), App. II; Sobel’man (11.12), chap. 8, § 28-5.

Complement $J_{IV}$
EXERCISES


1. Consider a spin 1/2 particle of magnetic moment $\mathbf{M} = \gamma\mathbf{S}$. The spin
state space is spanned by the basis of the $|+\rangle$ and $|-\rangle$ vectors, eigenvectors of
$S_z$ with eigenvalues $+\hbar/2$ and $-\hbar/2$. At time t = 0, the state of the system is:

$$|\psi(t=0)\rangle = |+\rangle$$

a. If the observable $S_x$ is measured at time t = 0, what results can be
found, and with what probabilities?

b. Instead of performing the preceding measurement, we let the system
evolve under the influence of a magnetic field parallel to Oy, of modulus $B_0$.
Calculate, in the $\{\,|+\rangle, |-\rangle\,\}$ basis, the state of the system at time t.

c. At this time t, we measure the observables $S_x$, $S_y$, $S_z$. What values
can we find, and with what probabilities? What relation must exist between $B_0$
and t for the result of one of the measurements to be certain? Give a physical
interpretation of this condition.


2. Consider a spin 1/2 particle, as in the previous exercise (using the same
notation).

a. At time t = 0, we measure $S_y$ and find $+\hbar/2$. What is the state vector
$|\psi(0)\rangle$ immediately after the measurement?

b. Immediately after this measurement, we apply a uniform time-dependent
field parallel to Oz. The Hamiltonian operator of the spin H(t) is then written:

$$H(t) = \omega_0(t)\,S_z$$

Assume that $\omega_0(t)$ is zero for t < 0 and t > T and increases linearly from 0
to $\omega_0$ when $0 \le t \le T$ (T is a given parameter whose dimensions are those
of time). Show that at time t the state vector can be written:

$$|\psi(t)\rangle = \frac{1}{\sqrt{2}}\left[\,e^{i\theta(t)}\,|+\rangle + i\,e^{-i\theta(t)}\,|-\rangle\,\right]$$

where $\theta(t)$ is a real function of t (to be calculated by the student).

c. At a time $t = \tau > T$, we measure $S_y$. What results can we find, and
with what probabilities? Determine the relation which must exist between $\omega_0$
and T in order for us to be sure of the result. Give the physical interpretation.


3. Consider a spin 1/2 particle placed in a magnetic field $\mathbf{B}_0$ with components:

$$B_x = \frac{1}{\sqrt{2}}\,B_0 \qquad B_y = 0 \qquad B_z = \frac{1}{\sqrt{2}}\,B_0$$

The notation is the same as that of exercise 1.

a. Calculate the matrix representing, in the $\{\,|+\rangle, |-\rangle\,\}$ basis, the
operator H, the Hamiltonian of the system.

b. Calculate the eigenvalues and the eigenvectors of H.

c. The system at time t = 0 is in the state $|-\rangle$. What values can be
found if the energy is measured, and with what probabilities?

d. Calculate the state vector $|\psi(t)\rangle$ at time t. At this instant, $S_x$ is
measured; what is the mean value of the results that can be obtained? Give
a geometrical interpretation.


4. Consider the experimental device described in §B-2-b of chapter IV
(cf. fig. 8): a beam of atoms of spin 1/2 passes through one apparatus,
which serves as a “polarizer” in a direction which makes an angle $\theta$ with Oz
in the xOz plane, and then through another apparatus, the “analyzer”, which
measures the $S_z$ component of the spin. We assume in this exercise that
between the polarizer and the analyzer, over a length L of the atomic beam,
a magnetic field $\mathbf{B}_0$ is applied which is uniform and parallel to Ox. We call v
the speed of the atoms and T = L/v the time during which they are submitted
to the field $\mathbf{B}_0$. We set $\omega_0 = -\gamma B_0$.

a. What is the state vector $|\psi_1\rangle$ of a spin at the moment it enters the
analyzer?

b. Show that when the measurement is performed in the analyzer,
there is a probability equal to $\frac{1}{2}(1 + \cos\theta\cos\omega_0T)$ of finding $+\hbar/2$
and $\frac{1}{2}(1 - \cos\theta\cos\omega_0T)$ of finding $-\hbar/2$. Give a physical interpretation.

c. (This question and the following one involve the concept of a density
operator, defined in complement $E_{III}$. The reader is also advised to refer to
complement $E_{IV}$.) Show that the density matrix $\rho_1$ of a particle which enters the
analyzer is written, in the $\{\,|+\rangle, |-\rangle\,\}$ basis:

$$\rho_1 = \frac{1}{2}\begin{pmatrix} 1 + \cos\theta\cos\omega_0T & \sin\theta + i\cos\theta\sin\omega_0T \\ \sin\theta - i\cos\theta\sin\omega_0T & 1 - \cos\theta\cos\omega_0T \end{pmatrix}$$

Calculate $\mathrm{Tr}\{\rho_1S_x\}$, $\mathrm{Tr}\{\rho_1S_y\}$ and $\mathrm{Tr}\{\rho_1S_z\}$. Give an interpretation. Does
the density operator $\rho_1$ describe a pure state?

d. Now assume that the speed of an atom is a random variable, and
hence the time T is known only to within a certain uncertainty $\Delta T$. In addition,
the field $B_0$ is assumed to be sufficiently strong that $\omega_0\,\Delta T \gg 1$. The possible
values of the product $\omega_0T$ are then (modulo $2\pi$) all values included between 0
and $2\pi$, all of which are equally probable.

In this case, what is the density operator $\rho_2$ of an atom at the moment it
enters the analyzer? Does $\rho_2$ correspond to a pure case? Calculate the quantities
$\mathrm{Tr}\{\rho_2S_x\}$, $\mathrm{Tr}\{\rho_2S_y\}$ and $\mathrm{Tr}\{\rho_2S_z\}$. What is your interpretation? In which
case does the density operator describe a completely polarized spin? A completely
unpolarized spin?

Describe qualitatively the phenomena observed at the analyzer exit when
$\omega_0$ varies from zero to a value where the condition $\omega_0\,\Delta T \gg 1$ is satisfied.


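5. Evolution operator of a spin 1/2 (cf. complement $F_{III}$)

Consider a spin 1/2, of magnetic moment $\mathbf{M} = \gamma\mathbf{S}$, placed in a magnetic
field $\mathbf{B}_0$ of components $B_x = -\omega_x/\gamma$, $B_y = -\omega_y/\gamma$, $B_z = -\omega_z/\gamma$.

We set:

$$\omega_0 = -\gamma\,|\mathbf{B}_0|$$

a. Show that the evolution operator of this spin is:

$$U(t, 0) = e^{-iMt}$$

where M is the operator:

$$M = \frac{1}{\hbar}\left[\omega_xS_x + \omega_yS_y + \omega_zS_z\right] = \frac{1}{2}\left[\omega_x\sigma_x + \omega_y\sigma_y + \omega_z\sigma_z\right]$$

where $\sigma_x$, $\sigma_y$ and $\sigma_z$ are the three Pauli matrices (cf. complement $A_{IV}$).

Calculate the matrix which represents M in the $\{\,|+\rangle, |-\rangle\,\}$ basis
of eigenvectors of $S_z$. Show that:

$$M^2 = \frac{1}{4}\left[\omega_x^2 + \omega_y^2 + \omega_z^2\right] = \left(\frac{\omega_0}{2}\right)^2$$

b. Put the evolution operator into the form:

$$U(t, 0) = \cos\left(\frac{\omega_0t}{2}\right) - \frac{2i}{\omega_0}\,M\,\sin\left(\frac{\omega_0t}{2}\right)$$

c. Consider a spin which at time t = 0 is in the state $|\psi(0)\rangle = |+\rangle$.
Show that the probability $\mathcal{P}_{++}(t)$ of finding it in the state $|+\rangle$ at time t is:

$$\mathcal{P}_{++}(t) = |\langle +|\,U(t, 0)\,|+\rangle|^2$$

and derive the relation:

$$\mathcal{P}_{++}(t) = 1 - \frac{\omega_x^2 + \omega_y^2}{\omega_0^2}\,\sin^2\left(\frac{\omega_0t}{2}\right)$$

Give a geometrical interpretation.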
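
Before proving the identities of this exercise, it can be reassuring to check them numerically for one arbitrary field. The sketch below is an added illustration, not part of the exercise: it assumes NumPy, sets $\hbar = 1$, uses the hypothetical helper name `evolution_operator`, and takes $\omega_0$ as the positive norm of $(\omega_x, \omega_y, \omega_z)$, which is harmless because the closed form of question b is even in $\omega_0$.

```python
import numpy as np

# Pauli matrices in the { |+>, |-> } basis of eigenvectors of S_z.
SIGMA_X = np.array([[0, 1], [1, 0]], dtype=complex)
SIGMA_Y = np.array([[0, -1j], [1j, 0]], dtype=complex)
SIGMA_Z = np.array([[1, 0], [0, -1]], dtype=complex)


def evolution_operator(omega, t):
    """Return (U_exact, U_closed) for M = (omega . sigma) / 2, with hbar = 1.

    U_exact exponentiates M through its eigen-decomposition;
    U_closed is the cosine/sine form stated in question b.
    """
    wx, wy, wz = omega
    m = 0.5 * (wx * SIGMA_X + wy * SIGMA_Y + wz * SIGMA_Z)
    vals, vecs = np.linalg.eigh(m)  # M is Hermitian
    u_exact = vecs @ np.diag(np.exp(-1j * vals * t)) @ vecs.conj().T
    w0 = np.sqrt(wx**2 + wy**2 + wz**2)
    u_closed = np.cos(w0 * t / 2) * np.eye(2) - (2j / w0) * np.sin(w0 * t / 2) * m
    return u_exact, u_closed


if __name__ == "__main__":
    omega, t = (0.3, -1.1, 0.7), 2.4  # arbitrary illustrative values
    u_exact, u_closed = evolution_operator(omega, t)
    print(np.allclose(u_exact, u_closed))
    print(abs(u_exact[0, 0]) ** 2)  # probability of finding |+> again at time t
```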



6. Consider the system composed of two spin 1/2’s, $\mathbf{S}_1$ and $\mathbf{S}_2$, and the
basis of four vectors $|\pm, \pm\rangle$ defined in complement $D_{IV}$. The system at
time t = 0 is in the state:

$$|\psi(0)\rangle = \frac{1}{2}\,|+ +\rangle + \frac{1}{2}\,|+ -\rangle + \frac{1}{\sqrt{2}}\,|- -\rangle$$

a. At time t = 0, $S_{1z}$ is measured; what is the probability of finding $-\hbar/2$?
What is the state vector after this measurement? If we then measure $S_{1x}$, what
results can be found, and with what probabilities? Answer the same questions
for the case where the measurement of $S_{1z}$ yielded $+\hbar/2$.

b. When the system is in the state $|\psi(0)\rangle$ written above, $S_{1z}$ and $S_{2z}$ are
measured simultaneously. What is the probability of finding opposite results?
Identical results?

c. Instead of performing the preceding measurements, we let the system
evolve under the influence of the Hamiltonian:

$$H = \omega_1S_{1z} + \omega_2S_{2z}$$

What is the state vector $|\psi(t)\rangle$ at time t? Calculate at time t the mean values
$\langle\mathbf{S}_1\rangle$ and $\langle\mathbf{S}_2\rangle$. Give a physical interpretation.

d. Show that the lengths of the vectors $\langle\mathbf{S}_1\rangle$ and $\langle\mathbf{S}_2\rangle$ are less than $\hbar/2$.
What must be the form of $|\psi(0)\rangle$ for each of these lengths to be equal
to $\hbar/2$?

7. Consider the same system of two spin 1/2’s as in the preceding exercise;
the state space is spanned by the basis of four states $|\pm, \pm\rangle$.

a. Write the 4 x 4 matrix representing, in this basis, the $S_{1y}$ operator.
What are the eigenvalues and eigenvectors of this matrix?

b. The normalized state of the system is:

$$|\psi\rangle = \alpha\,|+ +\rangle + \beta\,|+ -\rangle + \gamma\,|- +\rangle + \delta\,|- -\rangle$$

where $\alpha$, $\beta$, $\gamma$ and $\delta$ are given complex coefficients. $S_{1x}$ and $S_{2y}$ are measured
simultaneously; what results can be found, and with what probabilities? What
happens to these probabilities if $|\psi\rangle$ is a tensor product of a vector of the state
space of the first spin and a vector of the state space of the second spin?

c. Same questions for a measurement of $S_{1y}$ and $S_{2y}$.

d. Instead of performing the preceding measurements, we measure only
$S_{2y}$. Calculate, first from the results of b and then from those of c, the
probability of finding $-\hbar/2$.


8. Consider an electron of a linear triatomic molecule formed by three
equidistant atoms. We use $|\varphi_A\rangle$, $|\varphi_B\rangle$, $|\varphi_C\rangle$ to denote three orthonormal
states of this electron, corresponding respectively to three wave functions localized
about the nuclei of atoms A, B, C. We shall confine ourselves to the subspace
of the state space spanned by $|\varphi_A\rangle$, $|\varphi_B\rangle$ and $|\varphi_C\rangle$.

[Figure: three atoms A, B, C on a line]

When we neglect the possibility of the electron jumping from one nucleus
to another, its energy is described by the Hamiltonian $H_0$ whose eigenstates
are the three states $|\varphi_A\rangle$, $|\varphi_B\rangle$, $|\varphi_C\rangle$ with the same eigenvalue $E_0$. The
coupling between the states $|\varphi_A\rangle$, $|\varphi_B\rangle$, $|\varphi_C\rangle$ is described by an additional
Hamiltonian W defined by:

$$\begin{aligned}
W\,|\varphi_A\rangle &= -\,a\,|\varphi_B\rangle \\
W\,|\varphi_B\rangle &= -\,a\,|\varphi_A\rangle - a\,|\varphi_C\rangle \\
W\,|\varphi_C\rangle &= -\,a\,|\varphi_B\rangle
\end{aligned}$$

where a is a real positive constant.

1. Calculate the energies and stationary states of the Hamiltonian
$H = H_0 + W$.

2. The electron at time t = 0 is in the state $|\varphi_A\rangle$. Discuss qualitatively
the localization of the electron at subsequent times t. Are there any values
of t for which it is perfectly localized about atom A, B or C?

3. Let D be the observable whose eigenstates are $|\varphi_A\rangle$, $|\varphi_B\rangle$, $|\varphi_C\rangle$
with respective eigenvalues $-d$, 0, d. D is measured at time t; what values can
be found, and with what probabilities?

4. When the initial state of the electron is arbitrary, what are the Bohr
frequencies that can appear in the evolution of $\langle D\rangle$? Give a physical
interpretation of D. What are the frequencies of the electromagnetic waves that can
be absorbed or emitted by the molecule?


9. A molecule is composed of six identical atoms $A_1, A_2, \ldots A_6$ which
form a regular hexagon. Consider an electron which can be localized on each
of the atoms. Call $|\varphi_n\rangle$ the state in which it is localized on the nth atom
(n = 1, 2, ... 6). The electron states will be confined to the space spanned by
the $|\varphi_n\rangle$, assumed to be orthonormal.

a. Define an operator R by the following relations:

$$R\,|\varphi_1\rangle = |\varphi_2\rangle;\quad R\,|\varphi_2\rangle = |\varphi_3\rangle;\quad \ldots;\quad R\,|\varphi_6\rangle = |\varphi_1\rangle$$

Find the eigenvalues and eigenstates of R. Show that the eigenvectors of R form
a basis of the state space.

b. When the possibility of the electron passing from one site to another
is neglected, its energy is described by a Hamiltonian $H_0$ whose eigenstates
are the six states $|\varphi_n\rangle$, with the same eigenvalue $E_0$. As in the previous
exercise, we describe the possibility of the electron jumping from one atom
to another by adding a perturbation W to the Hamiltonian $H_0$; W is defined by:

$$W\,|\varphi_1\rangle = -\,a\,|\varphi_6\rangle - a\,|\varphi_2\rangle;\quad W\,|\varphi_2\rangle = -\,a\,|\varphi_1\rangle - a\,|\varphi_3\rangle;\quad \ldots;\quad W\,|\varphi_6\rangle = -\,a\,|\varphi_5\rangle - a\,|\varphi_1\rangle$$

Show that R commutes with the total Hamiltonian $H = H_0 + W$. From this
deduce the eigenstates and eigenvalues of H. In these eigenstates, is the electron
localized? Apply these considerations to the benzene molecule.


Exercise 9

Reference: Feynman III (1.2), § 15-4.


A. INTRODUCTION

1. Importance of the harmonic oscillator in physics 

This chapter is devoted to the study of a particularly important physical 
system : the one-dimensional harmonic oscillator. 

The simplest example of such a system is that of a particle of mass m
moving in a potential which depends only on x and has the form:

$$V(x) = \frac{1}{2}\,kx^2 \tag{A-1}$$

(k is a real positive constant). The particle is attracted towards the x = 0 plane [the
minimum of V(x), corresponding to positions of stable equilibrium] by a restoring
force:

$$F_x = -\,\frac{dV}{dx} = -\,kx \tag{A-2}$$

which is proportional to the distance x between the particle and the x = 0 plane.
We know that in classical mechanics, the projection onto Ox of the particle’s motion
is a sinusoidal oscillation about x = 0, of angular frequency:

$$\omega = \sqrt{\frac{k}{m}} \tag{A-3}$$

Actually, a large number of systems are governed (at least approximately) 
by the harmonic oscillator equations. Whenever one studies the behavior of a 
physical system in the neighborhood of a stable equilibrium position, one arrives 
at equations which, in the limit of small oscillations, are those of a harmonic 
oscillator (see §A-2). The results we shall derive in this chapter are applicable, 
therefore, to a whole series of important physical phenomena — for example, the 
vibrations of the atoms of a molecule about their equilibrium position, the 
oscillations of atoms or ions of a crystalline lattice (phonons)*. 

The harmonic oscillator is also involved in the study of the electromagnetic
field. We know that in a cavity, there exist an infinite number of possible stationary
waves (normal modes of the cavity). The electromagnetic field can be expanded
in terms of these modes and it can be shown, using Maxwell’s equations, that
each of the coefficients of this expansion (which describe the state of the field at
each instant) obeys a differential equation which is identical to that of a harmonic
oscillator whose angular frequency $\omega$ is that of the associated normal mode. In
other words, the electromagnetic field is formally equivalent to a set of independent
harmonic oscillators (cf. complement $K_V$). The quantization of the field is obtained
by quantizing these oscillators associated with the various normal modes of the
cavity. Recall, moreover, that it was the study of the behavior of these oscillators
at thermal equilibrium (blackbody radiation) which, historically, led Planck to
introduce, for the first time in physics, the constant h which bears his name. We
shall see (cf. complement $L_V$) that the mean energy of a harmonic oscillator in
thermodynamic equilibrium at the temperature T is different for classical and
quantum mechanical oscillators.

* Complement $A_V$ is devoted to a qualitative study of some physical examples of harmonic
oscillators.

The harmonic oscillator also plays an important role in the description of
a set of identical particles which are all in the same quantum mechanical state
(they must obviously be bosons, cf. chap. XIV). As we shall see later, this is because
the energy levels of a harmonic oscillator are equidistant, the spacing between two
adjacent levels being equal to $\hbar\omega$. With the energy level labelled by the integer n
(situated at a distance $n\hbar\omega$ above the ground state) can then be associated a set
of n identical particles (or quanta), each possessing an energy $\hbar\omega$. The transition
of the oscillator from level n to level n + 1 or n − 1 corresponds to the creation
or destruction of a quantum of energy $\hbar\omega$. In this chapter, we shall introduce the
operators $a^\dagger$ and a, which enable us to describe this transition from level n to
level n + 1 or n − 1. These operators, called creation or destruction operators, are
used throughout quantum statistical mechanics and quantum field theory*.

The detailed study of the harmonic oscillator in quantum mechanics is
therefore extremely important from a physical point of view. Moreover, we are
dealing with a quantum mechanical system for which the Schrödinger equation
can be solved rigorously. Having studied spin 1/2 and two-level systems in
chapter IV, we shall therefore now consider another simple example which illustrates
the general formalism of quantum mechanics. We shall show in particular how
to solve an eigenvalue equation by dealing only with the operators and the
commutation relations (this technique will also be applied to angular momentum).
We shall also study in a detailed way the motion of wave packets, particularly at the
classical limit (cf. complement $G_V$ on quasi-classical states).

In §A-2, we shall review some results related to the classical oscillator
before stating (§A-3) certain general properties of the eigenvalues of the
Hamiltonian H. Then, in §§B and C, we shall determine these eigenvalues and eigenvectors
by introducing creation and destruction operators and using only the consequences
of the canonical commutation relation $[X, P] = i\hbar$, as well as the particular form
of H. §D is devoted to a physical study of the stationary states of the oscillator
and wave packets formed by linear superpositions of these stationary states.


2. The harmonic oscillator in classical mechanics 


The potential energy V(x) [formula (A-1)] is shown in figure 1. The motion
of the particle is governed by the dynamical equation:

$$m\,\frac{d^2x}{dt^2} = -\,\frac{dV}{dx} = -\,kx \tag{A-4}$$

* The aim of quantum field theory is to describe interactions between particles in the relativistic
domain, especially the interactions between electrons, positrons and photons. It is clear that creation and
destruction operators should play an important role, since such processes are indeed observed
experimentally (absorption or emission of photons, pair creation...).

FIGURE 1

The potential energy V(x) of a one-dimensional harmonic oscillator.
The amplitude of the classical motion of energy E is $x_M$.

The general solution of this equation is of the form:

$$x = x_M\cos(\omega t - \varphi) \tag{A-5}$$

where $\omega$ is defined by (A-3), and the constants of integration $x_M$ and $\varphi$ are determined
by the initial conditions of the motion. The particle therefore oscillates sinusoidally
about the point O, with an amplitude $x_M$ and an angular frequency $\omega$.

The kinetic energy of the particle is:

$$T = \frac{1}{2}\,m\left(\frac{dx}{dt}\right)^2 = \frac{p^2}{2m} \tag{A-6}$$

where $p = m\,\dfrac{dx}{dt}$ is the momentum of the particle. The total energy is:

$$E = T + V = \frac{p^2}{2m} + \frac{1}{2}\,m\omega^2x^2 \tag{A-7}$$

Substituting solution (A-5) into this equation, we find:

$$E = \frac{1}{2}\,m\omega^2x_M^2 \tag{A-8}$$

The energy of the particle is therefore time-independent (this is a general property
of conservative systems) and can take on any positive (or zero) value, since $x_M$ is
a priori arbitrary.

If we fix the total energy E, the limits of the classical motion $x = \pm x_M$ can
be determined from figure 1 by taking the intersection of the parabola with the
line parallel to Ox of ordinate E. At these points $x = \pm x_M$, the potential energy
is at a maximum and equal to E, and the kinetic energy is zero. On the other hand,
at x = 0, the potential energy is zero and the kinetic energy is maximum.
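
The statement that substituting (A-5) into (A-7) gives the constant value (A-8) can be checked numerically. The sketch below is only an illustration: the function name `classical_oscillator` and the sample values of m, $\omega$, $x_M$ and $\varphi$ are assumptions, and NumPy is assumed to be available.

```python
import numpy as np


def classical_oscillator(m, omega, x_max, phase, t):
    """Position (A-5), momentum p = m dx/dt and total energy (A-7) at times t."""
    x = x_max * np.cos(omega * t - phase)
    p = -m * omega * x_max * np.sin(omega * t - phase)
    energy = p**2 / (2 * m) + 0.5 * m * omega**2 * x**2
    return x, p, energy


if __name__ == "__main__":
    t = np.linspace(0.0, 10.0, 501)
    x, p, energy = classical_oscillator(m=2.0, omega=3.0, x_max=0.5, phase=0.4, t=t)
    # The spread of the energy over time should be at rounding level,
    # and its value should match (A-8): m omega^2 x_M^2 / 2.
    print(energy.max() - energy.min(), 0.5 * 2.0 * 3.0**2 * 0.5**2)
```

The kinetic and potential contributions exchange energy during the motion; only their sum stays fixed.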


COMMENT: 

Consider an arbitrary potential V(x) which has a minimum at
$x = x_0$ (fig. 2). Expanding the function V(x) in a Taylor’s series in the
neighborhood of $x_0$, we obtain:

$$V(x) = a + b\,(x - x_0)^2 + c\,(x - x_0)^3 + \ldots \tag{A-9}$$

The coefficients of this expansion are given by:

$$a = V(x_0) \qquad b = \frac{1}{2}\left(\frac{d^2V}{dx^2}\right)_{x=x_0} \qquad c = \frac{1}{3!}\left(\frac{d^3V}{dx^3}\right)_{x=x_0} \tag{A-10}$$

and the linear term in $(x - x_0)$ is zero since $x_0$ corresponds to a minimum
of V(x). The force derived from the potential V(x) is, in the neighborhood
of $x_0$:

$$F_x = -\,\frac{dV}{dx} = -\,2b\,(x - x_0) - 3c\,(x - x_0)^2 + \ldots \tag{A-11}$$

Since $x = x_0$ represents a minimum, the coefficient b is positive.

The point $x = x_0$ corresponds to a stable equilibrium position for the
particle: $F_x$ is zero for $x = x_0$. Moreover, for $(x - x_0)$ sufficiently small,
$F_x$ and $(x - x_0)$ have opposite signs since b is positive.

If the amplitude of the motion of the particle about $x_0$ is sufficiently
small for the term in $(x - x_0)^3$ of (A-9) [and therefore, the corresponding
term in $(x - x_0)^2$ of (A-11)] to be negligible compared to the preceding ones,
we have a harmonic oscillator since the dynamical equation can then be
approximated by:

$$m\,\frac{d^2x}{dt^2} \simeq -\,2b\,(x - x_0) \tag{A-12}$$

The corresponding angular frequency $\omega$ is related to the second derivative
of V(x) at $x = x_0$ by the formula:

$$\omega = \sqrt{\frac{2b}{m}} = \sqrt{\frac{1}{m}\left(\frac{d^2V}{dx^2}\right)_{x=x_0}} \tag{A-13}$$

Since the amplitude of the motion must remain small, the energy of the
harmonic oscillator will be low.
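
Formula (A-13) can be tried on a concrete non-parabolic potential. The sketch below is an illustration under stated assumptions: the Morse-type potential, its parameters, and the helper names `small_oscillation_frequency` and `morse` are chosen here for the example and do not come from the text. The second derivative is estimated by a central finite difference.

```python
import math


def small_oscillation_frequency(potential, x0, m, h=1e-4):
    """Angular frequency (A-13) from a central-difference estimate of V''(x0)."""
    second_derivative = (potential(x0 + h) - 2.0 * potential(x0) + potential(x0 - h)) / h**2
    return math.sqrt(second_derivative / m)


def morse(x, depth=4.0, alpha=1.5, x0=1.0):
    """Hypothetical example potential with a minimum at x0; V''(x0) = 2 * depth * alpha**2."""
    return depth * (1.0 - math.exp(-alpha * (x - x0))) ** 2


if __name__ == "__main__":
    omega = small_oscillation_frequency(morse, x0=1.0, m=2.0)
    # Analytic value for these parameters: alpha * sqrt(2 * depth / m) = 3.0
    print(omega)
```

For this potential the cubic coefficient c of (A-9) is not zero, so the frequency found here describes the motion only for amplitudes small enough that the term in $(x - x_0)^3$ can be neglected.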



FIGURE 2

In the neighborhood of a minimum, any potential V(x) can be approximated
by a parabolic potential (dashed line). In the potential V(x), a classical
particle of energy E oscillates between $x_1$ and $x_2$.

For higher energies E, the particle will be in periodic but not sinusoidal
motion between the limits $x_1$ and $x_2$ (fig. 2). If we expand the function x(t),
which gives the position of the particle, in a Fourier series, we shall find
not one, but several, sinusoidal terms, whose frequencies are integral
multiples of the lowest frequency. We then say that we are dealing with an
anharmonic oscillator. Note also that in this case, the period of the motion
is not generally $2\pi/\omega$, where $\omega$ is given by formula (A-13).


3. General properties of the quantum mechanical Hamiltonian 

In quantum mechanics, the classical quantities x and p are replaced
respectively by the observables X and P, which satisfy:

$$[X, P] = i\hbar \tag{A-14}$$

It is then easy to obtain the Hamiltonian operator of the system from (A-7):

$$H = \frac{P^2}{2m} + \frac{1}{2}\,m\omega^2X^2 \tag{A-15}$$

Since H is time-independent (conservative system), the quantum mechanical
study of the harmonic oscillator reduces to the solution of the eigenvalue
equation:

$$H\,|\varphi\rangle = E\,|\varphi\rangle \tag{A-16}$$

which is written, in the $\{\,|x\rangle\,\}$ representation:

$$\left[-\,\frac{\hbar^2}{2m}\,\frac{d^2}{dx^2} + \frac{1}{2}\,m\omega^2x^2\right]\varphi(x) = E\,\varphi(x) \tag{A-17}$$

Before undertaking the detailed study of equation (A-16), let us indicate
some important properties that can be deduced from the form (A-1) of the potential
function:

(i) The eigenvalues of the Hamiltonian are positive. It can be shown that, in
general (complement $M_{III}$), if the potential function V(x) has a lower bound, the
eigenvalues E of the Hamiltonian $H = \dfrac{P^2}{2m} + V(X)$ are greater than the minimum
of V(x):

$$V(x) \ge V_m \quad\text{requires}\quad E > V_m \tag{A-18}$$

For the harmonic oscillator which we are studying here, we have chosen the
energy origin such that $V_m$ is zero.

(ii) The eigenfunctions of H have a definite parity. This is due to the fact that
the potential V(x) is an even function:

$$V(-x) = V(x) \tag{A-19}$$

We can then (cf. complements $F_{II}$ and $C_V$) look for eigenfunctions of H, in the $\{\,|x\rangle\,\}$
representation, amongst the functions which have a definite parity (in fact, we shall
see that the eigenvalues of H are not degenerate; consequently, the wave functions
associated with the stationary states are necessarily either even or odd).

(iii) The energy spectrum is discrete. Whatever the value of the total energy,
the classical motion is limited to a bounded region of the Ox axis (fig. 1), and it can
be shown (complement $M_{III}$) that in this case, the eigenvalues of the Hamiltonian
form a discrete set.

We shall derive these properties (in a more precise form) in the following
sections. However, it is interesting to note that they can be obtained simply by
applying to the harmonic oscillator some general theorems concerning
one-dimensional problems.
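
Properties (i), (ii) and (iii) can be observed on a crude numerical version of equation (A-17). The sketch below is an illustration, not the method of this chapter: it assumes NumPy, works in units where $\hbar = m = \omega = 1$, replaces the second derivative by a three-point finite difference on a symmetric grid, and uses the hypothetical function name `oscillator_spectrum`. The grid size and extent are arbitrary choices.

```python
import numpy as np


def oscillator_spectrum(n_points=801, half_width=8.0, n_levels=4):
    """Lowest eigenvalues of -1/2 d^2/dx^2 + 1/2 x^2 on a grid, with parities.

    Returns (energies, parities); each parity is the overlap of an
    eigenvector with its mirror image, close to +1 (even) or -1 (odd).
    """
    x = np.linspace(-half_width, half_width, n_points)
    dx = x[1] - x[0]
    # Three-point stencil for -1/2 d^2/dx^2 (wave function set to zero outside the grid).
    kinetic = (np.diag(np.full(n_points, 1.0 / dx**2))
               - np.diag(np.full(n_points - 1, 0.5 / dx**2), 1)
               - np.diag(np.full(n_points - 1, 0.5 / dx**2), -1))
    potential = np.diag(0.5 * x**2)
    energies, states = np.linalg.eigh(kinetic + potential)
    parities = [float(np.dot(states[:, n], states[::-1, n])) for n in range(n_levels)]
    return energies[:n_levels], parities


if __name__ == "__main__":
    energies, parities = oscillator_spectrum()
    print(energies)   # positive, well separated values
    print(parities)   # alternating signs
```

The discretized problem necessarily has a discrete spectrum, so this check says nothing about (iii) by itself; what it does show is that the lowest levels are positive, non-degenerate, and of alternating parity.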


B. EIGENVALUES OF THE HAMILTONIAN 

We are now going to study the eigenvalue equation (A-16). First of all, using only
the canonical commutation relation (A-14), we shall find the spectrum of the
Hamiltonian H written in (A-15).


1. Notation 

We shall begin by introducing some useful notations. 

a. THE X AND P OPERATORS 

The observables X and P obviously have dimensions (those of a length and
a momentum, respectively). Since $\omega$ has the dimensions of the inverse of a time and $\hbar$,
of an action (product of an energy and a time), it is easy to see that the
observables $\hat{X}$ and $\hat{P}$ defined by:

$$\hat{X} = \sqrt{\frac{m\omega}{\hbar}}\;X \qquad\qquad \hat{P} = \frac{1}{\sqrt{m\hbar\omega}}\;P \tag{B-1}$$

are dimensionless.

If we use these new operators, the canonical commutation relation will be
written:

$$[\hat{X}, \hat{P}] = i \tag{B-2}$$

and the Hamiltonian can be put in the form:

$$H = \hbar\omega\,\hat{H} \tag{B-3}$$

with:

$$\hat{H} = \frac{1}{2}\left(\hat{X}^2 + \hat{P}^2\right) \tag{B-4}$$

We shall therefore seek the solutions of the eigenvalue equation:

$$\hat{H}\,|\varphi_\nu^i\rangle = \varepsilon_\nu\,|\varphi_\nu^i\rangle \tag{B-5}$$

where the operator $\hat{H}$ and the eigenvalues $\varepsilon_\nu$ are dimensionless. The index $\nu$ can
belong to either a discrete or a continuous set, and the additional index i enables us
to distinguish between the various possible orthogonal eigenvectors associated
with the same eigenvalue $\varepsilon_\nu$.

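b. THE a, $a^\dagger$ AND N OPERATORS

If $\hat{X}$ and $\hat{P}$ were numbers and not operators, we could write the sum $\hat{X}^2 + \hat{P}^2$
appearing in expression (B-4) for $\hat{H}$ in the form of a product of linear terms:
$(\hat{X} - i\hat{P})(\hat{X} + i\hat{P})$. In fact, since $\hat{X}$ and $\hat{P}$ are non-commuting operators, $\hat{X}^2 + \hat{P}^2$
is not equal to $(\hat{X} - i\hat{P})(\hat{X} + i\hat{P})$. We shall show, however, that the introduction
of operators proportional to $\hat{X} + i\hat{P}$ and $\hat{X} - i\hat{P}$ enables us to simplify considerably
our search for eigenvalues and eigenvectors of $\hat{H}$.

We therefore set*:

$$a = \frac{1}{\sqrt{2}}\left(\hat{X} + i\hat{P}\right) \tag{B-6-a}$$

$$a^\dagger = \frac{1}{\sqrt{2}}\left(\hat{X} - i\hat{P}\right) \tag{B-6-b}$$

These formulas can immediately be inverted to yield:

$$\hat{X} = \frac{1}{\sqrt{2}}\left(a^\dagger + a\right) \tag{B-7-a}$$

$$\hat{P} = \frac{i}{\sqrt{2}}\left(a^\dagger - a\right) \tag{B-7-b}$$

Since $\hat{X}$ and $\hat{P}$ are Hermitian, a and $a^\dagger$ are not (because of the factor i), but are
adjoints of each other.

The commutator of a and $a^\dagger$ is easy to calculate from (B-6) and (B-2):

$$\begin{aligned}
[a, a^\dagger] &= \frac{1}{2}\left[\hat{X} + i\hat{P},\; \hat{X} - i\hat{P}\right] \\
&= \frac{i}{2}\,[\hat{P}, \hat{X}] - \frac{i}{2}\,[\hat{X}, \hat{P}]
\end{aligned} \tag{B-8}$$

that is:

$$[a, a^\dagger] = 1 \tag{B-9}$$

This relation is completely equivalent to the canonical commutation relation (A-14).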

* Until now, we have designated operators by capital letters. However, to conform to standard
usage, we shall use the small letters a and $a^\dagger$ for the operators (B-6).

Finally, let us derive some simple formulas which will be useful to us in the rest
of this chapter. Let us first calculate $a^\dagger a$:

$$\begin{aligned}
a^\dagger a &= \frac{1}{2}\left(\hat{X} - i\hat{P}\right)\left(\hat{X} + i\hat{P}\right) \\
&= \frac{1}{2}\left(\hat{X}^2 + \hat{P}^2 + i\hat{X}\hat{P} - i\hat{P}\hat{X}\right) \\
&= \frac{1}{2}\left(\hat{X}^2 + \hat{P}^2 - 1\right)
\end{aligned} \tag{B-10}$$

Comparing this with expression (B-4), we see that:

$$\hat{H} = a^\dagger a + \frac{1}{2} = \frac{1}{2}\left(\hat{X} - i\hat{P}\right)\left(\hat{X} + i\hat{P}\right) + \frac{1}{2} \tag{B-11}$$

Unlike the situation in the classical case, $\hat{H}$ cannot be put in the form of a product
of linear terms. The non-commutativity of $\hat{X}$ and $\hat{P}$ is at the origin of the additional
term 1/2 which appears on the right-hand side of (B-11). Similarly, it can be shown
that:

$$\hat{H} = aa^\dagger - \frac{1}{2} \tag{B-12}$$

Let us now introduce the operator N defined by:

$$N = a^\dagger a \tag{B-13}$$

This operator is Hermitian since:

$$N^\dagger = a^\dagger\,(a^\dagger)^\dagger = a^\dagger a = N \tag{B-14}$$

Moreover, according to (B-11):

$$\hat{H} = N + \frac{1}{2} \tag{B-15}$$

so that the eigenvectors of $\hat{H}$ are eigenvectors of N, and vice versa.
Finally, let us calculate the commutators of N with a and $a^\dagger$:

$$\begin{aligned}
[N, a] &= [a^\dagger a, a] = a^\dagger\,[a, a] + [a^\dagger, a]\,a = -\,a \\
[N, a^\dagger] &= [a^\dagger a, a^\dagger] = a^\dagger\,[a, a^\dagger] + [a^\dagger, a^\dagger]\,a = a^\dagger
\end{aligned} \tag{B-16}$$

that is:

$$[N, a] = -\,a \tag{B-17-a}$$

$$[N, a^\dagger] = a^\dagger \tag{B-17-b}$$

Our study of the harmonic oscillator will be based on the use of the a, $a^\dagger$ and
N operators. We have replaced the eigenvalue equation of $\hat{H}$, which we first wrote
in the form (B-5), by that of N:

$$N\,|\varphi_\nu^i\rangle = \nu\,|\varphi_\nu^i\rangle \tag{B-18}$$

When this equation is solved, we shall know that the eigenvector $|\varphi_\nu^i\rangle$ of N is
also an eigenvector of H with the eigenvalue $E_\nu = (\nu + 1/2)\,\hbar\omega$ [formulas (B-3)
and (B-15)]:

$$H\,|\varphi_\nu^i\rangle = (\nu + 1/2)\,\hbar\omega\,|\varphi_\nu^i\rangle \tag{B-19}$$

The solution of equation (B-18) will be based on the commutation relation (B-9),
which is equivalent to the initial relation (A-14), and on formulas (B-17), which
are consequences of it.
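
Relations (B-9), (B-11) and (B-17) can be illustrated with finite matrices. The sketch below rests on an assumption that goes beyond this section: it uses the standard matrix elements $\langle n-1|\,a\,|n\rangle = \sqrt{n}$ in the basis of eigenvectors of N, which are only established later in the chapter. The function name `ladder_matrices` and the truncation dimension are likewise illustrative choices, and NumPy is assumed.

```python
import numpy as np


def ladder_matrices(dim):
    """Truncated matrices of a, a-dagger and N = a-dagger a (assumed elements sqrt(n))."""
    a = np.diag(np.sqrt(np.arange(1, dim)), k=1)  # <n-1| a |n> = sqrt(n)
    a_dag = a.conj().T
    number = a_dag @ a
    return a, a_dag, number


def dimensionless_hamiltonian(a, a_dag):
    """H-hat = (X-hat^2 + P-hat^2) / 2, with X-hat and P-hat built from (B-7)."""
    x_hat = (a_dag + a) / np.sqrt(2)
    p_hat = 1j * (a_dag - a) / np.sqrt(2)
    return 0.5 * (x_hat @ x_hat + p_hat @ p_hat)


if __name__ == "__main__":
    a, a_dag, number = ladder_matrices(8)
    commutator = a @ a_dag - a_dag @ a
    # The last diagonal entry is spoiled by the truncation; the rest is the identity.
    print(np.round(np.diag(commutator).real, 6))
    print(np.allclose(number @ a - a @ number, -a))          # (B-17-a)
    print(np.allclose(number @ a_dag - a_dag @ number, a_dag))  # (B-17-b)
```

No finite matrices can satisfy (B-9) exactly, since the trace of a commutator of finite matrices vanishes; this is why the check of (B-9) and (B-11) has to exclude the last row and column.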


2. Determination of the spectrum 

a. LEMMAS 

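α. Lemma I (property of the eigenvalues of N)

The eigenvalues $\nu$ of the operator N are positive or zero.

Consider an arbitrary eigenvector $|\varphi_\nu^i\rangle$ of N. The square of the norm of the
vector $a\,|\varphi_\nu^i\rangle$ is positive or zero:

$$\left\|\,a\,|\varphi_\nu^i\rangle\,\right\|^2 = \langle\varphi_\nu^i|\,a^\dagger a\,|\varphi_\nu^i\rangle \ge 0 \tag{B-20}$$

Let us then use definition (B-13) of N:

$$\langle\varphi_\nu^i|\,a^\dagger a\,|\varphi_\nu^i\rangle = \langle\varphi_\nu^i|\,N\,|\varphi_\nu^i\rangle = \nu\,\langle\varphi_\nu^i|\varphi_\nu^i\rangle \tag{B-21}$$

Since $\langle\varphi_\nu^i|\varphi_\nu^i\rangle$ is positive, comparison of (B-20) and (B-21) shows that:

$$\nu \ge 0 \tag{B-22}$$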

β. Lemma II (properties of the vector $a\,|\varphi_\nu^i\rangle$)

Let $|\varphi_\nu^i\rangle$ be a (non-zero) eigenvector of N with the eigenvalue $\nu$.

We shall prove the following:

(i) If $\nu = 0$, the ket $a\,|\varphi_{\nu=0}^i\rangle$ is zero.

(ii) If $\nu > 0$, the ket $a\,|\varphi_\nu^i\rangle$ is a non-zero eigenvector of N with the
eigenvalue $\nu - 1$.

(i) According to (B-21), the square of the norm of $a\,|\varphi_\nu^i\rangle$ is zero if $\nu = 0$;
now, the norm of a vector is zero if and only if this vector is zero. Consequently,
if $\nu = 0$ is an eigenvalue of N, all eigenvectors $|\varphi_0^i\rangle$ associated with this
eigenvalue satisfy the relation:

$$a\,|\varphi_0^i\rangle = 0 \tag{B-23}$$

Let us now show that relation (B-23) is characteristic of these eigenvectors.
Consider a vector $|\varphi\rangle$ which satisfies:

$$a\,|\varphi\rangle = 0 \tag{B-24}$$

Multiply both sides of this equation from the left by $a^\dagger$:

$$a^\dagger a\,|\varphi\rangle = N\,|\varphi\rangle = 0 \tag{B-25}$$

Any vector which satisfies (B-24) is therefore an eigenvector of N with the
eigenvalue $\nu = 0$.

(ii) Now let us assume that $\nu$ is strictly positive. According to (B-21), the
vector $a\,|\varphi_\nu^i\rangle$ is then non-zero, since the square of its norm is not equal to zero.

Let us show that $a\,|\varphi_\nu^i\rangle$ is an eigenvector of N. To do this, let us apply the
operator relation (B-17-a) to the vector $|\varphi_\nu^i\rangle$:

$$\begin{aligned}
[N, a]\,|\varphi_\nu^i\rangle &= -\,a\,|\varphi_\nu^i\rangle \\
Na\,|\varphi_\nu^i\rangle &= aN\,|\varphi_\nu^i\rangle - a\,|\varphi_\nu^i\rangle \\
&= a\nu\,|\varphi_\nu^i\rangle - a\,|\varphi_\nu^i\rangle
\end{aligned} \tag{B-26}$$

Therefore:

$$N\left[a\,|\varphi_\nu^i\rangle\right] = (\nu - 1)\left[a\,|\varphi_\nu^i\rangle\right] \tag{B-27}$$

which shows that $a\,|\varphi_\nu^i\rangle$ is an eigenvector of N with the eigenvalue $\nu - 1$.

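γ. Lemma III (properties of the vector $a^\dagger|\varphi_\nu^i\rangle$) 

Let $|\varphi_\nu^i\rangle$ be a (non-zero) eigenvector of N of eigenvalue $\nu$. 

We shall prove the following: 

(i) $a^\dagger|\varphi_\nu^i\rangle$ is always non-zero. 

(ii) $a^\dagger|\varphi_\nu^i\rangle$ is an eigenvector of N with the eigenvalue $\nu + 1$. 

(i) It is easy to calculate the norm of the vector $a^\dagger|\varphi_\nu^i\rangle$, using formulas (B-9) 
and (B-13): 

$$\big\|\,a^\dagger|\varphi_\nu^i\rangle\,\big\|^2 = \langle\varphi_\nu^i|\,a a^\dagger\,|\varphi_\nu^i\rangle = \langle\varphi_\nu^i|\,(N + 1)\,|\varphi_\nu^i\rangle = (\nu + 1)\,\langle\varphi_\nu^i|\varphi_\nu^i\rangle \tag{B-28}$$

Since, according to lemma I, $\nu$ is positive or zero, the ket $a^\dagger|\varphi_\nu^i\rangle$ always has 
a non-zero norm and, consequently, is never zero. 

(ii) The proof of the fact that $a^\dagger|\varphi_\nu^i\rangle$ is an eigenvector of N is analogous 
to that of lemma II; starting from relation (B-17-b) between operators, we obtain: 

$$[N, a^\dagger]\,|\varphi_\nu^i\rangle = a^\dagger|\varphi_\nu^i\rangle$$

$$N a^\dagger|\varphi_\nu^i\rangle = a^\dagger N|\varphi_\nu^i\rangle + a^\dagger|\varphi_\nu^i\rangle = (\nu + 1)\,a^\dagger|\varphi_\nu^i\rangle \tag{B-29}$$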


b. THE SPECTRUM OF N IS COMPOSED OF NON-NEGATIVE INTEGERS 

Consider an arbitrary eigenvalue $\nu$ of N and a non-zero eigenvector $|\varphi_\nu^i\rangle$ 
associated with this eigenvalue. 

According to lemma I, $\nu$ is necessarily positive or zero. First, let us assume $\nu$ 
to be non-integral. We are now going to show that such a hypothesis contradicts 
lemma I and must consequently be excluded. If $\nu$ is non-integral, we can always 
find an integer $n \geq 0$ such that: 

$$n < \nu < n + 1 \tag{B-30}$$

Now let us consider the series of vectors: 

$$|\varphi_\nu^i\rangle,\ a|\varphi_\nu^i\rangle,\ \ldots,\ a^n|\varphi_\nu^i\rangle \tag{B-31}$$

According to lemma II, each of the vectors $a^p|\varphi_\nu^i\rangle$ of this series (with $0 \leq p \leq n$) 
is non-zero and an eigenvector of N with the eigenvalue $\nu - p$ (cf. fig. 3). The proof 
is by iteration: $|\varphi_\nu^i\rangle$ is non-zero by hypothesis; $a|\varphi_\nu^i\rangle$ is non-zero (since $\nu > 0$) 
and corresponds to the eigenvalue $\nu - 1$ of N ...; $a^p|\varphi_\nu^i\rangle$ is obtained when a acts 
on $a^{p-1}|\varphi_\nu^i\rangle$, an eigenvector of N with the strictly positive eigenvalue $\nu - p + 1$, 
since $p \leq n$ and $\nu > n$ [cf. (B-30)]. 


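FIGURE 3 

[Axis of the eigenvalues of N, graduated 0, 1, 2, ..., n, n + 1, with the kets $a^p|\varphi_\nu^i\rangle$, ..., $a|\varphi_\nu^i\rangle$, $|\varphi_\nu^i\rangle$ marked along it.] 

Letting a act several times on the ket $|\varphi_\nu^i\rangle$, we can construct eigenvectors of N with eigenvalues 
$\nu - 1$, $\nu - 2$, etc. 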


Now let a act on the ket $a^n|\varphi_\nu^i\rangle$. Since $\nu - n > 0$ according to (B-30), the 
action of a on $a^n|\varphi_\nu^i\rangle$ (an eigenvector of N with the eigenvalue $\nu - n > 0$) yields 
a non-zero vector (lemma II). Moreover, again according to lemma II, $a^{n+1}|\varphi_\nu^i\rangle$ 
is an eigenvector of N with the eigenvalue $\nu - n - 1$, which is strictly negative 
according to (B-30). If $\nu$ is non-integral, we can therefore construct a non-zero 
eigenvector of N with a strictly negative eigenvalue. Since this is impossible, 
according to lemma I, the hypothesis of non-integral $\nu$ must be rejected. 

What now happens if: 

$$\nu = n \tag{B-32}$$

with n a positive integer or zero? In the series of vectors (B-31), $a^n|\varphi_n^i\rangle$ is 
non-zero and an eigenvector of N with the eigenvalue 0. According to lemma II (§ i), 
we therefore have: 

$$a^{n+1}|\varphi_n^i\rangle = 0 \tag{B-33}$$

The series of vectors obtained by repeated action of the operator a on $|\varphi_n^i\rangle$ is 
therefore limited when n is integral. It is then never possible to obtain a non-zero 
eigenvector of N which corresponds to a negative eigenvalue. 

In conclusion, $\nu$ can only be a non-negative integer. 

Lemma III can then be used to show that the spectrum of N indeed includes 
all positive or zero integers. We have already constructed an eigenvector of N with 
an eigenvalue of zero ($a^n|\varphi_n^i\rangle$). All we must do is let $(a^\dagger)^k$ act on such a vector in 
order to obtain an eigenvector of N of eigenvalue k, where k is an arbitrary positive 
integer. 

If we then refer to formula (B-19), we conclude that the eigenvalues of H 
are of the form: 

$$E_n = \left(n + \frac{1}{2}\right)\hbar\omega \tag{B-34}$$

with n = 0, 1, 2, ... Therefore, in quantum mechanics, the energy of the harmonic 
oscillator is quantized and cannot take on any arbitrary value. Note also that the 
smallest value (the ground state) is not zero, but $\hbar\omega/2$ (see §D-2 below). 
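
As a numerical illustration of (B-34), independent of the operator argument above, the following sketch discretizes H on a grid in x and locates the lowest eigenvalues of the resulting tridiagonal matrix by Sturm-sequence bisection. The units $\hbar = m = \omega = 1$, the grid size, the box half-width and all function names are assumptions made for this illustration only; they do not come from the text.

```python
# Units (assumption): hbar = m = omega = 1, so H = -(1/2) d^2/dx^2 + x^2/2.
HALF_WIDTH = 10.0      # the low-lying wave functions are negligible beyond |x| = 10
N_POINTS = 2000        # number of interior grid points
STEP = 2.0 * HALF_WIDTH / (N_POINTS + 1)

# Second-order finite differences give a symmetric tridiagonal matrix.
GRID = [-HALF_WIDTH + (i + 1) * STEP for i in range(N_POINTS)]
DIAG = [1.0 / STEP**2 + 0.5 * x * x for x in GRID]
OFF = -0.5 / STEP**2   # constant off-diagonal element


def count_below(energy):
    """Number of eigenvalues of the matrix that lie below `energy`.

    Sturm-sequence property: the count equals the number of negative
    pivots in the LDL^T factorization of (matrix - energy * identity).
    """
    count = 0
    pivot = 1.0
    for i, d in enumerate(DIAG):
        coupling = 0.0 if i == 0 else OFF * OFF / pivot
        pivot = d - energy - coupling
        if pivot == 0.0:
            pivot = 1e-290  # avoid a division by zero at the next step
        if pivot < 0.0:
            count += 1
    return count


def eigenvalue(k, tol=1e-9):
    """k-th eigenvalue (k = 0 is the lowest), found by bisection."""
    low, high = 0.0, max(DIAG) + 2.0 * abs(OFF)   # Gershgorin upper bound
    while high - low > tol:
        mid = 0.5 * (low + high)
        if count_below(mid) > k:
            high = mid
        else:
            low = mid
    return 0.5 * (low + high)


levels = [eigenvalue(k) for k in range(4)]
for n, energy in enumerate(levels):
    print(f"n = {n}: E = {energy:.5f}   (n + 1/2 = {n + 0.5})")
```

Within the discretization error, the printed values should be equally spaced by one unit of $\hbar\omega$ and start at one half, as (B-34) states.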


c. INTERPRETATION OF THE $a$ AND $a^\dagger$ OPERATORS 

If we start with an eigenstate $|\varphi_n^i\rangle$ of H corresponding to the 
eigenvalue $E_n = (n + 1/2)\hbar\omega$, application of the operator a yields an eigenvector 
associated with the eigenvalue $E_{n-1} = (n + 1/2)\hbar\omega - \hbar\omega$, and application of $a^\dagger$ 
yields, in the same way, the energy $E_{n+1} = (n + 1/2)\hbar\omega + \hbar\omega$. 

For this reason, a is said to be a destruction operator and $a^\dagger$ a creation 
operator: their action on an eigenvector of N makes an energy quantum $\hbar\omega$ appear 
or disappear. 


3. Degeneracy of the eigenvalues 


We shall now show that the energy levels of the one-dimensional harmonic 
oscillator, given by equation (B-34), are not degenerate. 


a. THE GROUND STATE IS NON-DEGENERATE 

The eigenstates of H associated with the eigenvalue $E_0 = \hbar\omega/2$, that is, the 
eigenstates of N associated with the eigenvalue n = 0, according to lemma II 
of § 2-a-β, must all satisfy the equation: 

$$a|\varphi_0^i\rangle = 0 \tag{B-35}$$

To find the degeneracy of the $E_0$ level, all we must do is see how many 
linearly independent kets satisfy (B-35). 

Using definition (B-6-a) of a and relations (B-1), we can write (B-35) in the 
form: 

$$\frac{1}{\sqrt{2}}\left[\sqrt{\frac{m\omega}{\hbar}}\,X + \frac{i}{\sqrt{m\hbar\omega}}\,P\right]|\varphi_0^i\rangle = 0 \tag{B-36}$$

In the $\{|x\rangle\}$ representation, this relation becomes: 

$$\left(\frac{m\omega}{\hbar}\,x + \frac{d}{dx}\right)\varphi_0^i(x) = 0 \tag{B-37}$$

where: 

$$\varphi_0^i(x) = \langle x|\varphi_0^i\rangle \tag{B-38}$$

Therefore we must solve a first-order differential equation. Its general solution is: 

$$\varphi_0^i(x) = c\,e^{-\frac{1}{2}\frac{m\omega}{\hbar}x^2} \tag{B-39}$$

where c is the constant of integration. The various solutions of (B-37) are all 
proportional to each other. Consequently, to within a multiplicative factor, there 
exists only one ket $|\varphi_0\rangle$ which satisfies (B-35): the ground state $E_0 = \hbar\omega/2$ is 
not degenerate. 
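
For completeness, the integration of (B-37) can be written out by separation of variables. This is only a sketch: it is carried out wherever $\varphi_0^i(x) \neq 0$, and the intermediate constant $K$ is introduced here for the illustration and does not appear in the text.

```latex
\begin{align*}
\frac{d\varphi_0^i}{dx} = -\frac{m\omega}{\hbar}\,x\,\varphi_0^i(x)
&\quad\Longrightarrow\quad
\frac{d\varphi_0^i}{\varphi_0^i} = -\frac{m\omega}{\hbar}\,x\,dx \\
\ln\varphi_0^i(x) = -\frac{1}{2}\,\frac{m\omega}{\hbar}\,x^2 + K
&\quad\Longrightarrow\quad
\varphi_0^i(x) = c\,e^{-\frac{1}{2}\frac{m\omega}{\hbar}x^2},
\qquad c = e^{K}
\end{align*}
```

Since the equation is linear and of first order, the single constant c exhausts the freedom in the solution, which is the content of the non-degeneracy statement.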


b. ALL THE STATES ARE NON-DEGENERATE 

We have just seen that the ground state is not degenerate. Let us show by 
recurrence that this is also the case for all the other states. 

To do this, all we need prove is that, if the level $E_n = (n + 1/2)\hbar\omega$ is not 
degenerate, the level $E_{n+1} = (n + 1 + 1/2)\hbar\omega$ is not either. Let us therefore assume 
that there exists, to within a constant factor, only one vector $|\varphi_n\rangle$ such that: 

$$N|\varphi_n\rangle = n\,|\varphi_n\rangle \tag{B-40}$$

Then consider an eigenvector $|\varphi_{n+1}^i\rangle$ corresponding to the eigenvalue n + 1: 

$$N|\varphi_{n+1}^i\rangle = (n + 1)\,|\varphi_{n+1}^i\rangle \tag{B-41}$$

We know that the ket $a|\varphi_{n+1}^i\rangle$ is not zero and that it is an eigenvector of N with 
the eigenvalue n (cf. lemma II). Since this eigenvalue is not degenerate by hypothesis, there 
exists a number $c^i$ such that: 

$$a|\varphi_{n+1}^i\rangle = c^i\,|\varphi_n\rangle \tag{B-42}$$

It is simple to invert this equation by applying $a^\dagger$ to both sides: 

$$a^\dagger a|\varphi_{n+1}^i\rangle = c^i\,a^\dagger|\varphi_n\rangle \tag{B-43}$$

that is, taking (B-13) and (B-41) into account: 

$$|\varphi_{n+1}^i\rangle = \frac{c^i}{n + 1}\,a^\dagger|\varphi_n\rangle \tag{B-44}$$

We already knew that $a^\dagger|\varphi_n\rangle$ was an eigenvector of N with the eigenvalue (n + 1); 
we see here that all kets $|\varphi_{n+1}^i\rangle$ associated with the eigenvalue (n + 1) are 
proportional to $a^\dagger|\varphi_n\rangle$. They are therefore proportional to each other: the 
eigenvalue (n + 1) is not degenerate. 

Thus, since the eigenvalue n = 0 is not degenerate (see § a), the 
eigenvalue n = 1 is not either, nor is n = 2, etc.: all the eigenvalues of N and, 
consequently, all those of H, are non-degenerate. This enables us to write simply $|\varphi_n\rangle$ 
for the eigenvector of H associated with the eigenvalue $E_n = (n + 1/2)\hbar\omega$. 

C. EIGENSTATES OF THE HAMILTONIAN 

In this section, we are going to study the principal properties of the eigenstates 
of the operator N and of the Hamiltonian H. 


1. The $\{|\varphi_n\rangle\}$ representation 

We shall assume that N and H are observables, that is, their eigenvectors 
constitute a basis in the space $\mathcal{E}_x$, the state space of a particle in a one-dimensional 
problem (this could be proved by considering the wave functions associated with 
the eigenstates of N, which we shall calculate in §2 below). Since none of the 
eigenvalues of N (or of H) is degenerate (see §B-3), N (or H) alone constitutes 
a C.S.C.O. in $\mathcal{E}_x$. 


a. THE BASIS VECTORS IN TERMS OF $|\varphi_0\rangle$ 

The vector $|\varphi_0\rangle$ associated with n = 0 is the vector of $\mathcal{E}_x$ which satisfies: 

$$a|\varphi_0\rangle = 0 \tag{C-1}$$

It is defined to within a constant factor; we shall assume $|\varphi_0\rangle$ to be normalized, 
so the indeterminacy is reduced to a global phase factor of the form $e^{i\theta}$, with $\theta$ real. 

According to lemma III of § B-2-a, the vector $|\varphi_1\rangle$ which corresponds to n = 1 
is proportional to $a^\dagger|\varphi_0\rangle$: 

$$|\varphi_1\rangle = c_1\,a^\dagger|\varphi_0\rangle \tag{C-2}$$

We shall determine $c_1$ by requiring $|\varphi_1\rangle$ to be normalized and choosing the phase 
of $|\varphi_1\rangle$ (relative to $|\varphi_0\rangle$) such that $c_1$ is real and positive. The square of the 
norm of $|\varphi_1\rangle$, according to (C-2), is equal to: 

$$\langle\varphi_1|\varphi_1\rangle = |c_1|^2\,\langle\varphi_0|\,a a^\dagger\,|\varphi_0\rangle = |c_1|^2\,\langle\varphi_0|\,(a^\dagger a + 1)\,|\varphi_0\rangle \tag{C-3}$$

where (B-9) has been used. Since $|\varphi_0\rangle$ is a normalized eigenstate of $N = a^\dagger a$ with 
the eigenvalue zero, we find: 

$$\langle\varphi_1|\varphi_1\rangle = |c_1|^2 = 1 \tag{C-4}$$

With the preceding phase convention, we have $c_1 = 1$ and, consequently: 

$$|\varphi_1\rangle = a^\dagger|\varphi_0\rangle \tag{C-5}$$

Similarly, we can construct $|\varphi_2\rangle$ from $|\varphi_1\rangle$: 

$$|\varphi_2\rangle = c_2\,a^\dagger|\varphi_1\rangle \tag{C-6}$$

We require $|\varphi_2\rangle$ to be normalized and choose its phase such that $c_2$ is real and 
positive: 

$$\langle\varphi_2|\varphi_2\rangle = |c_2|^2\,\langle\varphi_1|\,a a^\dagger\,|\varphi_1\rangle = |c_2|^2\,\langle\varphi_1|\,(a^\dagger a + 1)\,|\varphi_1\rangle = 2\,|c_2|^2 = 1 \tag{C-7}$$

Therefore: 

$$|\varphi_2\rangle = \frac{1}{\sqrt{2}}\,a^\dagger|\varphi_1\rangle = \frac{1}{\sqrt{2}}\,(a^\dagger)^2|\varphi_0\rangle \tag{C-8}$$

if we take (C-5) into account. 

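This procedure can easily be generalized. If we know $|\varphi_{n-1}\rangle$ (which is 
normalized), then the normalized vector $|\varphi_n\rangle$ is written: 

$$|\varphi_n\rangle = c_n\,a^\dagger|\varphi_{n-1}\rangle \tag{C-9}$$

Since: 

$$\langle\varphi_n|\varphi_n\rangle = |c_n|^2\,\langle\varphi_{n-1}|\,a a^\dagger\,|\varphi_{n-1}\rangle = n\,|c_n|^2 = 1 \tag{C-10}$$

we choose, with the same phase conventions as above: 

$$c_n = \frac{1}{\sqrt{n}} \tag{C-11}$$

With these successive phase choices, we can obtain all the $|\varphi_n\rangle$ from $|\varphi_0\rangle$: 

$$|\varphi_n\rangle = \frac{1}{\sqrt{n}}\,a^\dagger|\varphi_{n-1}\rangle = \frac{1}{\sqrt{n}}\,\frac{1}{\sqrt{n-1}}\,(a^\dagger)^2|\varphi_{n-2}\rangle = \cdots = \frac{1}{\sqrt{n}}\,\frac{1}{\sqrt{n-1}}\cdots\frac{1}{\sqrt{2}}\,(a^\dagger)^n|\varphi_0\rangle \tag{C-12}$$

that is: 

$$|\varphi_n\rangle = \frac{1}{\sqrt{n!}}\,(a^\dagger)^n|\varphi_0\rangle \tag{C-13}$$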


b. ORTHONORMALIZATION AND CLOSURE RELATIONS 

Since H is Hermitian, the kets $|\varphi_n\rangle$ corresponding to different values of n 
are orthogonal. Since each of them is also normalized, they satisfy the 
orthonormalization relation: 

$$\langle\varphi_{n'}|\varphi_n\rangle = \delta_{nn'} \tag{C-14}$$

In addition, H is an observable (we shall assume this here without proof); 
the set of the $|\varphi_n\rangle$ therefore constitutes a basis in $\mathcal{E}_x$. This is expressed by the closure 
relation: 

$$\sum_n |\varphi_n\rangle\langle\varphi_n| = 1 \tag{C-15}$$

COMMENT: 

It can be verified directly from expression (C-13) that the kets $|\varphi_n\rangle$ are 
orthonormal: 

$$\langle\varphi_{n'}|\varphi_n\rangle = \frac{1}{\sqrt{n!\,n'!}}\,\langle\varphi_0|\,a^{n'}\,a^{\dagger n}\,|\varphi_0\rangle \tag{C-16}$$

But: 

$$a^{n'} a^{\dagger n}|\varphi_0\rangle = a^{n'-1}\,(a a^\dagger)\,a^{\dagger\,n-1}|\varphi_0\rangle = a^{n'-1}\,(a^\dagger a + 1)\,a^{\dagger\,n-1}|\varphi_0\rangle = n\,a^{n'-1}\,a^{\dagger\,n-1}|\varphi_0\rangle \tag{C-17}$$

(using the fact that $a^{\dagger\,n-1}|\varphi_0\rangle$ is an eigenstate of $N = a^\dagger a$ with the eigenvalue n − 1). 
Thus we can reduce the exponents of $a$ and $a^\dagger$ by iteration. We obtain, finally: 

if $n < n'$: $\langle\varphi_0|\,a^{n'} a^{\dagger n}\,|\varphi_0\rangle = n \times (n-1) \times \cdots \times 2 \times 1\ \langle\varphi_0|\,a^{n'-n}\,|\varphi_0\rangle$ (C-18-a) 

if $n > n'$: $\langle\varphi_0|\,a^{n'} a^{\dagger n}\,|\varphi_0\rangle = n \times (n-1) \cdots (n - n' + 1)\ \langle\varphi_0|\,(a^\dagger)^{n-n'}\,|\varphi_0\rangle$ (C-18-b) 

if $n = n'$: $\langle\varphi_0|\,a^{n} a^{\dagger n}\,|\varphi_0\rangle = n \times (n-1) \times \cdots \times 2 \times 1\ \langle\varphi_0|\varphi_0\rangle$ (C-18-c) 

The expression (C-18-a) is zero because $a|\varphi_0\rangle = 0$. Similarly, (C-18-b) is equal to zero 
because $\langle\varphi_0|\,(a^\dagger)^{n-n'}\,|\varphi_0\rangle$ can be considered to be the scalar product of $|\varphi_0\rangle$ and the 
bra associated with $a^{n-n'}|\varphi_0\rangle$, which is zero if $n > n'$. Finally, if we substitute (C-18-c) 
into (C-16), we see that $\langle\varphi_n|\varphi_n\rangle$ is equal to 1. 


c. ACTION OF THE VARIOUS OPERATORS 

The observables X and P are linear combinations of the operators $a$ and $a^\dagger$ 
[formulas (B-1) and (B-7)]. Consequently, all physical quantities can be expressed 
in terms of $a$ and $a^\dagger$. Now, the action of $a$ and $a^\dagger$ on the $|\varphi_n\rangle$ vectors is especially 
simple [see equations (C-19) below]. In most cases, it is therefore desirable 
to use the $\{|\varphi_n\rangle\}$ representation to calculate the matrix elements and mean 
values of the various observables. 

With the phase conventions introduced in §a above, the action of the $a$ 
and $a^\dagger$ operators on the vectors of the $\{|\varphi_n\rangle\}$ basis is given by: 

$$a^\dagger|\varphi_n\rangle = \sqrt{n+1}\;|\varphi_{n+1}\rangle \tag{C-19-a}$$

$$a|\varphi_n\rangle = \sqrt{n}\;|\varphi_{n-1}\rangle \tag{C-19-b}$$

We have already proved (C-19-a): it suffices to replace n by n + 1 in equations (C-9) 
and (C-11). To obtain (C-19-b), multiply both sides of (C-9) on the left by the 
operator a and use (C-11): 

$$a|\varphi_n\rangle = \frac{1}{\sqrt{n}}\,a a^\dagger|\varphi_{n-1}\rangle = \frac{1}{\sqrt{n}}\,(a^\dagger a + 1)\,|\varphi_{n-1}\rangle = \sqrt{n}\;|\varphi_{n-1}\rangle \tag{C-20}$$
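
COMMENT: 

The adjoint equations of (C-19-a) and (C-19-b) are: 

$$\langle\varphi_n|\,a = \sqrt{n+1}\;\langle\varphi_{n+1}| \tag{C-21-a}$$

$$\langle\varphi_n|\,a^\dagger = \sqrt{n}\;\langle\varphi_{n-1}| \tag{C-21-b}$$

Note that $a$ decreases or increases n by one unit depending on whether it 
acts on the ket $|\varphi_n\rangle$ or on the bra $\langle\varphi_n|$. Similarly, $a^\dagger$ increases or decreases n 
by one unit, depending on whether it acts on the ket $|\varphi_n\rangle$ or on the bra $\langle\varphi_n|$. 

Starting with (C-19) and using (B-1) and (B-7), we immediately find the 
expressions for the kets $X|\varphi_n\rangle$ and $P|\varphi_n\rangle$: 

$$X|\varphi_n\rangle = \sqrt{\frac{\hbar}{2m\omega}}\left[\sqrt{n+1}\;|\varphi_{n+1}\rangle + \sqrt{n}\;|\varphi_{n-1}\rangle\right] \tag{C-22-a}$$

$$P|\varphi_n\rangle = i\sqrt{\frac{m\hbar\omega}{2}}\left[\sqrt{n+1}\;|\varphi_{n+1}\rangle - \sqrt{n}\;|\varphi_{n-1}\rangle\right] \tag{C-22-b}$$

The matrix elements of the $a$, $a^\dagger$, X and P operators in the $\{|\varphi_n\rangle\}$ 
representation are therefore: 

$$\langle\varphi_{n'}|\,a\,|\varphi_n\rangle = \sqrt{n}\;\delta_{n',n-1} \tag{C-23-a}$$

$$\langle\varphi_{n'}|\,a^\dagger\,|\varphi_n\rangle = \sqrt{n+1}\;\delta_{n',n+1} \tag{C-23-b}$$

$$\langle\varphi_{n'}|\,X\,|\varphi_n\rangle = \sqrt{\frac{\hbar}{2m\omega}}\left[\sqrt{n+1}\;\delta_{n',n+1} + \sqrt{n}\;\delta_{n',n-1}\right] \tag{C-23-c}$$

$$\langle\varphi_{n'}|\,P\,|\varphi_n\rangle = i\sqrt{\frac{m\hbar\omega}{2}}\left[\sqrt{n+1}\;\delta_{n',n+1} - \sqrt{n}\;\delta_{n',n-1}\right] \tag{C-23-d}$$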

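The matrices representing $a$ and $a^\dagger$ are indeed Hermitian conjugates of each other, 
as can be seen from their explicit expressions: 

$$(a) = \begin{pmatrix}
0 & \sqrt{1} & 0 & 0 & \cdots & 0 & \cdots \\
0 & 0 & \sqrt{2} & 0 & \cdots & 0 & \cdots \\
0 & 0 & 0 & \sqrt{3} & \cdots & 0 & \cdots \\
\vdots & & & & \ddots & & \\
0 & 0 & 0 & 0 & \cdots & \sqrt{n} & \cdots \\
\vdots & & & & & & \ddots
\end{pmatrix} \tag{C-24-a}$$

$$(a^\dagger) = \begin{pmatrix}
0 & 0 & 0 & \cdots & 0 & \cdots \\
\sqrt{1} & 0 & 0 & \cdots & 0 & \cdots \\
0 & \sqrt{2} & 0 & \cdots & 0 & \cdots \\
0 & 0 & \sqrt{3} & \cdots & 0 & \cdots \\
\vdots & & & \ddots & & \\
0 & 0 & 0 & \cdots & \sqrt{n+1} & \cdots
\end{pmatrix} \tag{C-24-b}$$

As for the matrices representing X and P, they are both Hermitian: the matrix 
associated with X is, to within a constant factor, the sum of the two preceding 
ones; the matrix associated with P is proportional to their difference, but the 
presence of the factor i in (C-22-b) re-establishes its Hermiticity. 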
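
The statements about (C-24) can be checked on finite blocks of these infinite matrices. The sketch below is an illustration under stated assumptions: the truncation size `DIM`, the units $\hbar = m = \omega = 1$ and the helper names are all chosen here and are not part of the text. Truncation spoils the commutator $[a, a^\dagger] = 1$ only in the last diagonal element.

```python
import math

DIM = 6  # truncation size (assumption: the true matrices are infinite)


def matmul(A, B):
    """Product of two square matrices stored as lists of rows."""
    size = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(size)) for j in range(size)]
            for i in range(size)]


def dagger(A):
    """Hermitian conjugate (conjugate transpose)."""
    size = len(A)
    return [[complex(A[j][i]).conjugate() for j in range(size)]
            for i in range(size)]


def is_hermitian(A, tol=1e-12):
    """True if A equals its Hermitian conjugate within `tol`."""
    A_dag = dagger(A)
    return all(abs(A[i][j] - A_dag[i][j]) < tol
               for i in range(len(A)) for j in range(len(A)))


# (C-23-a): <n'|a|n> = sqrt(n) delta_{n', n-1}; rows are n', columns are n.
a = [[math.sqrt(n) if n_prime == n - 1 else 0.0 for n in range(DIM)]
     for n_prime in range(DIM)]
a_dag = dagger(a)

# (B-7) with hbar = m = omega = 1:
# X = (a_dag + a) / sqrt(2),  P = i (a_dag - a) / sqrt(2).
X = [[(a_dag[i][j] + a[i][j]) / math.sqrt(2) for j in range(DIM)]
     for i in range(DIM)]
P = [[1j * (a_dag[i][j] - a[i][j]) / math.sqrt(2) for j in range(DIM)]
     for i in range(DIM)]

N = matmul(a_dag, a)   # number operator, expected to be diag(0, 1, 2, ...)
comm = [[u - v for u, v in zip(row_1, row_2)]
        for row_1, row_2 in zip(matmul(a, a_dag), N)]   # a a_dag - a_dag a

print("X Hermitian:", is_hermitian(X))
print("P Hermitian:", is_hermitian(P))
print("diagonal of N:", [round(N[n][n].real, 12) for n in range(DIM)])
print("diagonal of [a, a_dag]:", [round(comm[n][n].real, 12) for n in range(DIM)])
```

The matrix `a` itself is not Hermitian, while the factor i makes `P` Hermitian even though it is built from the anti-Hermitian difference of `a_dag` and `a`.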


2. Wave functions associated with the stationary states 

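We shall now use the $\{|x\rangle\}$ representation and write the 
functions $\varphi_n(x) = \langle x|\varphi_n\rangle$ which then represent the eigenstates of the Hamiltonian. 

We have already determined the function $\varphi_0(x)$ which represents the ground 
state $|\varphi_0\rangle$ (cf. §B-3-a): 

$$\varphi_0(x) = \langle x|\varphi_0\rangle = \left(\frac{m\omega}{\pi\hbar}\right)^{1/4} e^{-\frac{1}{2}\frac{m\omega}{\hbar}x^2} \tag{C-25}$$

The constant which appears before the exponential ensures the normalization 
of $\varphi_0(x)$. 

To obtain the functions $\varphi_n(x)$ associated with the other stationary states 
of the harmonic oscillator, all we need to do is use expression (C-13) for the ket $|\varphi_n\rangle$ 
and the fact that, in the $\{|x\rangle\}$ representation, $a^\dagger$ is represented by: 

$$\frac{1}{\sqrt{2}}\left[\sqrt{\frac{m\omega}{\hbar}}\,x - \sqrt{\frac{\hbar}{m\omega}}\,\frac{d}{dx}\right]$$

since X is represented by multiplication by x and P, by $\frac{\hbar}{i}\frac{d}{dx}$ [formula (B-6-b)]. 
Thus we obtain: 

$$\varphi_n(x) = \langle x|\varphi_n\rangle = \frac{1}{\sqrt{n!}}\,\langle x|\,(a^\dagger)^n\,|\varphi_0\rangle = \frac{1}{\sqrt{n!}}\,\frac{1}{\sqrt{2^n}}\left[\sqrt{\frac{m\omega}{\hbar}}\,x - \sqrt{\frac{\hbar}{m\omega}}\,\frac{d}{dx}\right]^n \varphi_0(x) \tag{C-26}$$

that is: 

$$\varphi_n(x) = \left[\frac{1}{2^n\,n!}\left(\frac{\hbar}{m\omega}\right)^n\right]^{1/2} \left(\frac{m\omega}{\pi\hbar}\right)^{1/4} \left[\frac{m\omega}{\hbar}\,x - \frac{d}{dx}\right]^n e^{-\frac{1}{2}\frac{m\omega}{\hbar}x^2} \tag{C-27}$$

It is easy to see from this expression that $\varphi_n(x)$ is the product of $e^{-\frac{1}{2}\frac{m\omega}{\hbar}x^2}$ and a 
polynomial of degree n and parity $(-1)^n$ called a Hermite polynomial (cf. 
complements B_V and C_V). 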

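A simple calculation gives the first several functions $\varphi_n(x)$: 

$$\varphi_1(x) = \left[\frac{4}{\pi}\left(\frac{m\omega}{\hbar}\right)^3\right]^{1/4} x\; e^{-\frac{1}{2}\frac{m\omega}{\hbar}x^2}$$

$$\varphi_2(x) = \left(\frac{m\omega}{4\pi\hbar}\right)^{1/4} \left[2\,\frac{m\omega}{\hbar}\,x^2 - 1\right] e^{-\frac{1}{2}\frac{m\omega}{\hbar}x^2} \tag{C-28}$$

These functions are shown in figure 4 and the corresponding probability densities, 
in figure 5. Figure 6 gives the shape of the wave function $\varphi_n(x)$ and that of the 
probability density $|\varphi_n(x)|^2$ for n = 10. 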

We see from these figures that when n increases, the region of the Ox axis 
in which $\varphi_n(x)$ takes on non-negligible values becomes larger. This corresponds 
to the fact, in classical mechanics, that the amplitude of the particle's motion 
increases with the energy [cf. fig. 1 and relation (A-8)]. It follows that the mean 
value of the potential energy grows with n [cf. comment (ii) of §D-1], since $\varphi_n(x)$, 
when n is large, takes on non-negligible values in regions of the x-axis where V(x) 
is large. Moreover, we see in these figures that the number of zeros of $\varphi_n(x)$ is n 
(cf. complement B_V, where this property is derived). This implies that the mean 
kinetic energy of the particle increases with n [cf. comment (ii) of §D-1], since this 
energy is given by: 

$$\frac{1}{2m}\,\langle P^2\rangle = -\frac{\hbar^2}{2m}\int_{-\infty}^{+\infty} \varphi_n^*(x)\,\frac{d^2}{dx^2}\,\varphi_n(x)\,dx \tag{C-29}$$

When the number of zeros of $\varphi_n(x)$ increases, the curvature of the wave function 
increases, and, in (C-29), the second derivative $\frac{d^2}{dx^2}\varphi_n(x)$ takes on larger and larger 
values. 

Finally, when n is large, we observe (see, for example, figure 6) that the 
probability density $|\varphi_n(x)|^2$ is large for $x \simeq \pm x_M$ [where $x_M$ is the amplitude of 
the classical motion of energy $E_n$; cf. (A-8)]. This result is related to a feature of the 
motion predicted by classical mechanics: the classical particle has a zero velocity 
at $x = \pm x_M$; therefore, on the average, it spends more time in the neighborhood 
of these two points than in the center of the interval $-x_M < x < x_M$. 
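
The qualitative statements above (orthonormality, n zeros, density concentrated near the classical turning points for large n) can be checked numerically. The sketch below assumes units $\hbar = m = \omega = 1$ and generates the $\varphi_n(x)$ from a three-term recurrence obtained by projecting (C-22-a) onto $\langle x|$, rather than from (C-27) directly; the grid, the cut-off and the function names are choices made for this illustration.

```python
import math


def stationary_states(x, n_max):
    """Values phi_0(x), ..., phi_{n_max}(x) in units hbar = m = omega = 1.

    Projecting (C-22-a) onto <x| gives the three-term recurrence
    sqrt(n + 1) phi_{n+1} = sqrt(2) x phi_n - sqrt(n) phi_{n-1}.
    """
    values = [math.pi ** -0.25 * math.exp(-0.5 * x * x)]   # (C-25)
    for n in range(n_max):
        previous = values[n - 1] if n > 0 else 0.0
        values.append((math.sqrt(2.0) * x * values[n]
                       - math.sqrt(n) * previous) / math.sqrt(n + 1))
    return values


N_MAX = 10
STEP = 0.005
GRID = [-12.0 + i * STEP for i in range(4801)]
TABLE = [stationary_states(x, N_MAX) for x in GRID]   # TABLE[i][n] = phi_n(x_i)


def overlap(m, n):
    """Approximate integral of phi_m(x) phi_n(x) dx (rectangle rule)."""
    return STEP * sum(row[m] * row[n] for row in TABLE)


def count_zeros(n):
    """Number of sign changes of phi_n, ignoring negligibly small samples."""
    samples = [row[n] for row in TABLE if abs(row[n]) > 1e-12]
    return sum(1 for u, v in zip(samples, samples[1:]) if u * v < 0.0)


# Position of the highest peak of |phi_10|^2 versus the classical amplitude
# x_M = sqrt(2 n + 1), which follows from (1/2) x_M^2 = n + 1/2.
i_peak = max(range(len(GRID)), key=lambda i: TABLE[i][N_MAX] ** 2)
x_peak = abs(GRID[i_peak])
x_turning = math.sqrt(2 * N_MAX + 1)

print("norm of phi_3:", overlap(3, 3))
print("overlap of phi_2 and phi_4:", overlap(2, 4))
print("zeros of phi_n:", [count_zeros(n) for n in range(N_MAX + 1)])
print("highest peak of |phi_10|^2 at |x| =", x_peak, " x_M =", x_turning)
```

The highest peak of the density lies slightly inside the classical turning point, which is the quantum counterpart of the classical particle lingering near $\pm x_M$.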

FIGURE 4 

Wave functions associated with the first three levels (n = 0, 1, 2) of a harmonic oscillator. 


FIGURE 5 

Probability densities associated with the first three levels of a harmonic oscillator. 

D. DISCUSSION 


1. Mean values and root-mean-square deviations 
of X and P in a state $|\varphi_n\rangle$ 


Neither X nor P commutes with H, and the eigenstates $|\varphi_n\rangle$ of H are not 
eigenstates of X or P. Consequently, if the harmonic oscillator is in a stationary 
state $|\varphi_n\rangle$, a measurement of the observable X or the observable P can, a priori, 
yield any result (since the spectra of X and P include all real numbers). We shall 
now calculate the mean values of X and P in such a stationary state and then 
their root-mean-square deviations $\Delta X$ and $\Delta P$, which will enable us to verify 
the uncertainty relation. 

As we indicated in §C-1-c, we shall perform these calculations with the 
help of the operators $a$ and $a^\dagger$. As far as the mean values of X and P are concerned, 
the result follows directly from formulas (C-22), which show that neither X nor P 
has diagonal matrix elements: 

$$\langle\varphi_n|\,X\,|\varphi_n\rangle = 0$$

$$\langle\varphi_n|\,P\,|\varphi_n\rangle = 0 \tag{D-1}$$

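To obtain the root-mean-square deviations $\Delta X$ and $\Delta P$, we must calculate 
the mean values of $X^2$ and $P^2$: 

$$(\Delta X)^2 = \langle\varphi_n|\,X^2\,|\varphi_n\rangle - \big(\langle\varphi_n|\,X\,|\varphi_n\rangle\big)^2 = \langle\varphi_n|\,X^2\,|\varphi_n\rangle$$

$$(\Delta P)^2 = \langle\varphi_n|\,P^2\,|\varphi_n\rangle - \big(\langle\varphi_n|\,P\,|\varphi_n\rangle\big)^2 = \langle\varphi_n|\,P^2\,|\varphi_n\rangle \tag{D-2}$$

But, according to (B-1) and (B-7): 

$$X^2 = \frac{\hbar}{2m\omega}\,(a^\dagger + a)(a^\dagger + a) = \frac{\hbar}{2m\omega}\,(a^{\dagger 2} + a a^\dagger + a^\dagger a + a^2)$$

$$P^2 = -\frac{m\hbar\omega}{2}\,(a^\dagger - a)(a^\dagger - a) = -\frac{m\hbar\omega}{2}\,(a^{\dagger 2} - a a^\dagger - a^\dagger a + a^2) \tag{D-3}$$

The terms in $a^2$ and $a^{\dagger 2}$ do not contribute to the diagonal matrix elements, 
since $a^2|\varphi_n\rangle$ is proportional to $|\varphi_{n-2}\rangle$ and $a^{\dagger 2}|\varphi_n\rangle$ to $|\varphi_{n+2}\rangle$, both of which 
are orthogonal to $|\varphi_n\rangle$. On the other hand: 

$$\langle\varphi_n|\,(a^\dagger a + a a^\dagger)\,|\varphi_n\rangle = \langle\varphi_n|\,(2a^\dagger a + 1)\,|\varphi_n\rangle = 2n + 1 \tag{D-4}$$

Consequently: 

$$(\Delta X)^2 = \langle\varphi_n|\,X^2\,|\varphi_n\rangle = \left(n + \frac{1}{2}\right)\frac{\hbar}{m\omega} \tag{D-5-a}$$

$$(\Delta P)^2 = \langle\varphi_n|\,P^2\,|\varphi_n\rangle = \left(n + \frac{1}{2}\right) m\hbar\omega \tag{D-5-b}$$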

The product $\Delta X \cdot \Delta P$ is therefore equal to: 

$$\Delta X \cdot \Delta P = \left(n + \frac{1}{2}\right)\hbar \tag{D-6}$$

We again find (cf. complement C_III) that it is greater than or equal to $\hbar/2$. In fact, 
this lower bound is attained for n = 0, that is, for the ground state (§ 2 below). 
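
The product (D-6) can be recovered by letting $a$ and $a^\dagger$ act on vectors of coefficients in the $\{|\varphi_n\rangle\}$ basis, following (C-19). The sketch below is only an illustration: it assumes units $\hbar = m = \omega = 1$, a coefficient list just long enough to hold the component along $|\varphi_{n+1}\rangle$, and helper names (`lower`, `raise_`, `uncertainty_product`) invented for the example.

```python
import math


def lower(state):
    """Apply a, using (C-19-b): a|phi_n> = sqrt(n) |phi_{n-1}>."""
    return [math.sqrt(n + 1) * state[n + 1] for n in range(len(state) - 1)] + [0.0]


def raise_(state):
    """Apply a_dagger, using (C-19-a): a_dagger|phi_n> = sqrt(n + 1) |phi_{n+1}>.

    The component that would leave the truncated list is dropped, so the
    list must be long enough for the state it is applied to.
    """
    return [0.0] + [math.sqrt(n + 1) * state[n] for n in range(len(state) - 1)]


def norm_squared(state):
    """Square of the norm of a vector of (possibly complex) coefficients."""
    return sum(abs(c) ** 2 for c in state)


def uncertainty_product(n):
    """Delta X * Delta P in the state |phi_n>, in units hbar = m = omega = 1."""
    state = [0.0] * (n + 2)      # room for the component along |phi_{n+1}>
    state[n] = 1.0
    up, down = raise_(state), lower(state)
    x_state = [(u + d) / math.sqrt(2) for u, d in zip(up, down)]        # X|phi_n>
    p_state = [1j * (u - d) / math.sqrt(2) for u, d in zip(up, down)]   # P|phi_n>
    # <X> = <P> = 0 by (D-1), and X, P are Hermitian, so
    # (Delta X)^2 = <X^2> = || X|phi_n> ||^2, and likewise for P.
    return math.sqrt(norm_squared(x_state) * norm_squared(p_state))


for n in range(5):
    print(f"n = {n}: Delta X * Delta P = {uncertainty_product(n):.6f}")
```

In these units the printed values should be n + 1/2, with the lower bound 1/2 reached only for n = 0.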


COMMENTS: 

(i) If $x_M$ denotes the amplitude of the classical motion whose energy 
is $E_n = (n + 1/2)\hbar\omega$, it is easy to see, using (A-8) and (D-5-a), that: 

$$\Delta X = \frac{1}{\sqrt{2}}\,x_M \tag{D-7}$$

Similarly, if $p_M$ denotes the oscillation amplitude of the corresponding 
classical momentum: 

$$p_M = m\omega\,x_M \tag{D-8}$$

we obtain: 

$$\Delta P = \frac{1}{\sqrt{2}}\,p_M \tag{D-9}$$

It is not surprising that $\Delta X$ is of the order of the interval $[-x_M, +x_M]$ 
over which the classical motion occurs (cf. fig. 1): we saw at the end 
of § C that it is approximately inside this interval that $\varphi_n(x)$ takes on 
non-negligible values. Also, it is easy to understand why, when n increases, $\Delta X$ does 
also. For large n, the probability density $|\varphi_n(x)|^2$ has two symmetric peaks 
situated approximately at $x = \pm x_M$. The root-mean-square deviation 
cannot then be much smaller than the distance between the peaks, even if 
each of them is very sharp (cf. chap. III, § C-5 and the discussion of § 1-b of 
complement A_III). An analogous argument can be set forth for $\Delta P$ (cf. 
complement D_V). 

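(ii) The mean potential energy of a particle in the state $|\varphi_n\rangle$ is: 

$$\langle V(X)\rangle = \frac{1}{2}\,m\omega^2\,\langle X^2\rangle \tag{D-10}$$

that is, since $\langle X\rangle$ is zero [cf. (D-1)]: 

$$\langle V(X)\rangle = \frac{1}{2}\,m\omega^2\,(\Delta X)^2 \tag{D-11}$$

Similarly, we could find the mean kinetic energy of this particle: 

$$\left\langle \frac{P^2}{2m} \right\rangle = \frac{1}{2m}\,(\Delta P)^2 \tag{D-12}$$

Substituting relations (D-5) into (D-11) and (D-12), we obtain: 

$$\langle V(X)\rangle = \frac{1}{2}\left(n + \frac{1}{2}\right)\hbar\omega = \frac{E_n}{2}$$

$$\left\langle \frac{P^2}{2m} \right\rangle = \frac{1}{2}\left(n + \frac{1}{2}\right)\hbar\omega = \frac{E_n}{2} \tag{D-13}$$

The mean potential and kinetic energies are therefore equal. This is an 
illustration of the virial theorem (cf. exercise 10 of complement L_III). 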

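(iii) A stationary state $|\varphi_n\rangle$ has no equivalent in classical mechanics; its energy 
is not zero although the mean values $\langle X\rangle$ and $\langle P\rangle$ are. Nevertheless, 
there is a certain analogy between the state $|\varphi_n\rangle$ and that of a classical 
particle whose position is given by (A-5) [where $x_M$ is related to the energy $E_n$ 
by relation (A-8)], but for which the initial phase $\varphi$ of the motion is chosen 
at random (all values included between 0 and $2\pi$ have the same probability). 
The mean values of x and p are then zero, since: 

$$\bar{x}_{cl} = x_M\,\frac{1}{2\pi}\int_0^{2\pi} \cos(\omega t - \varphi)\,d\varphi = 0$$

$$\bar{p}_{cl} = -\,p_M\,\frac{1}{2\pi}\int_0^{2\pi} \sin(\omega t - \varphi)\,d\varphi = 0 \tag{D-14}$$

Moreover, we find, for the root-mean-square deviations of the position and 
the momentum, values identical to those of the state $|\varphi_n\rangle$ [formulas (D-7) 
and (D-9)]: 

$$\overline{x^2}_{cl} = x_M^2\,\frac{1}{2\pi}\int_0^{2\pi} \cos^2(\omega t - \varphi)\,d\varphi = \frac{x_M^2}{2}$$

$$\overline{p^2}_{cl} = p_M^2\,\frac{1}{2\pi}\int_0^{2\pi} \sin^2(\omega t - \varphi)\,d\varphi = \frac{p_M^2}{2} \tag{D-15}$$

that is: 

$$\delta x_{cl} = \sqrt{\overline{x^2}_{cl} - (\bar{x}_{cl})^2} = \frac{x_M}{\sqrt{2}}$$

$$\delta p_{cl} = \sqrt{\overline{p^2}_{cl} - (\bar{p}_{cl})^2} = \frac{p_M}{\sqrt{2}} \tag{D-16}$$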
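
The classical averages over a random initial phase can be reproduced with a simple quadrature. In the sketch below, the amplitude `X_M`, the instant `OMEGA_T` and the function name `phase_average` are hypothetical values and names picked for the illustration; the position is taken as $x_M \cos(\omega t - \varphi)$, as in the integrals above.

```python
import math


def phase_average(func, samples=2000):
    """Average of func(phi) over a phase phi uniform on [0, 2*pi) (midpoint rule)."""
    step = 2.0 * math.pi / samples
    return sum(func((k + 0.5) * step) for k in range(samples)) / samples


X_M = 1.7        # hypothetical classical amplitude
OMEGA_T = 0.9    # hypothetical value of omega * t; the averages do not depend on it

mean_x = phase_average(lambda phi: X_M * math.cos(OMEGA_T - phi))
mean_x2 = phase_average(lambda phi: (X_M * math.cos(OMEGA_T - phi)) ** 2)
delta_x = math.sqrt(mean_x2 - mean_x ** 2)

print("mean position:", mean_x)
print("rms deviation:", delta_x, " x_M / sqrt(2):", X_M / math.sqrt(2))
```

The same computation with the sine in place of the cosine and $p_M$ in place of $x_M$ gives the momentum deviation $p_M/\sqrt{2}$.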


2. Properties of the ground state 

In classical mechanics, the lowest energy of the harmonic oscillator is obtained 
when the particle is at rest (zero momentum and kinetic energy) at the x-origin 
(x = 0 and therefore zero potential energy). The situation is completely different 
in quantum mechanics: the minimum energy state is $|\varphi_0\rangle$, whose energy is not 
zero, and the associated wave function has a certain spatial extension, characterized 
by the root-mean-square deviation $\Delta X = \sqrt{\hbar/2m\omega}$. 

This essential difference between the quantum and classical results can be 
seen to have its source in the uncertainty relations, which forbid the simultaneous 
minimization of the kinetic energy and the potential energy. As we pointed out 
in complements C_I and M_III, the ground state corresponds to a compromise in 
which the sum of these two energies is as small as possible. 

In the special case of a harmonic oscillator, it is possible to state these 
qualitative considerations semi-quantitatively, and thus find the order of magnitude 
of the energy and the spatial extension of the ground state. If the distance $\xi$ 
characterizes this spatial extension, the mean potential energy will be of the 
order of: 

$$\bar{V} \simeq \frac{1}{2}\,m\omega^2\xi^2 \tag{D-17}$$

But $\Delta P$ is then equal to about $\hbar/\xi$, so the mean kinetic energy is approximately: 

$$\bar{T} = \frac{\overline{p^2}}{2m} \simeq \frac{\hbar^2}{2m\xi^2} \tag{D-18}$$

The order of magnitude of the total energy is therefore: 

$$\bar{E} = \bar{T} + \bar{V} \simeq \frac{\hbar^2}{2m\xi^2} + \frac{1}{2}\,m\omega^2\xi^2 \tag{D-19}$$


FIGURE 7 

Variation of the potential energy $\bar{V}$ and of the kinetic 
energy $\bar{T}$ with respect to a parameter $\xi$ which characterizes 
the spatial extension of the wave function about 
x = 0. Since the potential energy is at a minimum at 
x = 0, $\bar{V}$ is a function which increases with $\xi$ ($\bar{V} \propto \xi^2$). 
On the other hand, according to Heisenberg's uncertainty 
relation, the kinetic energy $\bar{T}$ is a decreasing 
function of $\xi$. The lowest possible total energy, 
obtained for $\xi = \xi_m$, results from a compromise in 
which the sum $\bar{T} + \bar{V}$ is at a minimum. 


The variation of $\bar{T}$, $\bar{V}$ and $\bar{E}$ with respect to $\xi$ is shown in figure 7. For small 
values of $\xi$, $\bar{T}$ prevails over $\bar{V}$; the opposite occurs for large values of $\xi$. The 
ground state therefore corresponds approximately to the minimum of the function 
(D-19); it is easy to see that this minimum occurs at: 

$$\xi_m \simeq \sqrt{\frac{\hbar}{m\omega}} \tag{D-20}$$

and is equal to: 

$$E_m \simeq \hbar\omega \tag{D-21}$$

We again find the correct orders of magnitude of $E_0$ and $\Delta X$ in the state $|\varphi_0\rangle$. 
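
The minimization behind (D-20) and (D-21) takes one line each. The following worked step is a sketch that treats the order-of-magnitude estimate (D-19) as an exact function $\bar{E}(\xi)$, which is an assumption of the illustration:

```latex
\begin{align*}
\frac{d\bar{E}}{d\xi} = -\frac{\hbar^2}{m\xi^3} + m\omega^2\xi = 0
&\quad\Longrightarrow\quad
\xi_m^4 = \frac{\hbar^2}{m^2\omega^2},
\qquad
\xi_m = \sqrt{\frac{\hbar}{m\omega}} \\
\bar{E}(\xi_m) = \frac{\hbar^2}{2m}\,\frac{m\omega}{\hbar}
               + \frac{1}{2}\,m\omega^2\,\frac{\hbar}{m\omega}
&= \frac{\hbar\omega}{2} + \frac{\hbar\omega}{2}
 = \hbar\omega
\end{align*}
```

At the minimum the two contributions are equal. The estimate differs from the exact values $E_0 = \hbar\omega/2$ and $\Delta X = \sqrt{\hbar/2m\omega}$ only by numerical factors of order 1, as expected from an argument of this kind.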

The harmonic oscillator possesses the peculiarity that, because of the form 
of the potential V(x), the product $\Delta X \cdot \Delta P$ actually attains its lower bound, $\hbar/2$, 
in the ground state $|\varphi_0\rangle$ [formula (D-6)]. This is related to the fact (cf. 
complement C_III) that the wave function of the ground state is Gaussian. 


3. Time evolution of the mean values 

Consider a harmonic oscillator whose state at t = 0 is: 

$$|\psi(0)\rangle = \sum_{n=0}^{\infty} c_n(0)\,|\varphi_n\rangle \tag{D-22}$$

($|\psi(0)\rangle$ is assumed to be normalized). Its state $|\psi(t)\rangle$ at t can be obtained 
by using rule (D-54) of chapter III: 

$$|\psi(t)\rangle = \sum_{n=0}^{\infty} c_n(0)\,e^{-iE_n t/\hbar}\,|\varphi_n\rangle = \sum_{n=0}^{\infty} c_n(0)\,e^{-i(n + 1/2)\omega t}\,|\varphi_n\rangle \tag{D-23}$$

The mean value of any physical quantity A is therefore given as a function 
of time by: 

$$\langle\psi(t)|\,A\,|\psi(t)\rangle = \sum_{m=0}^{\infty}\sum_{n=0}^{\infty} c_m^*(0)\,c_n(0)\,A_{mn}\,e^{i(m-n)\omega t} \tag{D-24}$$

with: 

$$A_{mn} = \langle\varphi_m|\,A\,|\varphi_n\rangle \tag{D-25}$$

Since m and n are integers, the time evolution of the mean values involves only 
the frequency $\omega/2\pi$ and its various harmonics, which constitute the Bohr frequencies 
of the harmonic oscillator. 

Let us consider, in particular, the mean values of the observables X and P. 
According to formulas (C-22), the only non-zero matrix elements $X_{mn}$ and $P_{mn}$ 
are those for which $m = n \pm 1$. Consequently, the mean values of X and P 
include only terms in $e^{\pm i\omega t}$; they are sinusoidal functions of time with angular 
frequency $\omega$. This obviously relates to the classical solution of the harmonic 
oscillator problem. Moreover, as we pointed out in the discussion of Ehrenfest's 
theorem (chap. III, §D-1-d-γ), the form of the harmonic oscillator potential 
implies that for all $|\psi\rangle$ the mean values of X and P rigorously satisfy the 
classical equations of motion. Thus, according to general formulas (D-34) and 
(D-35) of chapter III: 

$$\frac{d}{dt}\langle X\rangle = \frac{1}{i\hbar}\,\langle[X, H]\rangle = \frac{\langle P\rangle}{m} \tag{D-26-a}$$

$$\frac{d}{dt}\langle P\rangle = \frac{1}{i\hbar}\,\langle[P, H]\rangle = -\,m\omega^2\,\langle X\rangle \tag{D-26-b}$$

If we integrate these equations, we obtain: 

$$\langle X\rangle(t) = \langle X\rangle(0)\,\cos\omega t + \frac{1}{m\omega}\,\langle P\rangle(0)\,\sin\omega t$$

$$\langle P\rangle(t) = \langle P\rangle(0)\,\cos\omega t - m\omega\,\langle X\rangle(0)\,\sin\omega t \tag{D-27}$$

We again find the sinusoidal form indicated by formula (D-24). 
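
The agreement between the double sum (D-24) and the classical-looking solution (D-27) can be checked for a particular superposition. The sketch below assumes units $\hbar = m = \omega = 1$; the initial amplitudes in `raw` are hypothetical numbers chosen for the illustration, and the function names are not from the text.

```python
import cmath
import math

# Hypothetical initial amplitudes c_n(0) on |phi_0>, |phi_1>, |phi_2>.
raw = [1.0 + 0.0j, 0.5 + 0.5j, -0.3j]
norm = math.sqrt(sum(abs(c) ** 2 for c in raw))
C0 = [c / norm for c in raw]


def x_element(m, n):
    """X_mn from (C-23-c), with hbar = m = omega = 1."""
    if m == n + 1:
        return math.sqrt(n + 1) / math.sqrt(2)
    if m == n - 1:
        return math.sqrt(n) / math.sqrt(2)
    return 0.0


def p_element(m, n):
    """P_mn from (C-23-d), with hbar = m = omega = 1."""
    if m == n + 1:
        return 1j * math.sqrt(n + 1) / math.sqrt(2)
    if m == n - 1:
        return -1j * math.sqrt(n) / math.sqrt(2)
    return 0.0


def mean_value(element, t):
    """Double sum (D-24): sum over m, n of conj(c_m(0)) c_n(0) A_mn exp(i (m - n) t)."""
    total = sum(C0[m].conjugate() * C0[n] * element(m, n) * cmath.exp(1j * (m - n) * t)
                for m in range(len(C0)) for n in range(len(C0)))
    return total.real   # the imaginary part vanishes for a Hermitian operator


def classical_x(t):
    """First line of (D-27)."""
    return mean_value(x_element, 0.0) * math.cos(t) + mean_value(p_element, 0.0) * math.sin(t)


def classical_p(t):
    """Second line of (D-27)."""
    return mean_value(p_element, 0.0) * math.cos(t) - mean_value(x_element, 0.0) * math.sin(t)


for t in (0.0, 0.7, 2.3):
    print(f"t = {t}: <X> = {mean_value(x_element, t):+.6f}  (D-27: {classical_x(t):+.6f})")
```

If all amplitudes but one are set to zero, both `mean_value(x_element, t)` and `mean_value(p_element, t)` vanish at all times, since X and P have no diagonal matrix elements.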


COMMENT: 

It is important to note that this analogy with the classical situation appears 
only when $|\psi(0)\rangle$ is a superposition of states $|\varphi_n\rangle$ of the type of (D-22), where 
several coefficients $c_n(0)$ are non-zero. If all these coefficients except one are 
equal to zero, the oscillator is in a stationary state and the mean values of all 
the observables are constant over time. 

It follows that, in a stationary state $|\varphi_n\rangle$, the behavior of a harmonic 
oscillator is totally different from that predicted by classical mechanics, even if n is 
very large (the limit of large quantum numbers). If we want to construct a wave 
packet whose average position oscillates over time, we must superpose different 
states $|\varphi_n\rangle$ (see complement G_V). 


References and suggestions for further reading : 

Dirac (1.13), §34; Messiah (1.17), chap. XII. 


COMPLEMENTS OF CHAPTER V 


A_V: SOME EXAMPLES OF HARMONIC OSCILLATORS 

A_V: demonstrates, in some examples chosen from various fields, the importance 
of the quantum mechanical harmonic oscillator in physics. Semi-quantitative and 
rather simple; recommended for a first reading. 

B_V: STUDY OF THE STATIONARY STATES IN THE $\{|x\rangle\}$ REPRESENTATION. 
HERMITE POLYNOMIALS 

B_V: technical study of the stationary wave functions of the harmonic oscillator. 
Intended to serve as a reference. 

C_V: SOLVING THE EIGENVALUE EQUATION OF THE HARMONIC OSCILLATOR 
BY THE POLYNOMIAL METHOD 

C_V: another method which yields the results of chapter V. Reveals the relation 
between energy quantization and the behavior of the wave functions at infinity. 
Moderately difficult. 

D_V: STUDY OF THE STATIONARY STATES IN THE $\{|p\rangle\}$ REPRESENTATION 

D_V: shows that, in a stationary state of the harmonic oscillator, the momentum 
probability distribution has the same form as that of position. Fairly simple. 

E_V: THE ISOTROPIC THREE-DIMENSIONAL HARMONIC OSCILLATOR 

E_V: generalization of the results of chapter V to three dimensions. Recommended 
for a first reading, since it is simple and important. 

F_V: A CHARGED HARMONIC OSCILLATOR PLACED IN A UNIFORM ELECTRIC FIELD 

F_V: a direct and simple application of the results of chapter V (except for §3, 
which uses the translation operator introduced in complement E_II). 
Recommended for a first reading. 

G_V: COHERENT "QUASI-CLASSICAL" STATES OF THE HARMONIC OSCILLATOR 

G_V: detailed study of the "quasi-classical" states of the harmonic oscillator, 
which illustrates the relation between quantum and classical mechanics. Important 
because of its applications to the quantum theory of radiation. Moderately 
difficult; can be omitted in a first reading. 

H_V: NORMAL VIBRATIONAL MODES OF TWO COUPLED HARMONIC OSCILLATORS 

H_V: study, in the very simple case of two coupled harmonic oscillators, of normal 
vibrational modes of a system. Recommended, since it is simple and physically 
important. 

J_V: VIBRATIONAL MODES OF AN INFINITE LINEAR CHAIN OF COUPLED 
HARMONIC OSCILLATORS; PHONONS 

K_V: VIBRATIONAL MODES OF A CONTINUOUS PHYSICAL SYSTEM. 
APPLICATION TO RADIATION; PHOTONS 

J_V, K_V: introduction, using simplified models, of concepts which are particularly 
important in physics. Rather difficult (graduate level); can be reserved for later 
study. 

J_V: determination of the normal vibrational modes of a linear chain of coupled 
oscillators, leading to the concept of the phonon, fundamental to solid state 
physics. 

K_V: normal vibrational modes of a continuous system. A simple way to introduce 
photons in the quantum mechanical study of the electromagnetic field. 

L_V: THE ONE-DIMENSIONAL HARMONIC OSCILLATOR IN THERMODYNAMIC 
EQUILIBRIUM AT A TEMPERATURE T 

L_V: application of the density operator (introduced in complement E_III) to a 
harmonic oscillator in thermodynamic equilibrium. Important from a physical 
point of view, but requires knowledge of E_III. 
M v : EXERCISES 


Complement A_V 

SOME EXAMPLES OF HARMONIC OSCILLATORS 


1. Vibration of the nuclei of a diatomic molecule 

a. Interaction energy of two atoms 

b. Motion of the nuclei 

c. Experimental observations of nuclear vibration 

2. Vibration of the nuclei in a crystal 

a. The Einstein model 

b. The quantum mechanical nature of crystalline vibrations 

3. Torsional oscillations of a molecule: ethylene 

a. Structure of the ethylene molecule C$_2$H$_4$

b. Classical equations of motion 

c. Quantum mechanical treatment 

4. Heavy muonic atoms 

a. Comparison with the hydrogen atom 

b. The heavy muonic atom treated as a harmonic oscillator 

c. Order of magnitude of the energies and extension of the wave functions 


We mentioned in the introduction to chapter V that the results obtained in 
the study of the harmonic oscillator are applicable to numerous cases in physics, 
especially those concerning small oscillations of a system about a position of 
stable equilibrium (where the potential energy is at a minimum). The aim of 
this complement is to describe some examples of such oscillations and to point 
out their physical importance: vibration of the nuclei in a diatomic molecule 
or a crystalline lattice, torsional oscillations in a molecule, motion of a muon $\mu^-$
inside a heavy nucleus. We do not intend to discuss these phenomena in great 
detail here. We shall confine ourselves to a simple, qualitative discussion. 


1. Vibration of the nuclei of a diatomic molecule 
a. INTERACTION ENERGY OF TWO ATOMS 

The formation of a molecule from two neutral atoms occurs because the 
interaction energy V(r) of these two atoms has a minimum (r is the distance 
between them). The form of V(r) is shown in figure 1. When r is very large, 
the two atoms do not interact and V(r) approaches a constant which we 
shall choose as the energy origin. Then, as r decreases, V(r) varies approximately
like $-1/r^6$: the corresponding attractive forces are the Van der Waals forces
(which we shall study in complement C_XI). When r becomes so small that the
electronic wave functions overlap, V(r) decreases faster and passes through a
minimum at $r = r_e$; it then increases and becomes very large as r approaches
zero.

FIGURE 1

Form of the interaction potential between two atoms which can form a stable
molecule. Classically, $V_0$ is the dissociation energy of the molecule and $r_e$
the distance between the two nuclei in the equilibrium position. In quantum
mechanics, one obtains vibrational states (the horizontal lines inside the well)
whose energies are all greater than $-V_0$.


The minimum of V(r) is responsible for the phenomenon of the chemical 
bond which can form between the two atoms. We have already pointed out, 
in §C-2-c of chapter IV (taking the H$_2^+$ molecule as an example), that the cause
of this lowering of the energy is a delocalization phenomenon of the electronic 
states (quantum resonance) which allows the electrons to profit from the attraction 
of the two nuclei. The rapid rise of V(r) at small distances is due to the repulsion 
of the nuclei. 

If the nuclei were classical particles, they would have stable equilibrium
positions separated by $r = r_e$. The depth $V_0$ of the potential well at $r = r_e$ is
called, classically, the dissociation energy of the molecule: it is, in fact, the
energy that must be furnished to the two atoms in order to separate them.
The larger $V_0$, the more stable the molecule.

The theoretical and experimental determination of the curve of figure 1 is a 
very important problem in atomic and molecular physics. We shall see that by 
studying the vibrations of the nuclei we get a certain amount of information 
about this curve. 

comment (the Born-Oppenheimer approximation):

The quantum mechanical description of a diatomic molecule is actually a very
complex problem; it involves finding the stationary states of a system of particles, the
nuclei and the electrons, all interacting with each other. In general, it is impossible
to solve the Schrödinger equation for such a system exactly. A significant simplification
arises from the fact that the mass of the electrons is much smaller than that of the nuclei.
It follows from this that one can, in a first approximation, study the two motions
separately. One begins by determining the motion of the electrons for a fixed value of the
distance r between the two nuclei; thus one obtains a series of stationary states for the
electronic system, of energies $E_1(r)$, $E_2(r)$, ... Then one considers the ground state, of
energy $E_1(r)$, of the electronic system; when r varies because of the motion of the nuclei,
the electronic system always remains in the ground state, for all r. This means that the
system's wave function adapts itself instantaneously to any change in r: the electrons,
which are very mobile, are said to follow "adiabatically" the motion of the nuclei.
In the study of this motion, the electronic energy $E_1(r)$ then plays the role of a potential
energy of interaction between these two nuclei. This interaction potential depends on the
distance between the nuclei, r, and adds to their electrostatic repulsion $Z_1 Z_2 e^2 / r$
(where $Z_1$ and $Z_2$ are the atomic numbers of the two nuclei; we have set $e^2 = q^2/4\pi\varepsilon_0$,
where q is the charge of the electron). The total potential energy V(r) of the system of the
two nuclei, which enables us to determine their motion, is then:

$$V(r) = E_1(r) + \frac{Z_1 Z_2 e^2}{r} \tag{1}$$

It is this function that is shown in figure 1.


b. MOTION OF THE NUCLEI 


α. Separation of the rotational and vibrational motions


We are thus faced with a problem which involves the motion of two
particles of masses $m_1$ and $m_2$, whose interaction is described by the potential V(r)
of figure 1, which depends only on the distance between them. The problem is
complicated by the existence of several degrees of freedom: vibrational (variation
of r) and rotational (variation of the polar angles $\theta$ and $\varphi$ which give the direction
of the axis of the molecule). In addition, these degrees of freedom are coupled:
when the molecule vibrates, its moment of inertia changes because of the variation
of r, and the rotational energy is modified.

If we confine ourselves to small amplitude vibrations, it can be shown that
the coupling between vibrational and rotational degrees of freedom is negligible
since the relative variation of the moment of inertia is very small during the
vibration. The problem is then reduced (as we shall see in detail in complement F_VII)
to two independent problems: in the first place, the study of the rotation of a
"dumbbell" composed of two masses $m_1$ and $m_2$ separated by a fixed distance $r_e$*;
plus a one-dimensional problem (in which r is the only variable) involving a
fictitious particle whose mass m is equal to the reduced mass of $m_1$ and $m_2$
(cf. chap. VII, §B):

$$m = \frac{m_1 m_2}{m_1 + m_2} \tag{2}$$

moving in the potential V(r) of figure 1. We must then solve the eigenvalue
equation:

$$\left[-\frac{\hbar^2}{2m}\frac{d^2}{dr^2} + V(r)\right]\varphi(r) = E\,\varphi(r) \tag{3}$$


We shall concentrate on the latter problem here. 


β. Vibrational states

If we confine ourselves to small amplitude oscillations, we can make a
limited expansion of V(r) in the neighborhood of its minimum, at $r = r_e$:

$$V(r) = -V_0 + \frac{1}{2} V''(r_e)(r - r_e)^2 + \frac{1}{6} V'''(r_e)(r - r_e)^3 + \dots \tag{4}$$

* We shall study this system (also called a "rigid rotator") quantum mechanically in
complement C_VI, once we have introduced angular momentum.

The discussion in §A-2 of chapter V shows that if we neglect higher than second-order
terms in expression (4), we are left with the equation of a one-dimensional
harmonic oscillator centered at $r = r_e$, of angular frequency:

$$\omega = \sqrt{\frac{V''(r_e)}{m}} \tag{5}$$

The vibrational states $|\varphi_v\rangle$, shown by the horizontal lines in figure 1,
therefore have energies given by:

$$E_v = \left(v + \frac{1}{2}\right)\hbar\omega - V_0 \tag{6}$$

where v = 0, 1, 2, ... (v is used instead of n in the notation of molecular
vibrations).

According to the discussion in §D-3 of chapter V, the mean value $\langle R \rangle$ of
the distance between the two nuclei oscillates about $r_e$ with a frequency of
$\omega/2\pi$, which can thus be seen to be the vibrational frequency of the molecule.


comments:

(i) Even in the ground state, the wave function of a harmonic oscillator has a finite spread,
of the order of $\sqrt{\hbar/2m\omega}$ (cf. §D-2 of chapter V). The distance between the two nuclei
of the molecule in the vibrational ground state is therefore defined only to within
$\sqrt{\hbar/2m\omega}$. An important condition for the decoupling of the vibrational and rotational
degrees of freedom is therefore that:

$$\sqrt{\frac{\hbar}{2m\omega}} \ll r_e \tag{7}$$

(ii) When the reduced mass m is known, the measurement of $\omega$ yields, according to (5), the
second derivative $V''(r_e)$. When the quantum number v increases, it is no longer possible
to neglect terms in $(r - r_e)^3$ in expression (4) (which indicate the deviation of the
potential well from a parabolic form). The oscillator then becomes anharmonic. Studying
the effects of the term in $(r - r_e)^3$ of (4) by perturbation theory (as we shall do in
complement A_XI), one finds that the separation $E_{v+1} - E_v$ of two neighboring states is
not the same for large and small values of v. Studying the variation of $E_{v+1} - E_v$ with
respect to v thus enables us to obtain the coefficient $V'''(r_e)$ of the term in $(r - r_e)^3$.
Thus we see how the study of the frequencies of molecular vibration enables us to
define more precisely the form of the curve V(r) in the neighborhood of its minimum.


γ. Order of magnitude of the vibrational frequencies

Molecular vibrational frequencies are commonly expressed in cm$^{-1}$, that is, by giving the
inverse of the wavelength $\lambda$ (expressed in cm) of an electromagnetic wave of the same frequency $\nu$.
Note that 1 cm$^{-1}$ corresponds to a frequency of $3 \times 10^{10}$ Hertz and to an energy of
$1.24 \times 10^{-4}$ eV.
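
The two conversion factors quoted here follow from ν = c/λ and E = hν. The following is a minimal Python sketch of that arithmetic; the function names are illustrative, and the constants are current SI values, assumed here rather than taken from the tables of this book:

```python
# Assumed constants (current SI values, not the older values tabulated in this book).
C = 2.99792458e8       # speed of light in vacuum, m/s
H = 6.62607015e-34     # Planck constant, J s
EV = 1.602176634e-19   # one electron volt, J


def wavenumber_to_hz(sigma_cm):
    """Frequency in Hz of a wave whose wavenumber is sigma_cm (in cm^-1)."""
    return C * sigma_cm * 100.0  # 1 cm^-1 = 100 m^-1


def wavenumber_to_ev(sigma_cm):
    """Photon energy h*nu in eV for a wavenumber given in cm^-1."""
    return H * wavenumber_to_hz(sigma_cm) / EV


def wavenumber_to_micron(sigma_cm):
    """Wavelength in microns for a wavenumber given in cm^-1."""
    return 1.0e4 / sigma_cm  # 1 cm = 10^4 microns
```

Applied to a wavenumber of a few thousand cm$^{-1}$, `wavenumber_to_micron` gives a wavelength of a few microns, which is the short-wavelength end of the infrared range discussed in this section.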

The vibrational frequencies of diatomic molecules fall between several tens and several
thousands of cm$^{-1}$. The corresponding wavelengths therefore go from a few microns to a few
hundred microns, consequently falling in the infrared.

Formula (5) shows that as m decreases, $\omega$ increases. $\omega$ also increases with $V''(r_e)$, that is,
with a greater curvature of the potential well at $r = r_e$. Since $r_e$ is always of the same order of
magnitude (a few Å), $V''(r_e)$ increases with the depth $V_0$ of the well: $\omega$ therefore increases with
the chemical stability. We shall consider some concrete illustrations of the preceding
observations.

The vibrational frequencies of the hydrogen and deuterium molecules (H$_2$ and D$_2$) are,
respectively (not taking into account anharmonicity corrections):

$$\nu_{H_2} = 4\,401\ \mathrm{cm}^{-1}$$

$$\nu_{D_2} = 3\,112\ \mathrm{cm}^{-1} \tag{8}$$

The curve V(r) is the same in these two cases: the chemical bond between the two atoms depends
only on the electronic atmosphere. However, the reduced mass of H$_2$ is half as large as that of D$_2$.
We must therefore have, according to (5), $\nu_{H_2} = \sqrt{2}\,\nu_{D_2}$. This is in agreement with the
experimental values (8).
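
The isotope argument can be checked numerically. The sketch below is only an illustration: it takes the two frequencies from (8), approximates the nuclear masses by the mass numbers 1 and 2 (an assumption), and uses the fact that, by (5), the frequency scales as the inverse square root of the reduced mass when V''(r_e) is unchanged.

```python
import math

NU_H2 = 4401.0  # cm^-1, from (8)
NU_D2 = 3112.0  # cm^-1, from (8)


def reduced_mass(m1, m2):
    """Reduced mass of two particles, as in formula (2)."""
    return m1 * m2 / (m1 + m2)


def frequency_ratio(mu_a, mu_b):
    """nu_a / nu_b for two molecules with the same curvature V''(r_e), from (5)."""
    return math.sqrt(mu_b / mu_a)


# Masses in units of the nucleon mass (approximation: mass number only).
mu_h2 = reduced_mass(1.0, 1.0)
mu_d2 = reduced_mass(2.0, 2.0)

predicted = frequency_ratio(mu_h2, mu_d2)  # equals sqrt(2)
measured = NU_H2 / NU_D2

print(f"predicted ratio: {predicted:.4f}")
print(f"measured ratio:  {measured:.4f}")
```

The two ratios agree to better than one part in a thousand, which is the agreement referred to in the text.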

Now let us consider an example of two molecules which have about the same reduced
mass but very different chemical stabilities. The molecule $^{79}$Br$^{85}$Rb is chemically stable
(halogen-alkali bond); its vibrational frequency is 181 cm$^{-1}$. Molecules of $^{84}$Kr$^{85}$Rb have
been observed recently in optical pumping experiments. Their chemical stability is much lower,
because krypton, which is a rare gas, is practically inert from a chemical point of view (in fact,
the cohesion of the molecule is due only to Van der Waals forces). These molecules have
been found to have a vibrational frequency of the order of 13 cm$^{-1}$. The considerable difference
between this figure and the preceding one is due solely to the difference in chemical stability
of the two types of molecules, since the reduced masses are the same to within a few per cent.
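
A rough numerical reading of this comparison, as a sketch only: the nuclear masses are approximated by the mass numbers (an assumption), and the curvature of the well is obtained from (5) in the form V''(r_e) = m ω², so that the ratio of curvatures is the ratio of m ν².

```python
def reduced_mass(a1, a2):
    """Reduced mass, in nucleon-mass units, from two mass numbers."""
    return a1 * a2 / (a1 + a2)


MU_BRRB = reduced_mass(79.0, 85.0)  # 79Br 85Rb
MU_KRRB = reduced_mass(84.0, 85.0)  # 84Kr 85Rb

NU_BRRB = 181.0  # cm^-1, value quoted in the text
NU_KRRB = 13.0   # cm^-1, order of magnitude quoted in the text

# Relative difference between the two reduced masses.
mass_mismatch = abs(MU_KRRB - MU_BRRB) / MU_BRRB

# From (5): V''(r_e) = m * omega^2, so the ratio of curvatures is the ratio of m * nu^2.
curvature_ratio = (MU_BRRB * NU_BRRB**2) / (MU_KRRB * NU_KRRB**2)

print(f"reduced masses differ by {100.0 * mass_mismatch:.1f} %")
print(f"curvature ratio BrRb / KrRb: {curvature_ratio:.0f}")
```

Under these assumptions the reduced masses differ by about three per cent, while the curvature of the well differs by roughly two orders of magnitude.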


c. EXPERIMENTAL OBSERVATIONS OF NUCLEAR VIBRATION 

We shall now explain how nuclear vibration can be detected experimentally.
In particular, we shall consider the interaction of the molecule with an
electromagnetic wave.

α. Infrared absorption and emission

First, let us assume the molecule to be heteropolar (composed of two
different atoms). Since the electrons are attracted towards the more electronegative
atom, the molecule generally has a permanent dipole moment D(r) which depends
on the distance r between the two nuclei. Expanding D(r) in the neighborhood
of the equilibrium position $r = r_e$, we obtain:

$$D(r) = d_0 + d_1 (r - r_e) + \dots \tag{9}$$

where $d_0$ and $d_1$ are real constants.

When the molecule is in a linear superposition $|\psi(t)\rangle$ of several stationary
vibrational states $|\varphi_v\rangle$, the mean value $\langle\psi(t)|D(R)|\psi(t)\rangle$ of its electric dipole
moment oscillates about the value $d_0$ with a frequency of $\omega/2\pi$. The oscillatory
term arises from the mean value of the term $d_1(R - r_e)$ of (9) ($R - r_e$ plays
the same role in our problem as the observable X of the harmonic oscillator
studied in §D-3 of chapter V). Now $(R - r_e)$ has a non-zero matrix element
between two states $|\varphi_v\rangle$ and $|\varphi_{v'}\rangle$ only when $v - v' = \pm 1$. This selection rule
enables us to understand why only one Bohr frequency $\omega/2\pi$ appears in the
motion of $\langle D(R)\rangle(t)$ [the harmonic frequencies evidently appear when one takes
into account the anharmonicity of the potential and terms of higher order in
expansion (9); their intensity is however much weaker].

This vibration of the electric dipole moment results in a coupling between the
molecule and the electromagnetic field; the molecule can consequently absorb
or emit radiation of frequency $\nu$. In terms of photons, the molecule can absorb
a photon of energy $h\nu$ and move from the state $|\varphi_v\rangle$ to the state $|\varphi_{v+1}\rangle$ (fig. 2-a)
or emit a photon $h\nu$ by going from $|\varphi_v\rangle$ to $|\varphi_{v-1}\rangle$ (fig. 2-b).

FIGURE 2

Absorption (fig. a) or emission (fig. b) of a photon of energy $h\nu$ by a heteropolar molecule going from
the vibrational state $|\varphi_v\rangle$ to the state $|\varphi_{v+1}\rangle$ or $|\varphi_{v-1}\rangle$.


β. The Raman effect

Now let us consider a homopolar molecule (consisting of two identical 
atoms). Because of symmetry, the permanent electric dipole moment is then zero 
for all r, and the molecule is “inactive” in the infrared. 

Imagine that an optical wave of frequency $\Omega/2\pi$ strikes this molecule. This
frequency, much higher than those considered previously, is able to excite the
electrons of the molecule; under the effect of the optical wave, the electrons will
undergo forced oscillation and re-emit radiation of the same frequency in all
directions. This is the well-known phenomenon of the molecular scattering of
light (Rayleigh scattering)*. What new phenomena are produced by the vibration
of the molecule?

What happens can be explained qualitatively in the following way. The
electronic susceptibility** of the molecule is generally a function of the distance r
between the two nuclei. When r varies (recall that this variation is slow compared
to the motion of the electrons), the amplitude of the induced electric dipole
moment, which vibrates at a frequency of $\Omega/2\pi$, varies. The time dependence
of the dipole moment is therefore that of a sinusoid of frequency $\Omega/2\pi$ whose
amplitude is modulated at the frequency of the molecular vibration $\omega/2\pi$, which
is much smaller (fig. 3).

FIGURE 3

The vibration of a molecule modulates the amplitude of the oscillating electric
dipole induced by an incident light wave.

* In complement A_XIII, we shall use quantum mechanics to study the forced motion of the
electrons of an atom under the effect of incident light waves.

** Under the effect of the field $E_0 e^{i\Omega t}$ of the incident optical wave, the electronic cloud of the
molecule acquires an induced dipole moment D given by:

$$D = \chi(\Omega)\, E_0\, e^{i\Omega t}$$

$\chi(\Omega)$ is, by definition, the electronic susceptibility of the molecule. The important point here is that $\chi$
depends on r.

The frequency distribution of the light emitted by the
molecule is given by the Fourier transform of the motion of the electric dipole
shown in figure 3. It is easy to see (fig. 4) that there exists a central line of
frequency $\Omega/2\pi$ (Rayleigh scattering) and two shifted lines, of frequency
$(\Omega - \omega)/2\pi$ (Raman-Stokes scattering) and frequency $(\Omega + \omega)/2\pi$
(Raman-anti-Stokes scattering).
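
The appearance of three lines can be illustrated with a small discrete Fourier transform. The sketch below is purely classical and uses made-up numbers: the carrier (standing for $\Omega/2\pi$) and the modulation (standing for $\omega/2\pi$) are placed on hypothetical frequency bins 32 and 5 of a 128-sample window, and the modulation depth 0.3 is arbitrary.

```python
import cmath
import math


def dft_magnitudes(samples):
    """Magnitudes |X_k| / N of the discrete Fourier transform, for k = 0 .. N/2."""
    n = len(samples)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n)
                for t, x in enumerate(samples))) / n
        for k in range(n // 2 + 1)
    ]


N = 128             # number of samples (hypothetical)
CARRIER_BIN = 32    # stands for the optical frequency Omega / 2 pi
VIBRATION_BIN = 5   # stands for the vibrational frequency omega / 2 pi
DEPTH = 0.3         # relative modulation of the dipole amplitude (arbitrary)

# Sinusoid at the carrier frequency whose amplitude is modulated by the vibration.
signal = [
    (1.0 + DEPTH * math.cos(2.0 * math.pi * VIBRATION_BIN * t / N))
    * math.cos(2.0 * math.pi * CARRIER_BIN * t / N)
    for t in range(N)
]

spectrum = dft_magnitudes(signal)

# Bins carrying a non-negligible amplitude: carrier and two sidebands.
lines = [k for k, amplitude in enumerate(spectrum) if amplitude > 1e-6]
print(lines)
```

The only bins with non-zero amplitude are the carrier bin and the two bins shifted by the modulation frequency, which is the classical counterpart of the Rayleigh, Raman-Stokes and Raman-anti-Stokes lines. This classical picture gives the two sidebands equal weight and says nothing about their actual relative intensities.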


FIGURE 4

Spectrum of the oscillations shown in figure 3.
In addition to the central line, whose frequency
is the same as that of the incident light wave
(Rayleigh line), two shifted lines appear
(the Raman-Stokes and Raman-anti-Stokes
lines). The frequency shift is equal to the
vibrational frequency of the molecule.


It is very simple to interpret these lines in terms of photons. Consider an
optical photon of energy $\hbar\Omega$ which strikes the molecule when it is in the state
$|\varphi_v\rangle$ (fig. 5-a). If the molecule does not change vibrational states during the
scattering process, the scattering is elastic. Because of conservation of energy, the
scattered photon has the same energy as the incident photon (fig. 5-b; Rayleigh line).
However, the molecule, during the scattering process, can make a transition from
the state $|\varphi_v\rangle$ to the state $|\varphi_{v+1}\rangle$. The molecule acquires an energy $\hbar\omega$ at the
expense of the scattered photon, whose energy therefore is $\hbar(\Omega - \omega)$ (fig. 5-c):
the scattering is inelastic (Raman-Stokes line). Finally, the molecule may move
from the state $|\varphi_v\rangle$ to the state $|\varphi_{v-1}\rangle$, in which case the scattered photon
will have an energy of $\hbar(\Omega + \omega)$ (fig. 5-d; Raman-anti-Stokes line).

FIGURE 5

Schematic representation of the scattering of a photon of energy $\hbar\Omega$ by a molecule which is initially in
the vibrational state $|\varphi_v\rangle$ (fig. a): Rayleigh scattering without a change in the vibrational state
(fig. b); Raman-Stokes or Raman-anti-Stokes scattering with a change in the molecule's state from
$|\varphi_v\rangle$ to $|\varphi_{v+1}\rangle$ (fig. c) or to $|\varphi_{v-1}\rangle$ (fig. d).

comments:

(i) The Raman effect can also be observed with heteropolar molecules.

(ii) The Raman effect has recently enjoyed a revival of interest because of the development
of lasers. If, in the cavity of a laser oscillating at a frequency of $\Omega/2\pi$, one places a cell
filled with a substance which exhibits the Raman effect, one can, in certain cases, obtain
an amplification (stimulated Raman effect) and hence a laser oscillation at the
frequency $(\Omega - \omega)/2\pi$, where $\omega/2\pi$ is the vibrational frequency of the molecules in the cell
(Raman laser). Thus, by varying this substance, one can vary the oscillation frequency
of the laser.

(iii) The study of Raman and infrared spectra of molecules is useful in chemistry because it
permits the identification of the various bonds which exist in a complex molecule.
For example, the vibration frequency of a group of two carbon atoms depends on
whether the bond between them is single, double or triple.


2. Vibration of the nuclei in a crystal 


a. THE EINSTEIN MODEL 


A crystal consists of a system of atoms (or ions) which are regularly
distributed in space, forming a periodic lattice. For simplicity, let us choose
a one-dimensional model in which we consider a linear chain of atoms. The average
position of the nucleus of the qth atom is:

$$x_q^0 = q d \tag{10}$$

where d is the distance between adjacent atoms (on the order of a few Å).

Let $U(x_1, x_2, \dots, x_q, \dots)$ be the total potential energy of the crystal nuclei,
which depends on their positions $x_1, x_2, \dots, x_q, \dots$ If $x_q - x_q^0$ is not too large, that is,
if each nucleus is not too far from its equilibrium position, $U(x_1, x_2, \dots, x_q, \dots)$
has, in certain cases, the following simple form:

$$U(x_1, x_2, \dots, x_q, \dots) = U_0 + \sum_q \frac{1}{2}\,(x_q - x_q^0)^2\, U_0'' + \dots \tag{11}$$

where $U_0$ and $U_0''$ are real constants (with $U_0'' > 0$). The absence of terms
linear in $x_q - x_q^0$ shows that $x_q^0$ is a stable equilibrium position for the nucleus (q)
(a minimum of U). We add to (11) the total kinetic energy:

$$T = \sum_q \frac{p_q^2}{2m} \tag{12}$$

where $p_q$ is the momentum of the nucleus (q) of mass m. The total Hamiltonian H
of the system is, to within the constant $U_0$, a sum of Hamiltonians of
one-dimensional harmonic oscillators centered at each nucleus (q):

$$H = U_0 + \sum_q \left[\frac{p_q^2}{2m} + \frac{1}{2}\,U_0''\,(x_q - x_q^0)^2\right] \tag{13}$$

Consequently, in this simplified model, each nucleus vibrates about its
equilibrium position independently of its neighbors, with an angular frequency:

$$\omega = \sqrt{\frac{U_0''}{m}} \tag{14}$$

As in the case of the diatomic molecule, $\omega$ increases when m decreases and when
the curvature of the potential attracting the nucleus towards its equilibrium
position increases.


COMMENT: 

In the simple model which we have just presented, each nucleus vibrates
independently of the others. This is because the proposed potential U does not contain any terms
which are simultaneously dependent on more than one of the variables $x_q$, as it would
if it described internuclear interactions. This model is not realistic since such
interactions do, in fact, exist. In complement J_V, we shall present a more elaborate model
which takes into account the coupling between each nucleus and its two nearest neighbors.
We shall see that it is still possible, in this model, to put the total Hamiltonian of the
system in the form of a sum of Hamiltonians of independent harmonic oscillators.


b. THE QUANTUM MECHANICAL NATURE OF CRYSTALLINE VIBRATIONS

Despite its very schematic character, the Einstein model enables us to 
understand a certain number of phenomena related to the quantum mechanical 
nature of crystalline vibrations. The low temperature behavior of the constant 
volume specific heat, which cannot be explained using classical mechanics, will 
be described in complement L_V in connection with the study of the properties
of a harmonic oscillator in thermodynamic equilibrium. Here, we shall discuss 
a spectacular effect related to the finite spread of the wave functions associated 
with the position of each atom in the ground state. 

At absolute zero, under a pressure of one atmosphere, all substances 
except helium are solids. To solidify helium, it is necessary to apply a pressure 
of at least 25 atmospheres. Can this peculiarity be explained qualitatively? 

First let us try to understand the phenomenon of the melting of an ordinary
substance. At absolute zero, the atoms are practically localized at their equilibrium
positions; the spread of their wave functions about the $x_q^0$ is given by [cf. formula
(D-5-a) of chapter V]:

$$\Delta X \simeq \sqrt{\frac{\hbar}{2m\omega}} = \left[\frac{\hbar^2}{4\,m\,U_0''}\right]^{1/4} \tag{15}$$

[where we have used expression (14) for $\omega$]. $\Delta X$ is, in general, very small. When
the crystal is heated, the nuclei move into higher and higher vibrational states:
in classical language, they vibrate with a larger and larger amplitude; in quantum
mechanical language, the spread of their wave functions increases [with the square
root of the vibrational quantum number; see formula (D-5-a) of chapter V].
When this spread is no longer negligible with respect to the interatomic distance d,
the crystal melts (see §4-c of complement L_V, in which this phenomenon is
studied more quantitatively).

It is impossible to solidify helium at ordinary pressures. This corresponds to
the fact that, even at absolute zero, the spread of the wave function given by (15)
is not negligible compared to d. This results from the fact that the mass of
helium is very small and its chemical affinity very weak (the curvature $U_0''$ of the
potential in the neighborhood of each minimum is very small, since the potential
wells are very shallow). The effect of both of these factors, in formula (15), is the
same: a large spread $\Delta X$. An increase in the pressure results in an increase in $U_0''$
and therefore in $\omega$; consequently, $\Delta X$ decreases. This is due to the fact that,
at high pressures, each helium atom is "wedged" between its neighbors:
the smaller the average distance between these neighbors (the higher the pressure),
the sharper the potential minimum (the greater $U_0''$). Thus we see how an increase
in the pressure makes the solidification of helium possible.
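
The dependence of the ground-state spread on the mass and on the curvature can be made explicit with a short sketch. It assumes the spread is $\sqrt{\hbar/2m\omega}$ with $\omega = \sqrt{U_0''/m}$ as in (14); the function names and the numerical inputs (a mass close to that of a helium atom and a curvature of 1 N/m) are hypothetical and serve only to show the scaling.

```python
import math

HBAR = 1.054571817e-34  # reduced Planck constant, J s (current SI value, assumed)


def angular_frequency(mass, curvature):
    """omega = sqrt(U0'' / m), as in formula (14)."""
    return math.sqrt(curvature / mass)


def ground_state_spread(mass, curvature):
    """Spread sqrt(hbar / (2 m omega)) of the ground-state wave function."""
    return math.sqrt(HBAR / (2.0 * mass * angular_frequency(mass, curvature)))


def ground_state_spread_closed_form(mass, curvature):
    """Same spread written as [hbar^2 / (4 m U0'')]^(1/4)."""
    return (HBAR**2 / (4.0 * mass * curvature)) ** 0.25


# Hypothetical inputs, chosen only to illustrate the scaling.
mass = 6.6e-27   # kg, roughly the mass of a helium atom
curvature = 1.0  # N/m, arbitrary

spread = ground_state_spread(mass, curvature)
stiffer = ground_state_spread(mass, 16.0 * curvature)
heavier = ground_state_spread(16.0 * mass, curvature)
print(spread, stiffer / spread, heavier / spread)
```

Because the spread varies as the inverse fourth root of the product $m\,U_0''$, multiplying either the mass or the curvature by 16 only halves it. A light atom in a shallow well therefore keeps a comparatively large spread, and a substantial increase in curvature (that is, in pressure) is needed to reduce it.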


3. Torsional oscillations of a molecule: ethylene 

a. STRUCTURE OF THE ETHYLENE MOLECULE C$_2$H$_4$

The structure of the molecule C$_2$H$_4$ is well-known: the six atoms of the
molecule are in the same plane (fig. 6) and the angles between the various C-H
and C-C bonds are close to 120°.



FIGURE 6 

Plane structure of the ethylene molecule. 


Now imagine that, without changing the relative positions of the bonds
of each carbon atom, we rotate one of the CH$_2$ groups, about the C-C axis,
through an angle $\alpha$ with respect to the other one. Figure 7 represents the
molecule as seen along the C-C axis: the C-H bonds of one CH$_2$ group are
shown in solid lines and those of the other one, in dashed lines. How does the
potential energy $V(\alpha)$ of the molecule vary with respect to $\alpha$?

FIGURE 7

Torsion of the ethylene molecule (seen along the C-C
axis): one of the CH$_2$ groups has rotated with respect
to the other one through an angle $\alpha$ about the C-C axis.

Since the stable structure of the molecule is planar, the angle $\alpha = 0$ must
correspond to a minimum of $V(\alpha)$. It is also clear that $\alpha = \pi$ corresponds to
another minimum of $V(\alpha)$, since the two structures associated with $\alpha = 0$ and
$\alpha = \pi$ are indistinguishable. $V(\alpha)$ therefore has the form shown in figure 8
[$\alpha$ varies from $-\pi/2$ to $3\pi/2$; $V(0)$ is chosen as the energy origin].



FIGURE 8

The potential energy of the molecule depends on the torsion angle $\alpha$;
$V(\alpha)$ is minimal for $\alpha = 0$ and $\alpha = \pi$ (planar structures).

The two stable positions $\alpha = 0$ and $\alpha = \pi$ are separated by a potential
barrier of height $V_0$. The potential of figure 8 is often approximated by the
simple formula:

$$V(\alpha) = \frac{V_0}{2}\,(1 - \cos 2\alpha) \tag{16}$$

COMMENT : 

Quantum mechanics enables us to interpret all the features of the C$_2$H$_4$ molecule
which we have just described. In this molecule, each carbon atom has four valence
electrons. Three of these electrons ($\sigma$ electrons) are found to have wave functions which
are symmetrical about three coplanar lines making angles of 120° with each other,
and defining the directions of the chemical bonds (fig. 6). These wave functions overlap
those of the electrons of the neighboring atoms to a considerable extent, and it is this
overlap that ensures the stability of the C-H bonds and of part of the C-C bond
(this phenomenon is called "sp$^2$ hybridization" and will be studied in greater detail in
complement E_VII). The last valence electron of each carbon atom ($\pi$ electron) has a wave
function which is symmetrical about a line passing through C and perpendicular to the
plane defined by C and its three neighbors. The overlap of the wave functions of the
two $\pi$ electrons is maximum and, consequently, the chemical stability of the double
bond is greatest when the two lines associated with the $\pi$ electrons are parallel, that is,
when the six atoms of the molecule are in the same plane. The structure of figure 6 is
thus entirely explained.

Since $V(\alpha)$ can be approximated by a parabola in the neighborhood of its
two minima, the molecule performs, about its two stable equilibrium positions,
torsional oscillations which we shall now examine. First, we shall review rapidly
the corresponding classical equations.

b. CLASSICAL EQUATIONS OF MOTION 

We denote by $\alpha_1$ and $\alpha_2$ the angles formed by the planes of the two CH$_2$ groups
with a fixed plane passing through the C-C axis (fig. 9). The angle $\alpha$ in figure 7 is
obviously:

$$\alpha = \alpha_1 - \alpha_2 \tag{17}$$

FIGURE 9

To write the classical equations of motion, we denote
by $\alpha_1$ and $\alpha_2$ the angles formed by the planes of the two
CH$_2$ groups with a fixed plane.

Let I be the moment of inertia of one of the CH$_2$ groups with respect to the
C-C axis. Since the potential energy depends only on $\alpha = \alpha_1 - \alpha_2$, the dynamical
equations describing the rotation of each group are written:

$$I\,\frac{d^2\alpha_1}{dt^2} = -\frac{\partial}{\partial\alpha_1}\,V(\alpha_1 - \alpha_2) = -\frac{d}{d\alpha}\,V(\alpha)$$

$$I\,\frac{d^2\alpha_2}{dt^2} = -\frac{\partial}{\partial\alpha_2}\,V(\alpha_1 - \alpha_2) = +\frac{d}{d\alpha}\,V(\alpha) \tag{18}$$

Adding and subtracting these two equations we obtain:

$$\frac{d^2}{dt^2}\,(\alpha_1 + \alpha_2) = 0 \tag{19-a}$$

$$I\,\frac{d^2\alpha}{dt^2} = -2\,\frac{d}{d\alpha}\,V(\alpha) \tag{19-b}$$

Equation (19-a) indicates that the entire molecule can rotate freely about
the C-C axis independently of the torsional motion: the angle $(\alpha_1 + \alpha_2)/2$ of
the plane bisecting the planes of the two CH$_2$ groups is a linear function of
time. Equation (19-b) describes the torsional motion (rotation of one group with
respect to the other). Let us consider this motion in the immediate neighborhood
of one of the stable equilibrium positions, $\alpha = 0$. Expanding expression (16) in the
neighborhood of $\alpha = 0$:

$$V(\alpha) \simeq V_0\,\alpha^2 \tag{20}$$

Substituting (20) into (19-b), we obtain:

$$\frac{d^2\alpha}{dt^2} + \frac{4V_0}{I}\,\alpha = 0 \tag{21}$$

We recognize (21) as the equation of a one-dimensional harmonic oscillator
($\alpha$ is the only variable) of angular frequency:

$$\omega_t = 2\sqrt{\frac{V_0}{I}} \tag{22}$$

For the C$_2$H$_4$ molecule, $\omega_t$ is of the order of 825 cm$^{-1}$.
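
The small-angle step can be checked numerically. The sketch below assumes the model potential (16) in the form $V(\alpha) = (V_0/2)(1 - \cos 2\alpha)$ and the equation of motion (19-b), $I\,d^2\alpha/dt^2 = -2\,dV/d\alpha$; the function names and the unit values of $V_0$ and $I$ are illustrative only.

```python
import math


def torsion_potential(alpha, v0):
    """Model potential (16): V(alpha) = (V0 / 2) * (1 - cos(2 alpha))."""
    return 0.5 * v0 * (1.0 - math.cos(2.0 * alpha))


def harmonic_approx(alpha, v0):
    """Small-angle expansion of (16) about alpha = 0: V ~ V0 * alpha^2."""
    return v0 * alpha**2


def curvature_at_minimum(v0, alpha0=0.0, h=1.0e-3):
    """Finite-difference estimate of V''(alpha0)."""
    return (torsion_potential(alpha0 + h, v0)
            - 2.0 * torsion_potential(alpha0, v0)
            + torsion_potential(alpha0 - h, v0)) / h**2


def torsion_angular_frequency(v0, inertia):
    """Angular frequency 2 * sqrt(V0 / I) of the small torsional oscillations."""
    return 2.0 * math.sqrt(v0 / inertia)


# Illustrative units: V0 = 1, I = 1.
v0, inertia = 1.0, 1.0
# From (19-b), linearized: I * alpha'' = -2 * V''(0) * alpha, so omega^2 = 2 V''(0) / I.
omega_from_curvature = math.sqrt(2.0 * curvature_at_minimum(v0) / inertia)
print(omega_from_curvature, torsion_angular_frequency(v0, inertia))
```

The same estimate made at `alpha0 = math.pi` gives the same curvature, consistent with the two wells being equivalent.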


c. QUANTUM MECHANICAL TREATMENT 

In the neighborhood of its two equilibrium positions $\alpha = 0$ and $\alpha = \pi$, the
molecule possesses "torsional states" of quantized energy $E_n = (n + 1/2)\hbar\omega_t$,
with n = 0, 1, 2, ... In a first approximation, each energy level $E_n = (n + 1/2)\hbar\omega_t$
is therefore doubly degenerate, since for each one there are two states $|\varphi_n\rangle$ and
$|\varphi_n'\rangle$ whose wave functions $\varphi_n(\alpha)$ and $\varphi_n'(\alpha)$ differ only in that one is centered at
$\alpha = 0$ and the other, at $\alpha = \pi$ (fig. 10-a and 10-b).

In fact, we must also take into account a typically quantum mechanical effect:
the tunnel effect across the potential barrier separating the two minima (fig. 8).



FIGURE 10

When one neglects the tunnel effect across the potential barriers at
$\alpha = \pi/2$ and $\alpha = 3\pi/2$, one can find torsional states of the molecule
localized in the wells centered at $\alpha = 0$ (fig. a) and $\alpha = \pi$ (fig. b).

We have already encountered a situation of this type, in complement G_IV, in
connection with the inversion of the NH$_3$ molecule. Calculations analogous to
those in that complement would enable us to show here that the degeneracy
between the two states $|\varphi_n\rangle$ and $|\varphi_n'\rangle$ is removed by the tunnel effect. Thus, for
each value of n, two stationary states, $|\psi_n^+\rangle$ and $|\psi_n^-\rangle$, appear (to a first
approximation, they are symmetrical and antisymmetrical linear combinations of
$|\varphi_n\rangle$ and $|\varphi_n'\rangle$). The larger n (that is, the closer the initial energy $E_n$ is to $V_0$,
and hence, the more important the tunnel effect), the greater their energy
difference $\hbar\delta_n$. However, $\hbar\delta_n$ is always much smaller than the distance $\hbar\omega_t$ between
adjacent levels n and n ± 1 (fig. 11).

FIGURE 11

The tunnel effect removes the degeneracy of the energy levels
which correspond to figure 10. As one approaches the top of the
barrier, this phenomenon becomes more important ($\delta_1 > \delta_0$).
$|\psi_0^+\rangle$, $|\psi_0^-\rangle$, $|\psi_1^+\rangle$, $|\psi_1^-\rangle$ are the new stationary states.

For the mean value of the angle $\alpha$, quantum mechanics therefore predicts
the following motion: rapid oscillations of frequency $\omega_t/2\pi$ about one of the two
values $\alpha = 0$ and $\alpha = \pi$, upon which are superposed much slower oscillations
between $\alpha = 0$ and $\alpha = \pi$, at the Bohr frequencies $\delta_0/2\pi$, $\delta_1/2\pi$, $\delta_2/2\pi$, ...

COMMENT: 

States of course exist for which the energy is greater than the maximum height $V_0$
of the potential barrier of figure 8. These states correspond to a rotational kinetic
energy which is large enough for one of the CH$_2$ groups to be considered as rotating
almost freely with respect to the other one (while being, nevertheless, periodically slowed
down and accelerated by the potential of figure 8).

The ethane molecule C$_2$H$_6$ behaves in this way. The absence of $\pi$ electrons in this
molecule permits a much freer rotation of one of the CH$_3$ groups with respect to the other
(the potential barrier $V_0$ is much lower). In this case, the potential $V(\alpha)$, which tends
to oppose the free rotation of one of the CH$_3$ groups with respect to the other, has
a period of $2\pi/3$ because of symmetry.


4. Heavy muonic atoms 

The muon $\mu^-$ (sometimes called, for historical reasons, the "$\mu$ meson")
is a particle which has the same properties as the electron except that its mass
is 207 times greater*. In particular, it is not sensitive to strong interactions, and
its coupling with nuclei is essentially electromagnetic. A muon $\mu^-$ which has been
slowed down in matter can be attracted by the Coulomb field of an atomic nucleus
and can form a bound state with the nucleus. The system thus constituted is
called a muonic atom.

* The muon is unstable: it decays into an electron and two neutrinos.


a. COMPARISON WITH THE HYDROGEN ATOM 


In chapter VII (§C), we shall study the bound states of two particles of 
opposite charge and, in particular, those of the hydrogen atom. We shall see that 
the results of quantum mechanics concerning the energies of bound states are 
the same as those of the Bohr model (chap. VII, §C-2). Similarly, the spread of 
the wave functions which describe these bound states is of the order of the Bohr 
orbital radius. Let us therefore begin by using this simple model to calculate the 
energies and spreads of the first bound states of a muon in the Coulomb field 
of a heavy atom such as lead (Z = 82, A = 207). 

If we consider the nucleus to be infinitely heavy, the nth Bohr orbital has 
an energy of: 


$$E_n = -\frac{Z^2 m e^4}{2\hbar^2}\,\frac{1}{n^2} \tag{23}$$


where $Z$ is the atomic number of the nucleus, $e^2 = q^2/4\pi\varepsilon_0$ (where $q$ is the
electron charge), and $m$ represents the mass of the electron or of the muon, depending
on the case. When one goes from hydrogen to the muonic atom under study
here, $E_n$ is multiplied by a factor of $Z^2 m_\mu/m_e = (82)^2 \times (207) = 1.4 \times 10^6$. From
this we deduce that, for the muonic atom:


$$E_1 = -19 \text{ MeV}, \qquad E_2 = -4.7 \text{ MeV} \tag{24}$$


The radius of the nth Bohr orbital is given by : 

$$r_n = \frac{n^2 \hbar^2}{Z m e^2} \tag{25}$$


For hydrogen, $r_1 \simeq 0.5$ Å. Here, this number must be divided by $Z m_\mu/m_e$, which
gives:

$$r_1 = 3 \times 10^{-13} \text{ cm}, \qquad r_2 = 12 \times 10^{-13} \text{ cm} \tag{26}$$


In the preceding calculations, we have implicitly assumed the nucleus to be
pointlike (in the Bohr model and in the theory presented in chapter VII, §C,
the potential energy is taken equal to $-Ze^2/r$). The small values found for $r_1$
and $r_2$ [formulas (26)] show us that this viewpoint is not at all valid for a
heavy muonic atom. The lead nucleus has a non-negligible radius $\rho_0$, on the order
of $8.5 \times 10^{-13}$ cm (recall that the radius of a nucleus increases with $A^{1/3}$). The
preceding qualitative calculation therefore leaves us with the impression that the
spread of the wave functions of the muon may be smaller than the nucleus*. Consequently,
we must reconsider the problem completely and first calculate the potential
"seen" by the muon on the inside as well as on the outside of the nuclear charge
distribution.
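
The orders of magnitude in formulas (24) and (26) can be reproduced with a few lines of arithmetic. The following is a minimal sketch, assuming the rounded hydrogen values 13.6 eV and 0.53 Å; the function and constant names are illustrative choices, not notation from the text.

```python
# Bohr-model estimates for a muon bound to a pointlike lead nucleus.
# The hydrogen inputs below are rounded values, assumed for illustration.
RYDBERG_EV = 13.6             # hydrogen ionization energy, in eV
BOHR_RADIUS_CM = 0.53e-8      # hydrogen Bohr radius, in cm
Z = 82                        # atomic number of lead
MASS_RATIO = 207              # m_mu / m_e
NUCLEAR_RADIUS_CM = 8.5e-13   # radius rho_0 of the lead nucleus, in cm


def bohr_energy_mev(n):
    """Energy of the nth Bohr orbit, formula (23), in MeV."""
    return -RYDBERG_EV * Z**2 * MASS_RATIO / n**2 / 1.0e6


def bohr_radius_cm(n):
    """Radius of the nth Bohr orbit, formula (25), in cm."""
    return BOHR_RADIUS_CM * n**2 / (Z * MASS_RATIO)


for n in (1, 2):
    inside = bohr_radius_cm(n) < NUCLEAR_RADIUS_CM
    print(n, bohr_energy_mev(n), bohr_radius_cm(n), "inside nucleus:", inside)
```

With these inputs the first Bohr orbit falls inside the nuclear radius, which is the signal that the pointlike-nucleus assumption fails.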


b. THE HEAVY MUONIC ATOM TREATED AS A HARMONIC OSCILLATOR 


We shall use a rough model of the lead nucleus: we shall assume its
charge to be evenly distributed throughout a sphere of radius $\rho_0 = 8.5 \times 10^{-13}$ cm.

When the distance $r$ of the muon from the center of this sphere is greater
than $\rho_0$, its potential energy is given by:

$$V(r) = -\frac{Ze^2}{r} \quad \text{for } r > \rho_0 \tag{27}$$

For $r < \rho_0$, one can calculate the electrostatic force acting on the muon,
using Gauss' theorem; it is directed towards the center of the sphere and its
absolute value is:

$$Ze^2 \left(\frac{r}{\rho_0}\right)^3 \frac{1}{r^2} = \frac{Ze^2}{\rho_0^3}\, r \tag{28}$$

This force is derived from the potential energy : 

$$V(r) = \frac{1}{2}\,\frac{Ze^2}{\rho_0^3}\, r^2 + C \quad \text{for } r \leq \rho_0 \tag{29}$$

The constant $C$ is determined by the condition that expressions (27) and (29)
be identical for $r = \rho_0$:

$$C = -\frac{3}{2}\,\frac{Ze^2}{\rho_0} \tag{30}$$

Figure 12 represents the potential energy of the muon, plotted with respect to r. 
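
The piecewise potential of formulas (27), (29) and (30) can be written as a short function, which makes the matching at $r = \rho_0$ easy to check. This is a sketch only: the function name and the choice of units in which $Ze^2 = 1$ and $\rho_0 = 1$ by default are assumptions made for illustration.

```python
def muon_potential(r, ze2=1.0, rho0=1.0):
    """Potential energy of the muon for a uniformly charged nucleus.

    ze2 stands for the product Z e^2 and rho0 for the nuclear radius;
    both default to 1, which is an arbitrary choice of units.
    """
    if r > rho0:
        return -ze2 / r                          # Coulomb part, formula (27)
    constant = -1.5 * ze2 / rho0                 # constant C, formula (30)
    return 0.5 * ze2 * r**2 / rho0**3 + constant  # parabolic part, formula (29)


# Bottom of the well, value at the nuclear surface, and a point outside.
print(muon_potential(0.0), muon_potential(1.0), muon_potential(2.0))
```

The value at the center is $-3Ze^2/2\rho_0$, so the well is one and a half times deeper than the Coulomb potential at the nuclear surface.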



FIGURE 12

Form of the potential $V(r)$ seen by a muon attracted by a nucleus of
radius $\rho_0$ situated at $r = 0$. When $r < \rho_0$, the variation of the potential is
parabolic (if the charge density of the nucleus is uniform); when $r > \rho_0$,
$V(r)$ varies like $1/r$ (Coulomb's law).


* For hydrogen, the spread of the wave functions, on the order of an Angstrom, is about $10^5$ times
larger than the dimensions of the proton, which can therefore be treated like a point. The new situation
encountered here results from several factors which reinforce each other: increased $m$ and increased $Z$,
which results in a greater electrostatic force and a larger nuclear radius.
Inside the nucleus, the potential is parabolic. The orders of magnitude we
calculated in §a indicate that it would not be realistic to choose a pure Coulomb
potential for the ground state of the muonic lead atom since the wave function
is actually concentrated in the region where the potential is parabolic. It is
therefore certainly preferable to consider the muon to be "elastically bound" to the
nucleus in this case. We then have a three-dimensional harmonic oscillator (complement $E_V$)
whose angular frequency is:

$$\omega = \sqrt{\frac{Ze^2}{m_\mu \rho_0^3}} \tag{31}$$


In fact, we shall see that the wave function of the ground state of this harmonic 
oscillator is not zero outside the nucleus, so the harmonic approximation is not 
perfect either. 


COMMENT:

It is interesting that the physical system studied here presents many analogies 
with the first atomic model, proposed by J. J. Thomson. This physicist assumed the 
positive charge of the atom to be distributed in a sphere whose radius was of the order 
of a few Angstroms, with the electrons moving in the parabolic potential existing inside 
this charge distribution (model of the elastically bound electron). We know from 
Rutherford’s experiments that the nucleus is much smaller and that such a model does 
not correspond to reality for atoms. 


c. ORDER OF MAGNITUDE OF THE ENERGIES 
AND SPREAD OF THE WAVE FUNCTIONS 

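If we substitute into expression (31) the numerical values:

$$Z = 82 \qquad \frac{e^2}{\hbar c} \simeq \frac{1}{137} \qquad c \simeq 3 \times 10^{8} \text{ m/sec}$$

$$m_\mu = 207\, m_e \simeq 1.89 \times 10^{-28} \text{ kg} \qquad \hbar \simeq 1.05 \times 10^{-34} \text{ Joule.sec} \qquad \rho_0 = 8.5 \times 10^{-15} \text{ m}$$

we find:

$$\omega \simeq 1.3 \times 10^{22} \text{ rad.sec}^{-1} \tag{32}$$

which corresponds to an energy $\hbar\omega$ on the order of:

$$\hbar\omega \simeq 8.4 \text{ MeV} \tag{33}$$

We can compare $\hbar\omega$ to the total depth of the well, $\frac{3}{2}\frac{Ze^2}{\rho_0}$, which is equal to:

$$\frac{3}{2}\,\frac{Ze^2}{\rho_0} \simeq 21 \text{ MeV} \tag{34}$$

We see that $\hbar\omega$ is smaller than this depth, but not small enough for us to be able
to neglect completely the non-parabolic part of $V(r)$.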
Similarly, the spread of the ground state, if the well were perfectly parabolic,
would be on the order of:

$$\Delta r \simeq \sqrt{\frac{\hbar}{2 m_\mu \omega}} \simeq 4.7 \times 10^{-13} \text{ cm} \tag{35}$$

The qualitative predictions of §4-a are therefore confirmed : a large part of 
the wave function of the muon is inside the nucleus. Nevertheless, what happens 
outside cannot be completely neglected. 
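
The numerical values (32) to (35) follow from formula (31) once $Ze^2$ is written as $Z\alpha\hbar c$, with $\alpha = e^2/\hbar c$ the fine structure constant. The sketch below redoes this arithmetic in SI units; the constant values are standard rounded figures and the variable names are illustrative assumptions.

```python
import math

# Rounded physical constants in SI units (assumed values for this estimate).
HBAR = 1.0546e-34           # J.s
LIGHT_SPEED = 2.998e8       # m/s
FINE_STRUCTURE = 1.0 / 137.036
ELECTRON_MASS = 9.1095e-31  # kg
ELECTRON_VOLT = 1.6022e-19  # J

Z = 82
muon_mass = 207 * ELECTRON_MASS
rho0 = 8.5e-15              # nuclear radius, in m

ze2 = Z * FINE_STRUCTURE * HBAR * LIGHT_SPEED           # Z e^2, in J.m
omega = math.sqrt(ze2 / (muon_mass * rho0**3))          # formula (31)
hbar_omega_mev = HBAR * omega / ELECTRON_VOLT / 1.0e6   # compare with (33)
well_depth_mev = 1.5 * ze2 / rho0 / ELECTRON_VOLT / 1.0e6  # compare with (34)
spread_cm = 100.0 * math.sqrt(HBAR / (2.0 * muon_mass * omega))  # compare with (35)

print(omega, hbar_omega_mev, well_depth_mev, spread_cm)
```

The spread comes out at roughly half the nuclear radius, in line with the statement that a large part of the wave function, but not all of it, lies inside the nucleus.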

The exact calculation of the energies and the wave functions is therefore 
more complicated than it would be for a simple harmonic oscillator. The Schrödinger
equation corresponding to the potential of figure 12 must be solved (taking into 
account, in addition, spin, relativistic corrections, etc...). Such a calculation is 
important: the study of the energy of photons emitted by a heavy muonic atom 
contributes information about the structure of the nucleus, for example concerning 
the real charge distribution inside the nuclear volume. 

COMMENT:

In the case of ordinary atoms (with an electron instead of a muon), it is valid to
neglect the effects of the deviation of the potential from the $-Ze^2/r$ law. However, one
can take this deviation into account by using perturbation theory (cf. chap. XI). In
complement $D_{XI}$, we shall study this "volume effect" of the nucleus on the atomic energy
levels.


References and suggestions for further reading : 


Molecular vibrations: Karplus and Porter (12.1), chap. 7; Pauling and Wilson (1.9),
chap. X; Herzberg (12.4), Vol. I, chap. III, §1; Landau and Lifshitz (1.19), chaps. XI
and XIII.

Stimulated Raman effect: Baldwin (15.19), §5.2; see also Schawlow's article
(15.17).

Torsion oscillations: Herzberg (12.4), Vol. II, chap. II, §5d; Kondratiev (11.6),
§37.

The Einstein model: Kittel (13.2), chap. 6; Seitz (13.4), chap. III; Ziman (13.3),
chap. 2; see also Bertman and Guyer's article (13.20).

Muonic atoms: Cagnac and Pebay-Peyroula (11.2), §XIX-7; Weissenberg (16.19),
§4-2; see also De Benedetti's article (11.21).
Complement B_V

STUDY OF THE STATIONARY STATES
IN THE { | x > } REPRESENTATION. HERMITE POLYNOMIALS


1. Hermite polynomials 

a. Definition and simple properties 

b. Generating function 

c. Recurrence relations; differential equation 

d. Examples 

2. Eigenfunctions of the harmonic oscillator Hamiltonian 

a. Generating function 

b. $\varphi_n(x)$ in terms of Hermite polynomials

c. Recurrence relations 


We now intend to study, in a little more detail than in §C-2 of chapter V, the
wave functions $\varphi_n(x) = \langle x | \varphi_n \rangle$ associated with the stationary states $|\varphi_n\rangle$ of the
harmonic oscillator. Before undertaking this study, we shall define the Hermite
polynomials and mention their principal properties.

1. Hermite polynomials 

a. DEFINITION AND SIMPLE PROPERTIES 

Consider the Gaussian function:

$$F(z) = e^{-z^2} \tag{1}$$

represented by the bell-shaped curve in figure 1. The successive derivatives of $F$ are
given by:

$$F'(z) = -2z\, e^{-z^2} \tag{2}$$

$$F''(z) = (4z^2 - 2)\, e^{-z^2} \tag{3}$$


FIGURE 1

Shape of the Gaussian function $F(z)$ and of its first and second derivatives $F'(z)$
and $F''(z)$.
The nth-order derivative, $F^{(n)}(z)$, can be written:

$$F^{(n)}(z) = (-1)^n H_n(z)\, e^{-z^2} \tag{4}$$

where $H_n(z)$ is an nth-degree polynomial in $z$. The proof is by recurrence. This relation
is valid for $n = 1, 2$ [cf. equations (2) and (3)]. Assume it is true for $n - 1$:

$$F^{(n-1)}(z) = (-1)^{n-1} H_{n-1}(z)\, e^{-z^2} \tag{5}$$

where $H_{n-1}(z)$ is a polynomial of degree $n - 1$. We then obtain relation (4)
directly by differentiation, if we set:

$$H_n(z) = \left(2z - \frac{d}{dz}\right) H_{n-1}(z) \tag{6}$$

Since $H_{n-1}(z)$ is a polynomial of degree $n - 1$ in $z$, we see from this last relation
that $H_n(z)$ is indeed an nth-degree polynomial. The polynomial $H_n(z)$ is called
the nth-degree Hermite polynomial. Its definition is therefore:

$$H_n(z) = (-1)^n\, e^{z^2} \frac{d^n}{dz^n}\, e^{-z^2} \tag{7}$$

We see from (2) and (3) that $H_1(z) = 2z$ and $H_2(z) = 4z^2 - 2$ are, respectively, odd and even.
Moreover, relation (6) shows that if $H_{n-1}(z)$ has a definite parity, $H_n(z)$ has the
opposite parity. From this, we deduce that the parity of $H_n(z)$ is $(-1)^n$.

The zeros of $H_n(z)$ correspond to those of the nth-order derivative of the
function $F(z)$. We are going to show that $H_n(z)$ has $n$ real zeros, between which one
finds those of $H_{n-1}$. It can be seen from figure 1 and from relations (1), (2) and (3)
that this is true for $n = 0, 1, 2$. Arguing by recurrence, we can generalize this
result: assume that $H_{n-1}(z)$ has $n - 1$ real zeros; if $z_1$ and $z_2$ are two consecutive
zeros of $H_{n-1}(z)$ and therefore of $F^{(n-1)}(z)$, Rolle's theorem shows that the
derivative $F^{(n)}(z)$ of $F^{(n-1)}(z)$ goes to zero at a point $z_3$ between $z_1$ and $z_2$; therefore,
$H_n(z_3) = 0$. Since, in addition, $F^{(n-1)}(z)$ goes to zero when $z \to -\infty$ and
when $z \to +\infty$, $F^{(n)}(z)$ and $H_n(z)$ have at least $n$ real zeros [and not more,
because $H_n(z)$ is nth-degree] between which are interposed those of $H_{n-1}(z)$.
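
Relation (6) translates directly into an algorithm on coefficient lists, which gives a quick check of the parity and zero-counting statements for the first few polynomials. The sketch below is illustrative only: the list representation (lowest degree first), the function names and the grid used to count sign changes are assumptions, and the zero count relies on all zeros being simple and lying inside the scanned interval.

```python
def hermite_coeffs(n):
    """Coefficients [c0, c1, ..., cn] of H_n(z), built from H_0 = 1 with relation (6)."""
    h = [1]
    for _ in range(n):
        two_z_h = [0] + [2 * c for c in h]                   # 2z * H_{k-1}
        dh = [k * c for k, c in enumerate(h)][1:] + [0, 0]   # d/dz H_{k-1}, zero-padded
        h = [a - b for a, b in zip(two_z_h, dh)]
    return h


def evaluate(coeffs, z):
    """Evaluate the polynomial at z with Horner's scheme."""
    result = 0.0
    for c in reversed(coeffs):
        result = result * z + c
    return result


def has_definite_parity(coeffs):
    """True if only powers with the parity of the degree appear."""
    n = len(coeffs) - 1
    return all(c == 0 for k, c in enumerate(coeffs) if (n - k) % 2 == 1)


def count_real_zeros(coeffs, lo=-6.0, hi=6.0, steps=1200):
    """Count sign changes on a grid of midpoints (the grid never contains z = 0)."""
    width = (hi - lo) / steps
    values = [evaluate(coeffs, lo + (k + 0.5) * width) for k in range(steps)]
    return sum(1 for a, b in zip(values, values[1:]) if a * b < 0)


for n in range(5):
    h = hermite_coeffs(n)
    print(n, h, has_definite_parity(h), count_real_zeros(h))
```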

b. GENERATING FUNCTION 

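Consider the function of $z$ and $\lambda$:

$$F(z + \lambda) = e^{-(z + \lambda)^2} \tag{8}$$

Taylor's formula enables us to write:

$$F(z + \lambda) = \sum_{n=0}^{\infty} \frac{\lambda^n}{n!}\, F^{(n)}(z) = \sum_{n=0}^{\infty} \frac{\lambda^n}{n!}\, (-1)^n H_n(z)\, e^{-z^2} \tag{9}$$

Multiplying this relation by $e^{z^2}$ and replacing $\lambda$ by $-\lambda$, we obtain:

$$e^{z^2} F(z - \lambda) = \sum_{n=0}^{\infty} \frac{\lambda^n}{n!}\, H_n(z) \tag{10}$$

that is, if we replace $F(z - \lambda)$ by its value:

$$e^{-\lambda^2 + 2\lambda z} = \sum_{n=0}^{\infty} \frac{\lambda^n}{n!}\, H_n(z) \tag{11}$$

The Hermite polynomials can therefore be obtained from the series expansion in $\lambda$
of the function $e^{-\lambda^2 + 2\lambda z}$, which, for this reason, is called the generating function
of the Hermite polynomials.

Relation (11) gives us another definition of the Hermite polynomials $H_n(z)$:

$$H_n(z) = \left[ \frac{\partial^n}{\partial \lambda^n}\, e^{-\lambda^2 + 2\lambda z} \right]_{\lambda = 0} \tag{12}$$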

c. RECURRENCE RELATIONS; DIFFERENTIAL EQUATION


We have already obtained, in (6), one recurrence relation. It is easy to obtain
others by differentiating relation (11). A differentiation with respect to $z$ yields:

$$2\lambda\, e^{-\lambda^2 + 2\lambda z} = \sum_{n=0}^{\infty} \frac{\lambda^n}{n!}\, \frac{d}{dz} H_n(z) \tag{13}$$

that is, replacing $e^{-\lambda^2 + 2\lambda z}$ by the expansion (11) and setting equal terms of the
same power in $\lambda$:

$$\frac{d}{dz} H_n(z) = 2n\, H_{n-1}(z) \tag{14}$$

Similarly, if we differentiate (11) with respect to $\lambda$, an analogous argument yields:

$$H_n(z) = 2z\, H_{n-1}(z) - 2(n - 1)\, H_{n-2}(z) \tag{15}$$

Finally, it is not difficult to obtain a differential equation satisfied by the
polynomials $H_n(z)$. Differentiating (14) and using (6), we obtain:

$$\frac{d^2}{dz^2} H_n(z) = 2n\, \frac{d}{dz} H_{n-1}(z) = 2n \left[ 2z\, H_{n-1}(z) - H_n(z) \right] \tag{16}$$

that is, replacing $H_{n-1}(z)$ by its value as given in (14):

$$\left[ \frac{d^2}{dz^2} - 2z \frac{d}{dz} + 2n \right] H_n(z) = 0 \tag{17}$$
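
Relations (11), (14) and (17) lend themselves to a numerical spot check. In the sketch below, $H_n(z)$ is evaluated with recurrence (15), the generating series is truncated after a fixed number of terms, and derivatives are approximated by central finite differences; the function names, the truncation order and the step sizes are assumptions chosen for illustration.

```python
import math


def hermite_value(n, z):
    """H_n(z) evaluated with recurrence (15), starting from H_0 = 1 and H_1 = 2z."""
    h_prev, h = 1.0, 2.0 * z
    if n == 0:
        return h_prev
    for k in range(2, n + 1):
        h_prev, h = h, 2.0 * z * h - 2.0 * (k - 1) * h_prev
    return h


def generating_sum(lam, z, terms=40):
    """Right-hand side of (11), truncated after `terms` terms."""
    return sum(lam**n / math.factorial(n) * hermite_value(n, z) for n in range(terms))


def generating_closed_form(lam, z):
    """Left-hand side of (11)."""
    return math.exp(-lam**2 + 2.0 * lam * z)


def first_derivative(n, z, step=1e-5):
    """Central-difference estimate of dH_n/dz."""
    return (hermite_value(n, z + step) - hermite_value(n, z - step)) / (2.0 * step)


def ode_residual(n, z, step=1e-3):
    """Left-hand side of (17) with finite-difference derivatives; should be close to 0."""
    h_plus, h_mid, h_minus = (hermite_value(n, z + s) for s in (step, 0.0, -step))
    d1 = (h_plus - h_minus) / (2.0 * step)
    d2 = (h_plus - 2.0 * h_mid + h_minus) / step**2
    return d2 - 2.0 * z * d1 + 2.0 * n * h_mid


print(generating_sum(0.3, 0.7), generating_closed_form(0.3, 0.7))
print(first_derivative(4, 0.7), 2 * 4 * hermite_value(3, 0.7))
print(ode_residual(4, 0.7))
```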


d. EXAMPLES 

Definition (7) or the recurrence relation (6) (which amounts to the same 
thing) enables us to calculate the first Hermite polynomials easily: 

$$H_0(z) = 1$$
$$H_1(z) = 2z$$
$$H_2(z) = 4z^2 - 2$$
$$H_3(z) = 8z^3 - 12z \tag{18}$$

In general:

$$H_n(z) = \left(2z - \frac{d}{dz}\right)^n 1 \tag{19}$$

2. The eigenfunctions of the harmonic oscillator Hamiltonian


a. GENERATING FUNCTION 


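Consider the function:

$$K(\lambda, x) = \sum_{n=0}^{\infty} \frac{\lambda^n}{\sqrt{n!}}\, \langle x | \varphi_n \rangle \tag{20}$$

Using the relation [cf. chap. V, relation (C-13)]:

$$|\varphi_n\rangle = \frac{1}{\sqrt{n!}}\, (a^\dagger)^n\, |\varphi_0\rangle \tag{21}$$

we obtain:

$$K(\lambda, x) = \sum_{n=0}^{\infty} \langle x |\, \frac{(\lambda a^\dagger)^n}{n!}\, | \varphi_0 \rangle = \langle x |\, e^{\lambda a^\dagger}\, | \varphi_0 \rangle \tag{22}$$

Introducing, as in chapter V, the dimensionless operators $\hat{X}$ and $\hat{P}$:

$$\hat{X} = \beta X, \qquad \hat{P} = \frac{P}{\beta\hbar} \tag{23}$$

where the parameter $\beta$, which has the dimensions of an inverse length, is defined by:

$$\beta = \sqrt{\frac{m\omega}{\hbar}} \tag{24}$$

The operator:

$$e^{\lambda a^\dagger} = e^{\frac{\lambda}{\sqrt{2}}(\hat{X} - i\hat{P})} \tag{25}$$

can be calculated by using formula (63) of complement $B_{II}$, where we set:

$$A = \frac{\lambda}{\sqrt{2}}\, \hat{X}, \qquad B = -\frac{i\lambda}{\sqrt{2}}\, \hat{P} \tag{26}$$

We obtain:

$$e^{\lambda a^\dagger} = e^{\frac{\lambda}{\sqrt{2}}\hat{X}}\; e^{-\frac{i\lambda}{\sqrt{2}}\hat{P}}\; e^{\frac{i\lambda^2}{4}[\hat{X}, \hat{P}]} = e^{\frac{\lambda}{\sqrt{2}}\hat{X}}\; e^{-\frac{i\lambda}{\sqrt{2}}\hat{P}}\; e^{-\lambda^2/4} \tag{27}$$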
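Substituting this result into (22), we obtain:

$$K(\lambda, x) = e^{-\lambda^2/4}\, \langle x |\, e^{\frac{\lambda}{\sqrt{2}}\hat{X}}\, e^{-\frac{i\lambda}{\sqrt{2}}\hat{P}}\, | \varphi_0 \rangle = e^{-\lambda^2/4}\, e^{\beta\lambda x/\sqrt{2}}\, \langle x |\, e^{-i\lambda P/\beta\hbar\sqrt{2}}\, | \varphi_0 \rangle \tag{28}$$

Now, we have [cf. complement $E_{II}$, formula (15)]:

$$\langle x |\, e^{-i\lambda P/\beta\hbar\sqrt{2}} = \langle x - \lambda/\beta\sqrt{2}\, | \tag{29}$$

and (28) can be written:

$$K(\lambda, x) = e^{-\lambda^2/4}\, e^{\beta\lambda x/\sqrt{2}}\, \langle x - \lambda/\beta\sqrt{2}\, | \varphi_0 \rangle = e^{-\lambda^2/4}\, e^{\beta\lambda x/\sqrt{2}}\, \varphi_0(x - \lambda/\beta\sqrt{2}) \tag{30}$$

Using formula (C-25) of chapter V, we obtain, finally:

$$K(\lambda, x) = \left(\frac{\beta^2}{\pi}\right)^{1/4} \exp\left\{ -\frac{\beta^2 x^2}{2} + \beta\lambda x \sqrt{2} - \frac{\lambda^2}{2} \right\} \tag{31}$$

According to definition (20), all we must do to find the wave functions
$\varphi_n(x) = \langle x | \varphi_n \rangle$ is expand this expression in powers of $\lambda$:

$$K(\lambda, x) = \sum_{n=0}^{\infty} \frac{\lambda^n}{\sqrt{n!}}\, \varphi_n(x) \tag{32}$$

$K(\lambda, x)$ is called the generating function of the $\varphi_n(x)$.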


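b. $\varphi_n(x)$ IN TERMS OF THE HERMITE POLYNOMIALS

Replacing, in formula (11), $\lambda$ by $\lambda/\sqrt{2}$ and $z$ by $\beta x$, we obtain:

$$\exp\left\{ -\frac{\lambda^2}{2} + \beta\lambda x \sqrt{2} \right\} = \sum_{n=0}^{\infty} \frac{1}{n!} \left(\frac{\lambda}{\sqrt{2}}\right)^n H_n(\beta x) \tag{33}$$

Substituting this expression into (31):

$$K(\lambda, x) = \left(\frac{\beta^2}{\pi}\right)^{1/4} e^{-\beta^2 x^2/2} \sum_{n=0}^{\infty} \frac{1}{n!} \left(\frac{\lambda}{\sqrt{2}}\right)^n H_n(\beta x) \tag{34}$$

Setting equal the coefficients of the different powers of $\lambda$ in (32) and (34), we obtain:

$$\varphi_n(x) = \left(\frac{\beta^2}{\pi}\right)^{1/4} \frac{1}{\sqrt{2^n\, n!}}\; e^{-\beta^2 x^2/2}\, H_n(\beta x) \tag{35}$$

The shape of the function $\varphi_n(x)$ is therefore analogous to that of the nth-order
derivative of the Gaussian function $F(x)$ considered in §1 above; $\varphi_n(x)$ is of
parity $(-1)^n$ and possesses $n$ zeros interposed between those of $\varphi_{n+1}(x)$. We
mentioned in §C-2 of chapter V that this property is related to the increase in the
average kinetic energy of the states $|\varphi_n\rangle$ when $n$ increases.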
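
Formula (35) can be checked numerically: the functions it defines should be orthonormal for any value of $\beta$. The sketch below evaluates $H_n$ with recurrence (15) of §1 and estimates the overlap integrals with a Riemann sum on a uniform grid; the function names, the grid and the truncation of the integration range are assumptions made for illustration.

```python
import math


def hermite_value(n, z):
    """H_n(z) from the recurrence H_n = 2z H_{n-1} - 2(n-1) H_{n-2}."""
    h_prev, h = 1.0, 2.0 * z
    if n == 0:
        return h_prev
    for k in range(2, n + 1):
        h_prev, h = h, 2.0 * z * h - 2.0 * (k - 1) * h_prev
    return h


def phi(n, x, beta=1.0):
    """Stationary wave function phi_n(x) of formula (35)."""
    norm = (beta**2 / math.pi) ** 0.25 / math.sqrt(2.0**n * math.factorial(n))
    return norm * math.exp(-0.5 * (beta * x) ** 2) * hermite_value(n, beta * x)


def overlap(n, m, beta=1.0, half_width=12.0, steps=4000):
    """Riemann-sum estimate of the integral of phi_n(x) phi_m(x) over x.

    The range is cut at |beta x| = half_width, where the Gaussian factor
    is negligible for the small values of n used here.
    """
    dx = 2.0 * half_width / (beta * steps)
    total = 0.0
    for k in range(steps + 1):
        x = -half_width / beta + k * dx
        total += phi(n, x, beta) * phi(m, x, beta)
    return total * dx


print(overlap(3, 3), overlap(2, 4), overlap(1, 1, beta=2.5))
```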
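c. RECURRENCE RELATIONS

Let us write the equations:

$$a\, |\varphi_n\rangle = \sqrt{n}\; |\varphi_{n-1}\rangle$$
$$a^\dagger\, |\varphi_n\rangle = \sqrt{n + 1}\; |\varphi_{n+1}\rangle \tag{36}$$

in the $\{ |x\rangle \}$ representation. Using the definitions of $a$ and $a^\dagger$ [cf. chap. V,
relations (B-6)], we see that in the $\{ |x\rangle \}$ representation, the action of these
operators is given by:

$$a \Rightarrow \frac{\beta}{\sqrt{2}} \left[ x + \frac{1}{\beta^2} \frac{d}{dx} \right], \qquad a^\dagger \Rightarrow \frac{\beta}{\sqrt{2}} \left[ x - \frac{1}{\beta^2} \frac{d}{dx} \right] \tag{37}$$

Equations (36) therefore become:

$$\frac{\beta}{\sqrt{2}} \left[ x\, \varphi_n(x) + \frac{1}{\beta^2} \frac{d}{dx} \varphi_n(x) \right] = \sqrt{n}\; \varphi_{n-1}(x)$$
$$\frac{\beta}{\sqrt{2}} \left[ x\, \varphi_n(x) - \frac{1}{\beta^2} \frac{d}{dx} \varphi_n(x) \right] = \sqrt{n + 1}\; \varphi_{n+1}(x) \tag{38}$$

Let us take the sum and difference of these equations:

$$x \beta \sqrt{2}\; \varphi_n(x) = \sqrt{n}\; \varphi_{n-1}(x) + \sqrt{n + 1}\; \varphi_{n+1}(x) \tag{39}$$

$$\frac{\sqrt{2}}{\beta} \frac{d}{dx} \varphi_n(x) = \sqrt{n}\; \varphi_{n-1}(x) - \sqrt{n + 1}\; \varphi_{n+1}(x) \tag{40}$$

COMMENT:

If we replace the functions $\varphi_n(x)$ in (39) and (40) by their expressions
given in (35), we obtain, after simplification (setting $\hat{x} = \beta x$):

$$2\hat{x}\, H_n(\hat{x}) = 2n\, H_{n-1}(\hat{x}) + H_{n+1}(\hat{x}) \tag{41}$$

$$2 \left[ -\hat{x}\, H_n(\hat{x}) + \frac{d}{d\hat{x}} H_n(\hat{x}) \right] = 2n\, H_{n-1}(\hat{x}) - H_{n+1}(\hat{x}) \tag{42}$$

By taking the sum and the difference of these equations, we obtain relations (6)
and (14) of §1.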


References 

Messiah (1.17), App. B, §III; Arfken (10.4), chap. 13, §1; Angot (10.2), §7.8.
Complement C_V

SOLVING THE EIGENVALUE EQUATION
OF THE HARMONIC OSCILLATOR
BY THE POLYNOMIAL METHOD


1. Changing the function and the variable 

2. The polynomial method 

a. The asymptotic form of $\hat{\varphi}(\hat{x})$

b. The calculation of $h(\hat{x})$ in the form of a series expansion

c. Quantization of the energy 

d. Stationary wave functions 

In §B of chapter V, the method used to calculate the energies of the harmonic
oscillator stationary states $|\varphi_n\rangle$ is based on the use of the operators $a$, $a^\dagger$ and $N$
and their commutation relations. It is possible to obtain the same results in a
completely different way, by solving the eigenvalue equation of the Hamiltonian $H$
in the $\{ |x\rangle \}$ representation. This is what we are going to do in this complement.


1. Changing the function and the variable 

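In the $\{ |x\rangle \}$ representation, the eigenvalue equation of $H$ is written:

$$\left[ -\frac{\hbar^2}{2m} \frac{d^2}{dx^2} + \frac{1}{2} m\omega^2 x^2 \right] \varphi(x) = E\, \varphi(x) \tag{1}$$

As in chapter V, let us introduce the dimensionless operators $\hat{X}$ and $\hat{P}$:

$$\hat{X} = \beta X, \qquad \hat{P} = \frac{P}{\beta\hbar} \tag{2}$$

where the parameter $\beta$, which has the dimensions of an inverse length, is defined by:

$$\beta = \sqrt{\frac{m\omega}{\hbar}} \tag{3}$$

Let us denote by $|\xi_{\hat{x}}\rangle$ the eigenvector of $\hat{X}$ whose eigenvalue is $\hat{x}$:

$$\hat{X}\, |\xi_{\hat{x}}\rangle = \hat{x}\, |\xi_{\hat{x}}\rangle \tag{4}$$

The orthonormalization and closure relations of the kets $|\xi_{\hat{x}}\rangle$ are written:

$$\langle \xi_{\hat{x}} | \xi_{\hat{x}'} \rangle = \delta(\hat{x} - \hat{x}') \tag{5}$$

$$\int_{-\infty}^{+\infty} d\hat{x}\; |\xi_{\hat{x}}\rangle \langle \xi_{\hat{x}}| = 1 \tag{6}$$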
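The ket $|\xi_{\hat{x}}\rangle$ is obviously an eigenvector of $X$, with the eigenvalue $\hat{x}/\beta$.
Therefore, when:

$$\hat{x} = \beta x \tag{7}$$

the two kets $|x\rangle$ and $|\xi_{\hat{x}}\rangle$ are proportional. However, they are not equal.

Writing the closure relation for the kets $|x\rangle$:

$$\int_{-\infty}^{+\infty} dx\; |x\rangle \langle x| = 1 \tag{8}$$

and making the change of variables given in (7), we obtain:

$$\int_{-\infty}^{+\infty} \frac{d\hat{x}}{\beta}\; |x = \hat{x}/\beta\rangle \langle x = \hat{x}/\beta| = 1 \tag{9}$$

Comparison with (6) shows that we can, for example, set:

$$|x = \hat{x}/\beta\rangle = \sqrt{\beta}\; |\xi_{\hat{x}}\rangle \tag{10}$$

to orthonormalize the kets $|\xi_{\hat{x}}\rangle$ with respect to $\hat{x}$, since the kets $|x\rangle$ are orthonormal
with respect to $x$.

Let $|\varphi\rangle$ be an arbitrary ket, $\varphi(x) = \langle x | \varphi \rangle$ its wave function in the $\{ |x\rangle \}$
representation, and $\hat{\varphi}(\hat{x}) = \langle \xi_{\hat{x}} | \varphi \rangle$ its wave function in the $\{ |\xi_{\hat{x}}\rangle \}$ representation.

According to (10):

$$\hat{\varphi}(\hat{x}) = \langle \xi_{\hat{x}} | \varphi \rangle = \frac{1}{\sqrt{\beta}}\, \langle x = \hat{x}/\beta\, | \varphi \rangle \tag{11}$$

that is:

$$\hat{\varphi}(\hat{x}) = \frac{1}{\sqrt{\beta}}\, \varphi(x = \hat{x}/\beta) \tag{12}$$

If $|\varphi\rangle$ is normalized, relation (8) yields:

$$\langle \varphi | \varphi \rangle = \langle \varphi | \left( \int_{-\infty}^{+\infty} dx\; |x\rangle \langle x| \right) | \varphi \rangle = \int_{-\infty}^{+\infty} \varphi^*(x)\, \varphi(x)\, dx = 1 \tag{13}$$

and relation (6) gives:

$$\langle \varphi | \varphi \rangle = \langle \varphi | \left( \int_{-\infty}^{+\infty} d\hat{x}\; |\xi_{\hat{x}}\rangle \langle \xi_{\hat{x}}| \right) | \varphi \rangle = \int_{-\infty}^{+\infty} \hat{\varphi}^*(\hat{x})\, \hat{\varphi}(\hat{x})\, d\hat{x} = 1 \tag{14}$$

The wave function $\hat{\varphi}(\hat{x})$ is therefore normalized with respect to the variable $\hat{x}$,
as is $\varphi(x)$ with respect to the variable $x$. [This can be shown by using the integral
in (13), in which we make the change of variables indicated in (7)].

Now, substituting (7) and (12) into (1), we obtain:

$$\frac{1}{2} \left[ -\frac{d^2}{d\hat{x}^2} + \hat{x}^2 \right] \hat{\varphi}(\hat{x}) = \varepsilon\, \hat{\varphi}(\hat{x}) \tag{15}$$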
setting:

$$\varepsilon = \frac{E}{\hbar\omega} \tag{16}$$

Equation (15) is more convenient than equation (1) since all the quantities appearing
in it are dimensionless.


2. The polynomial method 

a. THE ASYMPTOTIC FORM OF $\hat{\varphi}(\hat{x})$

Equation (15) can be written:

$$\left[ \frac{d^2}{d\hat{x}^2} - (\hat{x}^2 - 2\varepsilon) \right] \hat{\varphi}(\hat{x}) = 0 \tag{17}$$

Let us try to predict intuitively the behavior of $\hat{\varphi}(\hat{x})$ for very large $\hat{x}$. To do this,
consider the functions:

$$G_\pm(\hat{x}) = e^{\pm \hat{x}^2/2} \tag{18}$$

They are solutions of the differential equations:

$$\left[ \frac{d^2}{d\hat{x}^2} - (\hat{x}^2 \pm 1) \right] G_\pm(\hat{x}) = 0 \tag{19}$$

When $\hat{x}$ approaches infinity:

$$\hat{x}^2 \pm 1 \sim \hat{x}^2 \sim \hat{x}^2 - 2\varepsilon \tag{20}$$

and equations (17) and (19) take on the same form asymptotically. We should
therefore expect the solutions of equation (17) to behave, for large $\hat{x}$*, either
like $e^{\hat{x}^2/2}$ or like $e^{-\hat{x}^2/2}$. From a physical point of view, the only functions $\hat{\varphi}(\hat{x})$
which are of interest to us are those which are bounded everywhere. This restricts
us to solutions of (17) which behave like $e^{-\hat{x}^2/2}$ (if they exist). This is why we shall
set:

$$\hat{\varphi}(\hat{x}) = e^{-\hat{x}^2/2}\, h(\hat{x}) \tag{21}$$

Substituting (21) into (17), we obtain:

$$\frac{d^2}{d\hat{x}^2} h(\hat{x}) - 2\hat{x} \frac{d}{d\hat{x}} h(\hat{x}) + (2\varepsilon - 1)\, h(\hat{x}) = 0 \tag{22}$$

We are going to show how this equation can be solved by expanding $h(\hat{x})$ in a
power series. Then we shall impose the condition that its solutions be physically
acceptable.

* The solutions of equation (17) are not necessarily equivalent to $e^{\hat{x}^2/2}$ or $e^{-\hat{x}^2/2}$ when $\hat{x} \to \infty$:
the intuitive arguments which we have given do not exclude, for example, the possibility that $\hat{\varphi}(\hat{x})$ may
behave like the product of $e^{\hat{x}^2/2}$ or $e^{-\hat{x}^2/2}$ and a power of $\hat{x}$.
b. THE CALCULATION OF $h(\hat{x})$ IN THE FORM OF A SERIES EXPANSION


As we pointed out in §A-3 of chapter V, the solutions of equation (1) [or,
which amounts to the same thing, of (17)] can be sought amongst either even or
odd functions. Since the function $e^{-\hat{x}^2/2}$ is even, we can therefore set:

$$h(\hat{x}) = \hat{x}^p \left( a_0 + a_2 \hat{x}^2 + a_4 \hat{x}^4 + \ldots + a_{2m} \hat{x}^{2m} + \ldots \right) \tag{23}$$

with $a_0 \neq 0$ (where $a_0 \hat{x}^p$ is, by definition, the first non-zero term of the expansion).
Writing (23) in the form:

$$h(\hat{x}) = \sum_{m=0}^{\infty} a_{2m}\, \hat{x}^{2m+p} \tag{24}$$

we easily obtain:

$$\frac{d}{d\hat{x}} h(\hat{x}) = \sum_{m=0}^{\infty} (2m + p)\, a_{2m}\, \hat{x}^{2m+p-1} \tag{25}$$

and:

$$\frac{d^2}{d\hat{x}^2} h(\hat{x}) = \sum_{m=0}^{\infty} (2m + p)(2m + p - 1)\, a_{2m}\, \hat{x}^{2m+p-2} \tag{26}$$

Let us substitute (24), (25) and (26) into (22). For this equation to be satisfied,
each term of the series expansion of the left-hand side must be equal to zero. For
the general term in $\hat{x}^{2m+p}$, this condition is written:

$$(2m + p + 2)(2m + p + 1)\, a_{2m+2} = (4m + 2p - 2\varepsilon + 1)\, a_{2m} \tag{27}$$

The term of lowest degree is in $\hat{x}^{p-2}$; its coefficient will be zero if:

$$p(p - 1)\, a_0 = 0 \tag{28}$$

Since $a_0$ is not zero, we therefore have either $p = 0$ [the function $\hat{\varphi}(\hat{x})$ is then even]
or $p = 1$ [$\hat{\varphi}(\hat{x})$ is then odd].

Relation (27) can be written:

$$a_{2m+2} = \frac{4m + 2p + 1 - 2\varepsilon}{(2m + p + 2)(2m + p + 1)}\; a_{2m} \tag{29}$$

which is a recurrence relation between the coefficients $a_{2m}$. Since $a_0$ is not zero, (29)
enables us to calculate $a_2$ in terms of $a_0$, $a_4$ in terms of $a_2$, and so on.

For arbitrary $\varepsilon$, we therefore have the series expansion of two linearly
independent solutions of equation (22), corresponding respectively to $p = 0$
and $p = 1$.


c. QUANTIZATION OF THE ENERGY 

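We must now choose, from amongst all the solutions found in the preceding
section, those which satisfy the physical conditions that $\hat{\varphi}(\hat{x})$ be bounded
everywhere.

For most values of $\varepsilon$, the numerator of (29) does not go to zero for any
positive or zero integer $m$. Since none of the coefficients $a_{2m}$ is then zero, the
series has an infinite number of terms.

It can be shown that the asymptotic behavior of such a series makes it
physically unacceptable. We see from (29) that:

$$\frac{a_{2m+2}}{a_{2m}} \underset{m \to \infty}{\sim} \frac{1}{m} \tag{30}$$

Now consider the power series expansion of the function $e^{\lambda \hat{x}^2}$ (where $\lambda$ is a real
parameter):

$$e^{\lambda \hat{x}^2} = \sum_{m=0}^{\infty} b_{2m}\, \hat{x}^{2m} \tag{31}$$

with:

$$b_{2m} = \frac{\lambda^m}{m!} \tag{32}$$

For this second series, we therefore have:

$$\frac{b_{2m+2}}{b_{2m}} = \frac{m!}{(m + 1)!}\, \frac{\lambda^{m+1}}{\lambda^m} = \frac{\lambda}{m + 1} \underset{m \to \infty}{\sim} \frac{\lambda}{m} \tag{33}$$

If we choose the value of the parameter $\lambda$ such that:

$$0 < \lambda < 1 \tag{34}$$

we see from (30) and (33) that there exists an integer $M$ such that the condition
$m > M$ implies:

$$\frac{a_{2m+2}}{a_{2m}} > \frac{b_{2m+2}}{b_{2m}} > 0 \tag{35}$$

We can deduce from this that, when condition (34) is satisfied, we have:

$$\left| \hat{x}^{-p} h(\hat{x}) - P(\hat{x}) \right| \geq \left| \frac{a_{2M}}{b_{2M}} \right| \left| e^{\lambda \hat{x}^2} - Q(\hat{x}) \right| \tag{36}$$

where $P(\hat{x})$ and $Q(\hat{x})$ are polynomials of degree $2M$ given by the first $M + 1$ terms
of series (23) and (31). When $\hat{x}$ approaches infinity, (36) gives:

$$|h(\hat{x})| \gtrsim \left| \frac{a_{2M}}{b_{2M}} \right| \hat{x}^p\, e^{\lambda \hat{x}^2} \tag{37}$$

and therefore:

$$|\hat{\varphi}(\hat{x})| \gtrsim \left| \frac{a_{2M}}{b_{2M}} \right| \hat{x}^p\, e^{(\lambda - 1/2)\hat{x}^2} \tag{38}$$

Since we can choose $\lambda$ such that:

$$1/2 < \lambda < 1 \tag{39}$$

$|\hat{\varphi}(\hat{x})|$ is not bounded when $\hat{x} \to \infty$. We must therefore reject this solution, which
makes no sense physically.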
There is only one possibility left: that the numerator of (29) goes to zero
for a value $m_0$ of $m$. We then have:

$$a_{2m} \neq 0 \text{ if } m \leq m_0, \qquad a_{2m} = 0 \text{ if } m > m_0 \tag{40}$$

and the power series expansion of $h(\hat{x})$ reduces to a polynomial of degree $2m_0 + p$.
The behavior at infinity of $\hat{\varphi}(\hat{x})$ is then determined by that of the exponential $e^{-\hat{x}^2/2}$,
and $\hat{\varphi}(\hat{x})$ is physically acceptable (since it is square-integrable).

The fact that the numerator of (29) goes to zero at $m = m_0$ imposes the
condition:

$$2\varepsilon = 2(2m_0 + p) + 1 \tag{41}$$

If we set:

$$2m_0 + p = n \tag{42}$$

equation (41) can be written:

$$\varepsilon = \varepsilon_n = n + 1/2 \tag{43}$$

where $n$ is an arbitrary positive integer or zero (since $m_0$ is an arbitrary positive
integer or zero and since $p$ is equal to 0 or 1). Condition (43) introduces the
quantization of the harmonic oscillator energy, since it implies [cf. (16)]:

$$E_n = \left( n + \frac{1}{2} \right) \hbar\omega \tag{44}$$

We have thus obtained relation (B-34) of chapter V.
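
The alternative described above, an infinite series unless $\varepsilon = n + 1/2$, can be seen directly by iterating recurrence (29). The following sketch uses exact rational arithmetic; the function name and the normalization $a_0 = 1$ are assumptions made for illustration.

```python
from fractions import Fraction


def series_coefficients(epsilon, p, count):
    """First `count` coefficients a_0, a_2, a_4, ... from recurrence (29), with a_0 = 1."""
    coeffs = [Fraction(1)]
    for m in range(count - 1):
        numerator = 4 * m + 2 * p + 1 - 2 * epsilon
        denominator = (2 * m + p + 2) * (2 * m + p + 1)
        coeffs.append(coeffs[-1] * numerator / denominator)
    return coeffs


# epsilon = 5/2 with p = 0 (n = 2): the series stops after a_2.
print(series_coefficients(Fraction(5, 2), 0, 5))

# epsilon = 1 is not of the form n + 1/2: no coefficient vanishes, and the
# ratio a_{2m+2} / a_{2m} approaches 1/m as in (30).
a = series_coefficients(Fraction(1), 0, 1002)
print(float(a[1001] / a[1000]) * 1000)
```

When the series terminates, all later coefficients vanish identically because each one is proportional to its predecessor.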


d. STATIONARY WAVE FUNCTIONS 

The polynomial method also yields the eigenfunctions associated with the
various energies $E_n$, in the form:

$$\hat{\varphi}_n(\hat{x}) = e^{-\hat{x}^2/2}\, h_n(\hat{x}) \tag{45}$$

where $h_n(\hat{x})$ is an nth-degree polynomial. According to (23) and (42), $h_n(\hat{x})$ is an
even function if $n$ is even and an odd function if $n$ is odd.

The ground state is obtained for $n = 0$, that is, for $m_0 = p = 0$; $h_0(\hat{x})$ is then
a constant, and:

$$\hat{\varphi}_0(\hat{x}) = a_0\, e^{-\hat{x}^2/2} \tag{46}$$

A simple calculation shows that, to normalize $\hat{\varphi}_0(\hat{x})$ with respect to the variable $\hat{x}$,
it suffices to choose:

$$a_0 = \pi^{-1/4} \tag{47}$$

Then, using (12), we find:

$$\varphi_0(x) = \left( \frac{\beta^2}{\pi} \right)^{1/4} e^{-\beta^2 x^2/2} \tag{48}$$

which is indeed the expression given in chapter V [formula (C-25)].

To the first excited state $E_1 = 3\hbar\omega/2$ corresponds $n = 1$, that is, $m_0 = 0$
and $p = 1$; $h_1(\hat{x})$ then has only one term, and a calculation analogous
to the preceding one gives:

$$\hat{\varphi}_1(\hat{x}) = \left( \frac{4}{\pi} \right)^{1/4} \hat{x}\, e^{-\hat{x}^2/2}$$
$$\varphi_1(x) = \left( \frac{4\beta^6}{\pi} \right)^{1/4} x\, e^{-\beta^2 x^2/2} \tag{49}$$

For $n = 2$, we have $m_0 = 1$ and $p = 0$. Relation (29) then yields:

$$a_2 = -2a_0 \tag{50}$$

which finally leads to:

$$\hat{\varphi}_2(\hat{x}) = \left( \frac{1}{4\pi} \right)^{1/4} (2\hat{x}^2 - 1)\, e^{-\hat{x}^2/2}$$
$$\varphi_2(x) = \left( \frac{\beta^2}{4\pi} \right)^{1/4} (2\beta^2 x^2 - 1)\, e^{-\beta^2 x^2/2} \tag{51}$$

For arbitrary $n$, $h_n(\hat{x})$ is the polynomial solution of equation (22), which can be
written, taking the quantization condition (43) into account:

$$\left[ \frac{d^2}{d\hat{x}^2} - 2\hat{x} \frac{d}{d\hat{x}} + 2n \right] h_n(\hat{x}) = 0 \tag{52}$$

We recognize (52) to be the differential equation satisfied by the Hermite
polynomial $H_n(\hat{x})$ [see equation (17) of complement $B_V$]. The polynomial $h_n(\hat{x})$ is
therefore proportional to $H_n(\hat{x})$, where the proportionality factor is determined
by normalization of $\hat{\varphi}_n(\hat{x})$. This is in agreement with formula (35) of complement $B_V$.
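
The proportionality between $h_n$ and $H_n$ can be verified coefficient by coefficient for small $n$. The sketch below builds $h_n$ from recurrence (29) with $\varepsilon = n + 1/2$ and $a_0 = 1$, builds $H_n$ from relation (6) of complement $B_V$, and compares the two; the function names and the list representation (lowest degree first) are illustrative assumptions.

```python
from fractions import Fraction


def hermite_coeffs(n):
    """Coefficients of H_n, from H_0 = 1 and H_k = (2z - d/dz) H_{k-1}."""
    h = [1]
    for _ in range(n):
        two_z_h = [0] + [2 * c for c in h]
        dh = [k * c for k, c in enumerate(h)][1:] + [0, 0]
        h = [a - b for a, b in zip(two_z_h, dh)]
    return h


def polynomial_solution(n):
    """Coefficients of h_n (with a_0 = 1) from recurrence (29) at epsilon = n + 1/2."""
    p = n % 2
    m0 = (n - p) // 2
    epsilon = Fraction(2 * n + 1, 2)
    coeffs = [Fraction(0)] * (n + 1)
    a = Fraction(1)
    for m in range(m0 + 1):
        coeffs[2 * m + p] = a
        a = a * (4 * m + 2 * p + 1 - 2 * epsilon) / ((2 * m + p + 2) * (2 * m + p + 1))
    return coeffs


def proportionality_factor(n):
    """Return c such that H_n = c * h_n, or None if the two are not proportional."""
    small_h, big_h = polynomial_solution(n), hermite_coeffs(n)
    c = Fraction(big_h[n % 2]) / small_h[n % 2]
    if all(c * a == b for a, b in zip(small_h, big_h)):
        return c
    return None


for n in range(6):
    print(n, polynomial_solution(n), proportionality_factor(n))
```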


References 

Mathematical treatment of differential equations: Morse and Feshbach (10.13),
chaps. 5 and 6; Courant and Hilbert (10.11), §V-11.
Complement D_V

STUDY OF THE STATIONARY STATES
IN THE { | p > } REPRESENTATION


1. Wave functions in momentum space

a. Changing the variable and the function

b. Determination of $\hat{\bar{\varphi}}_n(\hat{p})$

c. Calculation of the phase factor

2. Discussion


The distribution of the possible momenta of a particle in the state $|\varphi_n\rangle$ is given
by the wave function $\bar{\varphi}_n(p)$ in the $\{ |p\rangle \}$ representation, the Fourier transform
of the wave function $\varphi_n(x)$ in the $\{ |x\rangle \}$ representation. We shall show in this
complement that in the case of the harmonic oscillator, the functions $\varphi_n$ and $\bar{\varphi}_n$ are
the same (to within multiplicative factors). Thus, in a stationary state, the probability
distributions of the momentum and the position have similar forms.


1. Wave functions in momentum space 


a. CHANGING THE VARIABLE AND THE FUNCTION 


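In complement $C_V$, we introduced, for simplicity, the operator:

$$\hat{X} = \beta X \tag{1}$$

where:

$$\beta = \sqrt{\frac{m\omega}{\hbar}} \tag{2}$$

as well as the eigenkets $|\xi_{\hat{x}}\rangle$ of $\hat{X}$ and the wave function $\hat{\varphi}(\hat{x})$ in the $\{ |\xi_{\hat{x}}\rangle \}$
representation. We shall follow a similar procedure for the operator:

$$\hat{P} = \frac{P}{\beta\hbar} \tag{3}$$

We shall therefore call $|\pi_{\hat{p}}\rangle$ the eigenkets of $\hat{P}$:

$$\hat{P}\, |\pi_{\hat{p}}\rangle = \hat{p}\, |\pi_{\hat{p}}\rangle \tag{4}$$

and denote by $\hat{\bar{\varphi}}(\hat{p})$ the wave function in the $\{ |\pi_{\hat{p}}\rangle \}$ representation:

$$\hat{\bar{\varphi}}(\hat{p}) = \langle \pi_{\hat{p}} | \varphi \rangle \tag{5}$$

Just as the ket $|\xi_{\hat{x}}\rangle$ is proportional to the ket $|x = \hat{x}/\beta\rangle$, the ket $|\pi_{\hat{p}}\rangle$ is proportional
to the ket $|p = \beta\hbar\hat{p}\rangle$. If we change $\beta$ to $1/\beta\hbar$ [cf. (1) and (3)], equation (10) of
complement $C_V$ shows that:

$$|\pi_{\hat{p}}\rangle = \sqrt{\beta\hbar}\; |p = \beta\hbar\hat{p}\rangle \tag{6}$$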
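The wave function $\hat{\bar{\varphi}}(\hat{p})$ in the $\{ |\pi_{\hat{p}}\rangle \}$ representation is therefore related to
the wave function $\bar{\varphi}(p)$ in the $\{ |p\rangle \}$ representation by:

$$\hat{\bar{\varphi}}(\hat{p}) = \sqrt{\beta\hbar}\; \bar{\varphi}(p = \beta\hbar\hat{p}) \tag{7}$$

Furthermore, we can use (6) and relation (10) of complement $C_V$ to obtain:

$$\langle \xi_{\hat{x}} | \pi_{\hat{p}} \rangle = \frac{e^{i\hat{p}\hat{x}}}{\sqrt{2\pi}} \tag{8}$$

We therefore have, using definition (5) and the closure relation for the $\{ |\xi_{\hat{x}}\rangle \}$ basis:

$$\hat{\bar{\varphi}}(\hat{p}) = \int_{-\infty}^{+\infty} \langle \pi_{\hat{p}} | \xi_{\hat{x}} \rangle \langle \xi_{\hat{x}} | \varphi \rangle\, d\hat{x} = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{+\infty} e^{-i\hat{p}\hat{x}}\, \hat{\varphi}(\hat{x})\, d\hat{x} \tag{9}$$

The function $\hat{\bar{\varphi}}$ is therefore the Fourier transform of $\hat{\varphi}$.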


b. DETERMINATION OF $\hat{\bar{\varphi}}_n(\hat{p})$

We have seen [cf. equation (15) of complement $C_V$] that the stationary wave
functions $\hat{\varphi}(\hat{x})$ of the harmonic oscillator satisfy the equation:

$$\frac{1}{2} \left[ -\frac{d^2}{d\hat{x}^2} + \hat{x}^2 \right] \hat{\varphi}(\hat{x}) = \varepsilon\, \hat{\varphi}(\hat{x}) \tag{10}$$

Now, the Fourier transform of $\frac{d^2}{d\hat{x}^2} \hat{\varphi}(\hat{x})$ is $-\hat{p}^2\, \hat{\bar{\varphi}}(\hat{p})$ and that of $\hat{x}^2 \hat{\varphi}(\hat{x})$ is
$-\frac{d^2}{d\hat{p}^2} \hat{\bar{\varphi}}(\hat{p})$. The Fourier transform of equation (10) is therefore:

$$\frac{1}{2} \left[ \hat{p}^2 - \frac{d^2}{d\hat{p}^2} \right] \hat{\bar{\varphi}}(\hat{p}) = \varepsilon\, \hat{\bar{\varphi}}(\hat{p}) \tag{11}$$

If we compare (10) and (11), we see that the functions $\hat{\varphi}_n$ and $\hat{\bar{\varphi}}_n$ satisfy the
same differential equation. Moreover, we know that this equation, when $\varepsilon = n + 1/2$
(where $n$ is a positive integer or zero), has only one square-integrable solution (the
eigenvalues $\varepsilon_n$ are non-degenerate; cf. chapter V, §B-3). We can conclude that $\hat{\varphi}_n$
and $\hat{\bar{\varphi}}_n$ are proportional. Since these two functions are normalized, the proportionality
factor is a complex number of modulus 1, so that:

$$\hat{\bar{\varphi}}_n(\hat{p}) = e^{i\theta_n}\, \hat{\varphi}_n(\hat{x} = \hat{p}) \tag{12}$$

where $e^{i\theta_n}$ is a phase factor which we shall now find.

c. CALCULATION OF THE PHASE FACTOR

The wave function of the ground state is given by [cf. complement $C_V$,
formulas (46) and (47)]:

$$\hat{\varphi}_0(\hat{x}) = \pi^{-1/4}\, e^{-\hat{x}^2/2} \tag{13}$$

This is a Gaussian function; its Fourier transform is therefore [cf. appendix I,
relation (50)]:

$$\hat{\bar{\varphi}}_0(\hat{p}) = \pi^{-1/4}\, e^{-\hat{p}^2/2} \tag{14}$$

This implies that $\theta_0$ is zero.

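To find $\theta_n$, let us write, in the $\{ |\xi_{\hat{x}}\rangle \}$ and $\{ |\pi_{\hat{p}}\rangle \}$ representations, the
relation:

$$a^\dagger\, |\varphi_n\rangle = \sqrt{n + 1}\; |\varphi_{n+1}\rangle \tag{15}$$

In the $\{ |\xi_{\hat{x}}\rangle \}$ representation, $\hat{X}$ and $\hat{P}$ act like $\hat{x}$ and $\frac{1}{i} \frac{d}{d\hat{x}}$; $a^\dagger$ therefore
acts like $\frac{1}{\sqrt{2}} \left( \hat{x} - \frac{d}{d\hat{x}} \right)$. In the $\{ |\pi_{\hat{p}}\rangle \}$ representation, $\hat{X}$ acts like $i \frac{d}{d\hat{p}}$ and $\hat{P}$ like $\hat{p}$;
$a^\dagger$ therefore acts like $\frac{i}{\sqrt{2}} \left( \frac{d}{d\hat{p}} - \hat{p} \right)$.

In the $\{ |\xi_{\hat{x}}\rangle \}$ representation, relation (15) therefore becomes:

$$\hat{\varphi}_{n+1}(\hat{x}) = \frac{1}{\sqrt{2(n + 1)}} \left( \hat{x} - \frac{d}{d\hat{x}} \right) \hat{\varphi}_n(\hat{x}) \tag{16}$$

while in the $\{ |\pi_{\hat{p}}\rangle \}$ representation, it becomes:

$$\hat{\bar{\varphi}}_{n+1}(\hat{p}) = \frac{i}{\sqrt{2(n + 1)}} \left( \frac{d}{d\hat{p}} - \hat{p} \right) \hat{\bar{\varphi}}_n(\hat{p}) \tag{17}$$

We therefore have:

$$e^{i\theta_{n+1}} = -i\, e^{i\theta_n} \tag{18}$$

that is, knowing that $\theta_0 = 0$:

$$e^{i\theta_n} = (-i)^n \tag{19}$$

Thus we obtain:

$$\hat{\bar{\varphi}}_n(\hat{p}) = (-i)^n\, \hat{\varphi}_n(\hat{x} = \hat{p}) \tag{20}$$

or, returning to the functions $\varphi_n$ and $\bar{\varphi}_n$:

$$\bar{\varphi}_n(p) = (-i)^n\, \frac{1}{\beta\sqrt{\hbar}}\; \varphi_n\!\left( x = \frac{p}{\beta^2\hbar} \right) \tag{21}$$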
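
Relation (20) can be tested numerically by evaluating the Fourier integral (9) for the first few stationary states. In the sketch below, the dimensionless wave functions are taken from formula (35) of complement $B_V$ with $\beta = 1$, and the integral is approximated by a Riemann sum on a uniform grid; the function names, grid parameters and test momenta are assumptions chosen for illustration.

```python
import cmath
import math


def hermite_value(n, z):
    """H_n(z) from the recurrence H_n = 2z H_{n-1} - 2(n-1) H_{n-2}."""
    h_prev, h = 1.0, 2.0 * z
    if n == 0:
        return h_prev
    for k in range(2, n + 1):
        h_prev, h = h, 2.0 * z * h - 2.0 * (k - 1) * h_prev
    return h


def phi_hat(n, x):
    """Dimensionless stationary wave function (formula (35) of B_V with beta = 1)."""
    norm = math.pi ** -0.25 / math.sqrt(2.0**n * math.factorial(n))
    return norm * math.exp(-0.5 * x * x) * hermite_value(n, x)


def phi_bar_hat(n, p, half_width=12.0, steps=2400):
    """Fourier transform (9) of phi_hat, estimated with a Riemann sum."""
    dx = 2.0 * half_width / steps
    total = 0j
    for k in range(steps + 1):
        x = -half_width + k * dx
        total += cmath.exp(-1j * p * x) * phi_hat(n, x)
    return total * dx / math.sqrt(2.0 * math.pi)


for n in range(4):
    p = 0.8
    print(n, phi_bar_hat(n, p), (-1j) ** n * phi_hat(n, p))
```

The two printed columns should agree to within the quadrature error, including the alternation between real and imaginary values produced by the factor $(-i)^n$.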


2. Discussion 

Consider a particle in the state $|\varphi_n\rangle$. When the position of the particle
is measured, one has a probability $\rho_n(x)\, dx$ of finding a result between $x$ and $x + dx$,
where $\rho_n(x)$ is given by:

$$\rho_n(x) = |\varphi_n(x)|^2 \tag{22}$$

Similarly, in a measurement of the momentum of the particle, one has a probability
$\bar{\rho}_n(p)\, dp$ of finding a result between $p$ and $p + dp$, with:

$$\bar{\rho}_n(p) = |\bar{\varphi}_n(p)|^2 \tag{23}$$

Relation (21) then yields:

$$\bar{\rho}_n(p) = \frac{1}{m\omega}\; \rho_n\!\left( x = \frac{p}{m\omega} \right) \tag{24}$$

which shows that the momentum distribution in a stationary state has the same form
as the position distribution.

We see, for example (cf. fig. 6 of chapter V), that if $n$ is large, $\bar\rho_n(p)$ has a peak
at each of the two values:

$$p = \pm\, m\omega\, x_M = \pm\, p_M \tag{25}$$

where $p_M$ is the maximum momentum of the classical particle moving in the
potential well with an energy $E_n$. An argument analogous to the one set forth
at the end of §C-2 of chapter V enables us to understand this result. When the
momentum of the classical particle is equal to $\pm p_M$, its acceleration is zero (its
velocity is stationary), and the values $\pm p_M$ of the momentum are, averaging over
time, the most probable ones. Comment (i) of §D-1 of chapter V concerning the
probability density $\rho_n(x)$ can easily be transposed to this context; for example, when
$n$ is large, the root-mean-square deviation $\Delta P$ can be interpreted as being the
order of magnitude of the distance between the peaks of $\bar\rho_n(p)$ situated at $p = \pm p_M$.

It is also possible to understand directly from figure 6-a of chapter V why
these values of the momentum are highly probable when $n$ is large. The wave
function then performs a large number of oscillations between the two peaks,
analogous to those of a sinusoid. This happens because the differential
equation for the wave function [cf. formula (A-17) of chapter V], when $E \gg m\omega^{2}x^{2}/2$,
becomes:

$$\frac{d^{2}}{dx^{2}}\,\varphi(x) + \frac{2mE}{\hbar^{2}}\,\varphi(x) \simeq 0 \tag{26}$$

which yields, according to the definition of $p_M$:

$$\varphi(x) \simeq A\,e^{i p_M x/\hbar} + A'\,e^{-i p_M x/\hbar} \tag{27}$$

The wave function (when $n$ is large) therefore looks like a sinusoid of wavelength $h/p_M$
over a relatively large region of the $Ox$ axis. This sinusoid can be considered to be
the sum of two progressive waves [cf. (27)] associated with the two opposite
momenta $\pm p_M$ (corresponding to the to-and-fro motion of the particle in the well).
It is not surprising, therefore, that the probability density $\bar\rho_n(p)$ should be large
in the neighborhood of the values $p = \pm p_M$.

An analogous argument also enables us to understand the order of magnitude
of the product $\Delta X\cdot\Delta P$. This product is equal to [cf. chap. V, relations (D-6),
(D-7) and (D-9)]:

$$\Delta X\cdot\Delta P = \left(n + \frac{1}{2}\right)\hbar = \frac{x_M\,p_M}{2} \tag{28}$$

When $n$ increases, the amplitudes $x_M$ and $p_M$ of the oscillations increase, and the
product $\Delta X\cdot\Delta P$ takes on values much greater than its minimum value $\hbar/2$. We
might wonder why this is the case, since we have seen in several examples that
when the width $\Delta X$ of a function increases, the width $\Delta P$ of its Fourier transform
decreases. This is indeed what would happen for the functions $\varphi_n(x)$ if, in the
interval $-x_M \leq x \leq +x_M$ where they take on non-negligible values, they varied
slowly, reaching, for example, a single maximum or minimum. This is in fact the
case for small values of $n$, for which the value of the product $\Delta X\cdot\Delta P$ is indeed
near its minimum. However, when $n$ is large, the functions $\varphi_n(x)$ perform numerous
oscillations in the interval $-x_M \leq x \leq +x_M$, where they have $n$ zeros. One
can therefore associate with them wavelengths of the order of $\lambda \simeq x_M/n \simeq \Delta X/n$,
corresponding to momenta of the particle situated in a domain of dimension $\Delta P$
given by:

$$\Delta P \simeq \frac{h}{\lambda} \simeq \frac{nh}{\Delta X} \tag{29}$$

We thus find again that:

$$\Delta X\cdot\Delta P \simeq nh \tag{30}$$

This situation is somewhat analogous to the one studied in §1 of complement A_III
in connection with the infinite one-dimensional well.
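The qualitative statements of this discussion (outermost peaks close to the classical turning points, $n$ zeros inside the well, $\Delta X\cdot\Delta P = (n + 1/2)\hbar$) can be illustrated numerically. The sketch below is a minimal check under stated assumptions: it works in reduced units where $\hbar = m = \omega = 1$, so that $x_M = \sqrt{2n+1}$ and, by (24), $\Delta\hat P = \Delta\hat X$; the grid parameters and the function names are hypothetical choices of ours.

```python
import math


def hermite_function(n, x):
    """Normalized reduced eigenfunction phi_n(x), built by upward recurrence in n."""
    prev = 0.0
    cur = math.pi ** -0.25 * math.exp(-x * x / 2.0)
    for k in range(n):
        prev, cur = cur, math.sqrt(2.0 / (k + 1)) * x * cur - math.sqrt(k / (k + 1)) * prev
    return cur


def scan_state(n, steps=8000, margin=6.0):
    """Return (number of zeros, <x^2>, position of highest peak for x > 0, x_M)."""
    x_max = math.sqrt(2 * n + 1)  # classical amplitude x_M in reduced units
    half_width = x_max + margin   # grid reaches well past the turning points
    h = 2.0 * half_width / steps
    zeros, last_sign = 0, 0
    mean_x2, best_x, best_density = 0.0, 0.0, 0.0
    for j in range(steps + 1):
        x = -half_width + j * h
        phi = hermite_function(n, x)
        density = phi * phi
        mean_x2 += x * x * density * h  # <phi_n| X^2 |phi_n>, equal to (Delta X)^2
        sign = (phi > 0) - (phi < 0)
        if sign != 0:
            if last_sign != 0 and sign != last_sign:
                zeros += 1  # one sign change per simple zero
            last_sign = sign
        if x > 0 and density > best_density:
            best_x, best_density = x, density
    return zeros, mean_x2, best_x, x_max


if __name__ == "__main__":
    zeros, mean_x2, peak, x_max = scan_state(30)
    print("zeros:", zeros)
    print("Delta X * Delta P in units of hbar:", mean_x2)
    print("highest peak at", peak, "turning point at", x_max)
```

Since $\langle X\rangle = 0$ in a stationary state, the computed $\langle \hat X^{2}\rangle$ is directly $(\Delta\hat X)^{2}$, and the product $\Delta X\cdot\Delta P$ in units of $\hbar$ is the same number.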
THE ISOTROPIC THREE-DIMENSIONAL HARMONIC OSCILLATOR


1. The Hamiltonian operator 

2. Separation of variables in Cartesian coordinates 

3. Degeneracy of the energy levels 

In chapter V, we studied the one-dimensional harmonic oscillator. We shall show 
here how we can use the results of this study to treat the three-dimensional harmonic 
oscillator. 


1. The Hamiltonian operator 

Consider a spinless particle of mass m which can move in three-dimensional 
space. The particle is subject to a central force (that is, a force which is constantly 
directed towards the coordinate origin O ) whose absolute value is proportional 
to the distance of the particle from the point O : 

$$\mathbf F = -\,k\,\mathbf r \tag{1}$$

($k$ is a positive constant).

This force field is derived from the potential energy:

$$V(\mathbf r) = \frac{1}{2}\,k\,\mathbf r^{2} = \frac{1}{2}\,m\omega^{2}\,\mathbf r^{2} \tag{2}$$

where the angular frequency $\omega$ is defined as for the one-dimensional harmonic
oscillator:

$$\omega = \sqrt{\frac{k}{m}} \tag{3}$$

The classical Hamiltonian is therefore:

$$\mathscr{H}(\mathbf r, \mathbf p) = \frac{\mathbf p^{2}}{2m} + \frac{1}{2}\,m\omega^{2}\,\mathbf r^{2} \tag{4}$$

From this we immediately deduce the Hamiltonian operator, using the quantization
rules (chap. III, §B-5):

$$H = \frac{\mathbf P^{2}}{2m} + \frac{1}{2}\,m\omega^{2}\,\mathbf R^{2} \tag{5}$$

Since the Hamiltonian $H$ is time-independent, we shall solve its eigenvalue equation:

$$H\,|\psi\rangle = E\,|\psi\rangle \tag{6}$$

where $|\psi\rangle$ belongs to the state space $\mathscr{E}_{\mathbf r}$ of a particle in three-dimensional space.

COMMENT:

Since $V(\mathbf r)$ depends only on the distance $r = |\mathbf r|$ of the particle from
the origin [$V(\mathbf r)$ is consequently invariant under an arbitrary rotation], this
harmonic oscillator is said to be isotropic. Nevertheless, the calculations
which follow can easily be generalized to the case of an anisotropic harmonic
oscillator, for which:

$$V(\mathbf r) = \frac{m}{2}\left(\omega_x^{2}\,x^{2} + \omega_y^{2}\,y^{2} + \omega_z^{2}\,z^{2}\right) \tag{7}$$

where the three constants $\omega_x$, $\omega_y$ and $\omega_z$ are different.


2. Separation of the variables in Cartesian coordinates 

Recall that the state space $\mathscr{E}_{\mathbf r}$ can be considered (cf. chap. II, §F) to be the
tensor product:

$$\mathscr{E}_{\mathbf r} = \mathscr{E}_x \otimes \mathscr{E}_y \otimes \mathscr{E}_z \tag{8}$$

where $\mathscr{E}_x$ is the state space of a particle moving along $Ox$, that is, the space
associated with the wave functions $\varphi(x)$. $\mathscr{E}_y$ and $\mathscr{E}_z$ are defined analogously.

Now, expression (5) for the Hamiltonian $H$ can be written in the form:

$$H = \frac{1}{2m}\left(P_x^{2} + P_y^{2} + P_z^{2}\right) + \frac{1}{2}\,m\omega^{2}\left(X^{2} + Y^{2} + Z^{2}\right) = H_x + H_y + H_z \tag{9}$$

with:

$$H_x = \frac{P_x^{2}}{2m} + \frac{1}{2}\,m\omega^{2}X^{2} \tag{10}$$

and similar definitions for $H_y$ and $H_z$. $H_x$ is a function only of $X$ and $P_x$: $H_x$ is
therefore the extension into $\mathscr{E}_{\mathbf r}$ of an operator which actually acts in $\mathscr{E}_x$. Similarly,
$H_y$ and $H_z$ act only in $\mathscr{E}_y$ and $\mathscr{E}_z$ respectively. In $\mathscr{E}_x$, $H_x$ is a one-dimensional
harmonic oscillator Hamiltonian. The same is true for $H_y$ and $H_z$ in $\mathscr{E}_y$ and $\mathscr{E}_z$.

$H_x$, $H_y$ and $H_z$ commute. Each of them therefore commutes with their
sum $H$. Consequently, the eigenvalue equation (6) can be solved by seeking the
eigenvectors of $H$ which are also eigenvectors of $H_x$, $H_y$ and $H_z$. Now, we already
know the eigenvectors and eigenvalues of $H_x$ in $\mathscr{E}_x$, as well as those of $H_y$ in $\mathscr{E}_y$
and of $H_z$ in $\mathscr{E}_z$:

$$H_x\,|\varphi_{n_x}\rangle = \left(n_x + \tfrac{1}{2}\right)\hbar\omega\,|\varphi_{n_x}\rangle \;;\qquad |\varphi_{n_x}\rangle \in \mathscr{E}_x \tag{11-a}$$

$$H_y\,|\varphi_{n_y}\rangle = \left(n_y + \tfrac{1}{2}\right)\hbar\omega\,|\varphi_{n_y}\rangle \;;\qquad |\varphi_{n_y}\rangle \in \mathscr{E}_y \tag{11-b}$$

$$H_z\,|\varphi_{n_z}\rangle = \left(n_z + \tfrac{1}{2}\right)\hbar\omega\,|\varphi_{n_z}\rangle \;;\qquad |\varphi_{n_z}\rangle \in \mathscr{E}_z \tag{11-c}$$

($n_x$, $n_y$ and $n_z$ are positive integers or zero). From this we deduce (cf. chap. II, §F)
that the eigenstates common to $H$, $H_x$, $H_y$ and $H_z$ are of the form:

$$|\psi_{n_x,n_y,n_z}\rangle = |\varphi_{n_x}\rangle\,|\varphi_{n_y}\rangle\,|\varphi_{n_z}\rangle \tag{12}$$

According to equations (9) and (11):

$$H\,|\psi_{n_x,n_y,n_z}\rangle = \left(n_x + n_y + n_z + \tfrac{3}{2}\right)\hbar\omega\,|\psi_{n_x,n_y,n_z}\rangle \tag{13}$$

that is, the eigenvectors of $H$ are seen to be tensor products of eigenvectors of $H_x$,
$H_y$ and $H_z$ respectively, and the eigenvalues of $H$, to be sums of eigenvalues of
these three operators.

According to equation (13), the energy levels $E_n$ of the isotropic
three-dimensional harmonic oscillator are of the form:

$$E_n = \left(n + \tfrac{3}{2}\right)\hbar\omega \tag{14}$$

with:

$$n \text{ a positive integer or zero} \tag{15}$$

since $n$ is the sum $n_x + n_y + n_z$ of three numbers, each of which can take on any
non-negative integral value.

Furthermore, formula (12) enables us to deduce the properties of the
vectors $|\psi_{n_x,n_y,n_z}\rangle$, common eigenstates of $H$, $H_x$, $H_y$ and $H_z$, from those derived
in §C-1 of chapter V for $|\varphi_{n_x}\rangle$ (which are also valid for $|\varphi_{n_y}\rangle$ and $|\varphi_{n_z}\rangle$).

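Let us introduce three pairs of creation and annihilation operators:

$$a_x = \sqrt{\frac{m\omega}{2\hbar}}\;X + \frac{i}{\sqrt{2m\hbar\omega}}\;P_x \qquad a_x^{\dagger} = \sqrt{\frac{m\omega}{2\hbar}}\;X - \frac{i}{\sqrt{2m\hbar\omega}}\;P_x \tag{16-a}$$

$$a_y = \sqrt{\frac{m\omega}{2\hbar}}\;Y + \frac{i}{\sqrt{2m\hbar\omega}}\;P_y \qquad a_y^{\dagger} = \sqrt{\frac{m\omega}{2\hbar}}\;Y - \frac{i}{\sqrt{2m\hbar\omega}}\;P_y \tag{16-b}$$

$$a_z = \sqrt{\frac{m\omega}{2\hbar}}\;Z + \frac{i}{\sqrt{2m\hbar\omega}}\;P_z \qquad a_z^{\dagger} = \sqrt{\frac{m\omega}{2\hbar}}\;Z - \frac{i}{\sqrt{2m\hbar\omega}}\;P_z \tag{16-c}$$

These operators are the extensions into $\mathscr{E}_{\mathbf r}$ of operators acting in $\mathscr{E}_x$, $\mathscr{E}_y$ and $\mathscr{E}_z$.
The canonical commutation relations between the components of $\mathbf R$ and $\mathbf P$ imply
that the only non-zero commutators of the six operators defined in (16) are:

$$[a_x, a_x^{\dagger}] = [a_y, a_y^{\dagger}] = [a_z, a_z^{\dagger}] = 1 \tag{17}$$

Note that two operators with different indices always commute, as is logical
because they act in different spaces. The action of the operators $a_x$ and $a_x^{\dagger}$ on
the states $|\psi_{n_x,n_y,n_z}\rangle$ is given by the formulas:

$$a_x\,|\psi_{n_x,n_y,n_z}\rangle = \left(a_x\,|\varphi_{n_x}\rangle\right)|\varphi_{n_y}\rangle\,|\varphi_{n_z}\rangle = \sqrt{n_x}\;|\varphi_{n_x-1}\rangle\,|\varphi_{n_y}\rangle\,|\varphi_{n_z}\rangle = \sqrt{n_x}\;|\psi_{n_x-1,n_y,n_z}\rangle \tag{18-a}$$

$$a_x^{\dagger}\,|\psi_{n_x,n_y,n_z}\rangle = \left(a_x^{\dagger}\,|\varphi_{n_x}\rangle\right)|\varphi_{n_y}\rangle\,|\varphi_{n_z}\rangle = \sqrt{n_x+1}\;|\varphi_{n_x+1}\rangle\,|\varphi_{n_y}\rangle\,|\varphi_{n_z}\rangle = \sqrt{n_x+1}\;|\psi_{n_x+1,n_y,n_z}\rangle \tag{18-b}$$

For $a_y$, $a_y^{\dagger}$ and $a_z$, $a_z^{\dagger}$, we have analogous relations.

We also know [cf. equation (C-13) of chapter V] that:

$$|\varphi_{n_x}\rangle = \frac{1}{\sqrt{n_x!}}\,(a_x^{\dagger})^{n_x}\,|\varphi_0\rangle \tag{19}$$

where $|\varphi_0\rangle$ is the vector of $\mathscr{E}_x$ which satisfies the condition:

$$a_x\,|\varphi_0\rangle = 0 \tag{20}$$

In $\mathscr{E}_y$ and $\mathscr{E}_z$, there are analogous expressions for $|\varphi_{n_y}\rangle$ and $|\varphi_{n_z}\rangle$. Consequently,
according to (12):

$$|\psi_{n_x,n_y,n_z}\rangle = \frac{1}{\sqrt{n_x!\,n_y!\,n_z!}}\,(a_x^{\dagger})^{n_x}\,(a_y^{\dagger})^{n_y}\,(a_z^{\dagger})^{n_z}\,|\psi_{0,0,0}\rangle \tag{21}$$

where $|\psi_{0,0,0}\rangle$ is the tensor product of the ground states of the three
one-dimensional oscillators, so that:

$$a_x\,|\psi_{0,0,0}\rangle = a_y\,|\psi_{0,0,0}\rangle = a_z\,|\psi_{0,0,0}\rangle = 0 \tag{22}$$

Finally, recall that, since $|\psi_{n_x,n_y,n_z}\rangle$ is a tensor product, the associated wave
function is of the form:

$$\langle \mathbf r\,|\,\psi_{n_x,n_y,n_z}\rangle = \varphi_{n_x}(x)\,\varphi_{n_y}(y)\,\varphi_{n_z}(z) \tag{23}$$

where $\varphi_{n_x}$, $\varphi_{n_y}$ and $\varphi_{n_z}$ are stationary wave functions of the one-dimensional
harmonic oscillator (chap. V, §C-2). For example:

$$\langle \mathbf r\,|\,\psi_{0,0,0}\rangle = \left(\frac{m\omega}{\pi\hbar}\right)^{3/4} e^{-\frac{m\omega}{2\hbar}\left(x^{2} + y^{2} + z^{2}\right)} \tag{24}$$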

3. Degeneracy of the energy levels 

We showed in §B-3 of chapter V that $H_x$ constitutes a C.S.C.O. in $\mathscr{E}_x$; the same
is true for $H_y$ in $\mathscr{E}_y$ and for $H_z$ in $\mathscr{E}_z$. According to §F of chapter II, $\{H_x, H_y, H_z\}$
is thus a C.S.C.O. in $\mathscr{E}_{\mathbf r}$. Therefore, there exists (to within a constant factor) a unique
ket $|\psi_{n_x,n_y,n_z}\rangle$ of $\mathscr{E}_{\mathbf r}$ corresponding to a given set of eigenvalues of $H_x$, $H_y$ and $H_z$,
that is, to fixed non-negative integers $n_x$, $n_y$ and $n_z$.

However, $H$ alone does not form a C.S.C.O., since the energy levels $E_n$
are degenerate. If we choose an eigenvalue of $H$, $E_n = (n + 3/2)\hbar\omega$ (which
amounts to fixing a non-negative integer $n$), all the kets of the $\{\,|\psi_{n_x,n_y,n_z}\rangle\,\}$
basis which satisfy:

$$n_x + n_y + n_z = n \tag{25}$$

are eigenvectors of $H$ with the eigenvalue $E_n$.

The degree of degeneracy $g_n$ of $E_n$ is therefore equal to the number of
different sets $\{n_x, n_y, n_z\}$ which satisfy condition (25). To find $g_n$, we can
proceed as follows: with $n$ fixed, choose $n_x$ first, giving it one of the values:

$$n_x = 0, 1, 2, \ldots, n \tag{26}$$

With $n_x$ thus fixed, we must have:

$$n_y + n_z = n - n_x \tag{27}$$

There are then $(n - n_x + 1)$ possibilities for the pair $\{n_y, n_z\}$:

$$\{n_y, n_z\} = \{0,\ n - n_x\},\ \{1,\ n - n_x - 1\},\ \ldots,\ \{n - n_x,\ 0\} \tag{28}$$

The degree of degeneracy $g_n$ of $E_n$ is therefore equal to:

$$g_n = \sum_{n_x = 0}^{n} (n - n_x + 1) \tag{29}$$

This sum is easy to calculate:

$$g_n = (n + 1)\sum_{n_x = 0}^{n} 1 \;-\; \sum_{n_x = 0}^{n} n_x = \frac{(n + 1)(n + 2)}{2} \tag{30}$$

Consequently, only the ground state $E_0 = \frac{3}{2}\hbar\omega$ is non-degenerate.

COMMENT:

The kets $|\psi_{n_x,n_y,n_z}\rangle$ constitute an orthonormal system of eigenvectors
of $H$, which forms a basis in $\mathscr{E}_{\mathbf r}$. Since the eigenvalues $E_n$ of $H$ are degenerate,
this system is not unique. We shall see in particular in complement B_VII that, in
order to solve equation (6), it is possible to use a set of constants of the
motion other than $\{H_x, H_y, H_z\}$: thus we obtain a basis of $\mathscr{E}_{\mathbf r}$ which is
different from the preceding one, although still consisting of eigenvectors
of $H$. The kets of this new basis are orthonormal linear combinations of
the $|\psi_{n_x,n_y,n_z}\rangle$ belonging to each of the eigensubspaces of $H$, that is,
corresponding to a fixed value of the sum $n_x + n_y + n_z$.
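The counting argument of §3 (choose $n_x$ first, then count the pairs $\{n_y, n_z\}$) can be reproduced by direct enumeration. The following sketch is only an illustration with hypothetical function names; it lists the triples satisfying condition (25) and compares their number with the closed form (30).

```python
def degenerate_triples(n):
    """All (n_x, n_y, n_z) of non-negative integers with n_x + n_y + n_z = n."""
    return [
        (nx, ny, n - nx - ny)
        for nx in range(n + 1)           # n_x = 0, 1, ..., n, as in (26)
        for ny in range(n - nx + 1)      # (n - n_x + 1) choices for the pair, as in (28)
    ]


def degeneracy(n):
    """Degree of degeneracy g_n obtained by brute-force counting."""
    return len(degenerate_triples(n))


if __name__ == "__main__":
    for n in range(5):
        print(n, degeneracy(n), (n + 1) * (n + 2) // 2)
```

For $n = 1$, for instance, the enumeration returns the three triples $(0,0,1)$, $(0,1,0)$ and $(1,0,0)$, in agreement with $g_1 = 3$.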
Complement F_V

A CHARGED HARMONIC OSCILLATOR
IN A UNIFORM ELECTRIC FIELD


1. Eigenvalue equation of $H'(\mathcal{E})$ in the $\{\,|x\rangle\,\}$ representation

2. Discussion

a. Electrical susceptibility of an elastically bound electron

b. Interpretation of the energy shift

3. Use of the translation operator

The one-dimensional harmonic oscillator studied in chapter V consists of a
particle of mass $m$ having a potential energy:

$$V(X) = \frac{1}{2}\,m\omega^{2}X^{2} \tag{1}$$

Assume, in addition, that this particle has a charge $q$ and that it is placed in a
uniform electric field $\mathcal{E}$ parallel to $Ox$. What are its new stationary states and
the corresponding energies?

The classical potential energy of a particle placed in a uniform field $\mathcal{E}$ is
equal to *:

$$w(\mathcal{E}) = -\,q\,\mathcal{E}\,x \tag{2}$$

To obtain the quantum mechanical Hamiltonian operator $H'(\mathcal{E})$ in the presence
of the field $\mathcal{E}$, we must therefore add to the potential energy (1) of the harmonic
oscillator the term:

$$W(\mathcal{E}) = -\,q\,\mathcal{E}\,X \tag{3}$$

which gives:

$$H'(\mathcal{E}) = \frac{P^{2}}{2m} + \frac{1}{2}\,m\omega^{2}X^{2} - q\,\mathcal{E}\,X \tag{4}$$

We must now find the eigenvalues and eigenvectors of this operator.
To this end, we shall use two different methods. First, we shall consider directly
the eigenvalue equation of $H'(\mathcal{E})$ in the $\{\,|x\rangle\,\}$ representation; it is very simple
to interpret the results obtained. Then we shall show how the problem can be
solved in a purely operator formalism.


1. Eigenvalue equation of $H'(\mathcal{E})$
in the $\{\,|x\rangle\,\}$ representation

Let $|\varphi'\rangle$ be an eigenvector of $H'(\mathcal{E})$:

$$H'(\mathcal{E})\,|\varphi'\rangle = E'\,|\varphi'\rangle \tag{5}$$


* We use the convention of zero potential energy for the particle at $x = 0$.
Using (4), we can write this equation in the $\{\,|x\rangle\,\}$ representation:

$$\left[-\frac{\hbar^{2}}{2m}\frac{d^{2}}{dx^{2}} + \frac{1}{2}\,m\omega^{2}x^{2} - q\,\mathcal{E}\,x\right]\varphi'(x) = E'\,\varphi'(x) \tag{6}$$

Completing the square with respect to $x$ on the left-hand side of (6), we obtain:

$$\left[-\frac{\hbar^{2}}{2m}\frac{d^{2}}{dx^{2}} + \frac{1}{2}\,m\omega^{2}\left(x - \frac{q\,\mathcal{E}}{m\omega^{2}}\right)^{2} - \frac{q^{2}\mathcal{E}^{2}}{2m\omega^{2}}\right]\varphi'(x) = E'\,\varphi'(x) \tag{7}$$

Let us now replace the variable $x$ by a new variable $u$, setting:

$$u = x - \frac{q\,\mathcal{E}}{m\omega^{2}} \tag{8}$$

Through the intermediary of $x$, $\varphi'$ is then a function of $u$, and equation (7)
becomes:

$$\left[-\frac{\hbar^{2}}{2m}\frac{d^{2}}{du^{2}} + \frac{1}{2}\,m\omega^{2}u^{2}\right]\varphi'(u) = E''\,\varphi'(u) \tag{9}$$

with:

$$E'' = E' + \frac{q^{2}\mathcal{E}^{2}}{2m\omega^{2}} \tag{10}$$

Thus we see that equation (9) is the same as the one which enabled us to obtain
the stationary states of the harmonic oscillator in the absence of an electric field
in the $\{\,|x\rangle\,\}$ representation [cf. chap. V, relation (A-17)]. Therefore, we have
already solved this equation, and we know that the acceptable values of $E''$ are
given by:

$$E''_n = \left(n + \frac{1}{2}\right)\hbar\omega \tag{11}$$

(where $n$ is a positive integer or zero).

Relations (10) and (11) show that in the presence of the electric field, the
energies $E'$ of the stationary states of the harmonic oscillator are modified:

$$E'_n(\mathcal{E}) = \left(n + \frac{1}{2}\right)\hbar\omega - \frac{q^{2}\mathcal{E}^{2}}{2m\omega^{2}} \tag{12}$$

The entire spectrum of the harmonic oscillator is therefore shifted by the quantity
$q^{2}\mathcal{E}^{2}/2m\omega^{2}$.

Now, let us show that the eigenfunctions $\varphi'_n(x)$ associated with the energies (12)
can all be obtained from the $\varphi_n(x)$ by a translation along $Ox$. The solution of (9)
corresponding to a given value of $n$ is $\varphi_n(u)$ [where the function $\varphi_n$ is given, for
example, by formula (35) of complement B_V]. According to (8), we have:

$$\varphi'_n(x) = \varphi_n\!\left(x - \frac{q\,\mathcal{E}}{m\omega^{2}}\right) \tag{13}$$

This translation comes from the fact that the electric field exerts a force on the
particle*.


* It can be seen from (13) that the function $\varphi'_n(x)$ is obtained from $\varphi_n(x)$ by a translation of $q\,\mathcal{E}/m\omega^{2}$;
if the product $q\,\mathcal{E}$ is positive, the translation is performed in the positive $x$-direction, which is indeed
the direction of the force exerted by the electric field.


COMMENT:

The change of variable given in (8) allows us to reduce the case of an
arbitrary electric field to an already solved problem, the one in which $\mathcal{E}$ was
zero. The only effect of the electric field is to change the $x$-origin [cf. (13)]
and the energy origin [cf. (12)]. This result can easily be understood
graphically (cf. fig. 1). When $\mathcal{E}$ is zero, the potential energy $V(x)$ is
represented by a parabola centered at $O$. When $\mathcal{E}$ is not zero, it is necessary
to add to this potential energy the quantity $-q\,\mathcal{E}\,x$, which corresponds to
the dashed line in this figure; the curve representing $V + W$ is again a
parabola. Thus, in the presence of the field $\mathcal{E}$ we still have a harmonic
oscillator. Since the two parabolas are superposable, they correspond to the
same value of $\omega$ and therefore to the same energy difference between the levels.
However, their minima $O$ and $O'$ are different, as is consistent with
formulas (12) and (13).
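Results (12) and (13) can be tested without solving anything new: it suffices to insert the translated function into equation (6) and to evaluate the residual. The sketch below does this by finite differences. It is a minimal illustration under our own assumptions: reduced units $\hbar = m = \omega = 1$, a single parameter `f` standing for the product $q\,\mathcal{E}$, and hypothetical function names.

```python
import math


def hermite_function(n, x):
    """Normalized field-free eigenfunction phi_n(x) in units hbar = m = omega = 1."""
    prev = 0.0
    cur = math.pi ** -0.25 * math.exp(-x * x / 2.0)
    for k in range(n):
        prev, cur = cur, math.sqrt(2.0 / (k + 1)) * x * cur - math.sqrt(k / (k + 1)) * prev
    return cur


def residual(n, f, x, h=1e-3):
    """Residual of equation (6) for phi'_n(x) = phi_n(x - f) and E'_n = n + 1/2 - f^2/2."""
    def shifted(y):
        return hermite_function(n, y - f)  # translation (13), with q*E/(m*omega^2) = f

    second_derivative = (shifted(x + h) - 2.0 * shifted(x) + shifted(x - h)) / (h * h)
    energy = n + 0.5 - 0.5 * f * f         # shifted level (12)
    return -0.5 * second_derivative + (0.5 * x * x - f * x - energy) * shifted(x)


def worst_residual(f=0.8, levels=4, points=(-2.0, -0.5, 0.3, 1.7, 3.0)):
    """Largest residual over a few levels and sample points."""
    return max(abs(residual(n, f, x)) for n in range(levels) for x in points)


if __name__ == "__main__":
    print(worst_residual())
```

The residual is limited only by the finite-difference error (of order $h^{2}$), which is consistent with the statement that the field merely displaces the origin of $x$ and the origin of the energies.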


2. Discussion 

a. ELECTRICAL SUSCEPTIBILITY OF AN ELASTICALLY BOUND ELECTRON 

In certain situations, the electrons of an atom or a molecule behave, to a good
approximation, as if they were "elastically bound", that is, as if each of them
were a harmonic oscillator. We shall prove this for atoms in complement A_XIII,
using time-dependent perturbation theory.

The contribution of each electron to the electric dipole moment of the atom
is described by the operator:

$$D = q\,X \tag{14}$$

where $q$ is the charge of the electron ($q < 0$) and $X$ the corresponding position
observable. We are going to examine the mean value of $D$ in the model of the
elastically bound electron.

In the absence of an electric field, the mean value of the electric dipole
moment in a stationary state of the oscillator is zero:

$$\langle D\rangle = q\,\langle\varphi_n|\,X\,|\varphi_n\rangle = 0 \tag{15}$$

[see formulas (D-1) of chapter V].

Now, let us assume that the field $\mathcal{E}$ is turned on so slowly that the state
of the electron changes gradually from $|\varphi_n\rangle$ to $|\varphi'_n\rangle$ ($n$ remaining the same).
The mean dipole moment is now different from zero, since:

$$\langle D\rangle' = q\,\langle\varphi'_n|\,X\,|\varphi'_n\rangle = q\int_{-\infty}^{+\infty} dx\; x\,|\varphi'_n(x)|^{2} \tag{16}$$

Using (8) and (13), we obtain:

$$\langle D\rangle' = q\int_{-\infty}^{+\infty} u\,|\varphi_n(u)|^{2}\,du + \frac{q^{2}\mathcal{E}}{m\omega^{2}}\int_{-\infty}^{+\infty} |\varphi_n(u)|^{2}\,du = \frac{q^{2}\mathcal{E}}{m\omega^{2}} \tag{17}$$

since the first integral is zero by symmetry. $\langle D\rangle'$ is therefore proportional to $\mathcal{E}$.
In this model, the electrical susceptibility of the atomic electron under consideration
is equal to:

$$\chi = \frac{\langle D\rangle'}{\mathcal{E}} = \frac{q^{2}}{m\omega^{2}} \tag{18}$$

It is positive, whatever the sign of $q$.

It is simple to interpret result (18) physically. The effect of the electric
field is to shift the classical equilibrium position of the electron, that is, the mean
value of its position in quantum mechanics [see formula (13)]. This results in the
appearance of an induced dipole moment. $\chi$ decreases when $\omega$ increases because
the oscillator is less easily deformable when the restoring force (which is
proportional to $\omega^{2}$) is larger.


b. INTERPRETATION OF THE ENERGY SHIFT 


Using the model just described, we can interpret formula (12) by calculating
the variation in the mean kinetic and potential energies of the electron when it passes
from the state $|\varphi_n\rangle$ to the state $|\varphi'_n\rangle$.

The variation in the kinetic energy is, in fact, zero (as can be understood
intuitively from figure 1, for example):

$$\left\langle\frac{P^{2}}{2m}\right\rangle' - \left\langle\frac{P^{2}}{2m}\right\rangle = -\frac{\hbar^{2}}{2m}\left[\int_{-\infty}^{+\infty}\varphi'^{*}_n(x)\,\frac{d^{2}}{dx^{2}}\,\varphi'_n(x)\,dx - \int_{-\infty}^{+\infty}\varphi^{*}_n(x)\,\frac{d^{2}}{dx^{2}}\,\varphi_n(x)\,dx\right] = 0 \tag{19}$$

according to formula (13).

The variation in the potential energy can be treated in two terms:

- the first term, $\langle W(\mathcal{E})\rangle'$, corresponds to the electrical potential energy of
the dipole in the field $\mathcal{E}$. Since the dipole is parallel to the field, we have, according to
(17):

$$\langle W(\mathcal{E})\rangle' = -\,\mathcal{E}\,\langle D\rangle' = -\frac{q^{2}\mathcal{E}^{2}}{m\omega^{2}} \tag{20}$$

- the second term, $\langle V(X)\rangle' - \langle V(X)\rangle$, arises from the modification by the electric
field of the wave function of the level labeled by the quantum number $n$.
The "elastic" potential energy of the particle therefore changes by a quantity:

$$\langle V(X)\rangle' - \langle V(X)\rangle = \frac{1}{2}\,m\omega^{2}\left[\int_{-\infty}^{+\infty} x^{2}\,|\varphi'_n(x)|^{2}\,dx - \int_{-\infty}^{+\infty} x^{2}\,|\varphi_n(x)|^{2}\,dx\right] \tag{21}$$

The first integral can be calculated by using (13) and the change of variable (8):

$$\int_{-\infty}^{+\infty} x^{2}\,|\varphi'_n(x)|^{2}\,dx = \int_{-\infty}^{+\infty} u^{2}\,|\varphi_n(u)|^{2}\,du + \frac{2q\,\mathcal{E}}{m\omega^{2}}\int_{-\infty}^{+\infty} u\,|\varphi_n(u)|^{2}\,du + \left(\frac{q\,\mathcal{E}}{m\omega^{2}}\right)^{2}\int_{-\infty}^{+\infty} |\varphi_n(u)|^{2}\,du \tag{22}$$

Since $\varphi_n(u)$ is normalized, and since the integral of $u\,|\varphi_n(u)|^{2}$ is zero by symmetry,
we obtain, finally:

$$\langle V(X)\rangle' - \langle V(X)\rangle = \frac{q^{2}\mathcal{E}^{2}}{2m\omega^{2}} \tag{23}$$

We see why this result should be positive, since the electric field moves the
particle away from the point $O$ and attracts it into a region where the "elastic"
potential energy $V(x)$ is larger.

Adding (20) to (23), we again find that the energy of the state $|\varphi'_n\rangle$ is less
than that of the state $|\varphi_n\rangle$ by $q^{2}\mathcal{E}^{2}/2m\omega^{2}$.
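The energy budget of §2 (induced dipole, field term, elastic term) can be reproduced by straightforward numerical integration over the translated probability density. The sketch below is an illustration only, with assumptions of our own: units $\hbar = m = \omega = q = 1$, so that the parameter `field` plays the role of $\mathcal{E}$ and the translation of (13) is equal to `field`; the function names and the grid are hypothetical.

```python
import math


def hermite_function(n, x):
    """Normalized field-free eigenfunction phi_n(x) in units hbar = m = omega = 1."""
    prev = 0.0
    cur = math.pi ** -0.25 * math.exp(-x * x / 2.0)
    for k in range(n):
        prev, cur = cur, math.sqrt(2.0 / (k + 1)) * x * cur - math.sqrt(k / (k + 1)) * prev
    return cur


def moment(n, shift, power, half_width=14.0, steps=5600):
    """Integral of x**power * |phi_n(x - shift)|^2 dx on a uniform grid."""
    h = 2.0 * half_width / steps
    total = 0.0
    for j in range(steps + 1):
        x = -half_width + j * h
        total += x ** power * hermite_function(n, x - shift) ** 2
    return total * h


def energy_budget(n, field):
    """Return (dipole, field term, elastic term, total shift) for q = 1."""
    dipole = moment(n, field, 1)                                   # <D>', cf. (17)
    field_term = -field * dipole                                   # <W>', cf. (20)
    elastic_term = 0.5 * (moment(n, field, 2) - moment(n, 0.0, 2))  # cf. (21)
    return dipole, field_term, elastic_term, field_term + elastic_term


if __name__ == "__main__":
    print(energy_budget(2, 0.6))
```

With `field = 0.6` one expects a dipole of 0.6, a field term of $-0.36$, an elastic term of $+0.18$ and a total shift of $-0.18$, that is, $-q^{2}\mathcal{E}^{2}/2m\omega^{2}$ in these units.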


3. Use of the translation operator 

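We shall see in this section that, instead of using the $\{\,|x\rangle\,\}$ representation,
as we have done until now, we can argue directly in terms of the operator $H'(\mathcal{E})$
given in (4). More precisely, we are going to show that a unitary transformation
(which corresponds to a translation of the wave function along the $x$-axis)
transforms the operator $H = H'(\mathcal{E} = 0)$ into the operator $H'(\mathcal{E})$ (to within an
additive constant which does not change the eigenvectors). Since the eigenvectors and
eigenvalues of $H$ were determined in chapter V, this approach enables us to solve
our problem.

Therefore, consider the operator:

$$U(\lambda) = e^{-\lambda\,(a - a^{\dagger})} \tag{24}$$

where $\lambda$ is a real constant. Its adjoint $U^{\dagger}(\lambda)$ is:

$$U^{\dagger}(\lambda) = e^{\lambda\,(a - a^{\dagger})} \tag{25}$$

It is clear that:

$$U(\lambda)\,U^{\dagger}(\lambda) = U^{\dagger}(\lambda)\,U(\lambda) = 1 \tag{26}$$

$U(\lambda)$ is therefore a unitary operator. Under the corresponding unitary
transformation, $H$ becomes:

$$\tilde H = U(\lambda)\,H\,U^{\dagger}(\lambda) = \hbar\omega\left[\frac{1}{2} + U(\lambda)\,a^{\dagger}a\,U^{\dagger}(\lambda)\right] \tag{27}$$

We must now calculate the operator:

$$U(\lambda)\,a^{\dagger}a\,U^{\dagger}(\lambda) = \tilde a^{\dagger}\,\tilde a \tag{28}$$

with:

$$\tilde a = U(\lambda)\,a\,U^{\dagger}(\lambda) \qquad \tilde a^{\dagger} = U(\lambda)\,a^{\dagger}\,U^{\dagger}(\lambda) \tag{29}$$

To obtain $\tilde a$ and $\tilde a^{\dagger}$, we use formula (63) of complement B_II (which can be applied
here since the commutator of $a$ and $a^{\dagger}$ is equal to 1), which yields:

$$U(\lambda) = e^{-\lambda a + \lambda a^{\dagger}} = e^{-\lambda a}\,e^{\lambda a^{\dagger}}\,e^{\lambda^{2}/2} \qquad U^{\dagger}(\lambda) = e^{-\lambda a^{\dagger} + \lambda a} = e^{-\lambda a^{\dagger}}\,e^{\lambda a}\,e^{-\lambda^{2}/2} \tag{30}$$

Also, formula (51) of complement B_II enables us to write:

$$[e^{-\lambda a},\, a^{\dagger}] = -\lambda\,e^{-\lambda a} \qquad [e^{\lambda a^{\dagger}},\, a] = -\lambda\,e^{\lambda a^{\dagger}} \tag{31}$$

that is:

$$e^{-\lambda a}\,a^{\dagger}\,e^{\lambda a} = a^{\dagger} - \lambda \qquad e^{\lambda a^{\dagger}}\,a\,e^{-\lambda a^{\dagger}} = a - \lambda \tag{32}$$

Thus it follows that:

$$\tilde a = e^{-\lambda a}\,e^{\lambda a^{\dagger}}\,a\,e^{-\lambda a^{\dagger}}\,e^{\lambda a} = e^{-\lambda a}\,(a - \lambda)\,e^{\lambda a} = a - \lambda \tag{33}$$

and, similarly:

$$\tilde a^{\dagger} = a^{\dagger} - \lambda \tag{34}$$

$\tilde H$ is therefore given by:

$$\tilde H = \hbar\omega\left[\frac{1}{2} + (a^{\dagger} - \lambda)(a - \lambda)\right] = \hbar\omega\left[\frac{1}{2} + a^{\dagger}a - \lambda\,(a + a^{\dagger}) + \lambda^{2}\right] = H - \lambda\hbar\omega\,(a + a^{\dagger}) + \lambda^{2}\hbar\omega \tag{35}$$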
Since $(a + a^{\dagger})$ is proportional to the operator $X$ [formulas (B-1) and (B-7) of
chapter V], it suffices to set:

$$\lambda = \frac{q\,\mathcal{E}}{\omega}\,\frac{1}{\sqrt{2m\hbar\omega}} \tag{36}$$

to obtain:

$$\tilde H = H - q\,\mathcal{E}\,X + \frac{q^{2}\mathcal{E}^{2}}{2m\omega^{2}} = H'(\mathcal{E}) + \frac{q^{2}\mathcal{E}^{2}}{2m\omega^{2}} \tag{37}$$

The two operators $\tilde H$ and $H'(\mathcal{E})$ therefore have the same eigenvectors, and their
eigenvalues differ by $q^{2}\mathcal{E}^{2}/2m\omega^{2}$. Now, we know (cf. complement C_II, §2) that
if the eigenvectors of $H$ are the kets $|\varphi_n\rangle$, those of $\tilde H$ are the kets:

$$|\tilde\varphi_n\rangle = U(\lambda)\,|\varphi_n\rangle \tag{38}$$

and the corresponding eigenvalues of $H$ and $\tilde H$ are the same. The stationary
states $|\varphi'_n\rangle$ of the harmonic oscillator in the presence of the field $\mathcal{E}$ are therefore
the states $|\tilde\varphi_n\rangle$ given by (38). The associated eigenvalue of $H'(\mathcal{E})$ is, according
to (37):

$$E'_n(\mathcal{E}) = \left(n + \frac{1}{2}\right)\hbar\omega - \frac{q^{2}\mathcal{E}^{2}}{2m\omega^{2}} \tag{39}$$

which is the same as formula (12) of the preceding section. Expression (38)
for the eigenvectors can be put into the form:

$$|\varphi'_n\rangle = |\tilde\varphi_n\rangle = e^{-\,i\,\frac{q\mathcal{E}}{m\omega^{2}}\,P/\hbar}\;|\varphi_n\rangle \tag{40}$$

using (24) and (36), as well as formulas (B-1) and (B-7) of chapter V. We interpreted,
in complement E_II, the operator $e^{-iaP/\hbar}$ as being the translation operator over
an algebraic distance $a$ along $Ox$. $|\varphi'_n\rangle$ is therefore the state obtained from
$|\varphi_n\rangle$ by a translation $q\,\mathcal{E}/m\omega^{2}$, just as is indicated by formula (13).
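The operator identity $U(\lambda)\,a\,U^{\dagger}(\lambda) = a - \lambda$ used in this section can be checked with finite matrices in the $\{\,|\varphi_n\rangle\,\}$ basis. The sketch below is a rough illustration and not an exact statement: the basis is truncated at a dimension `dim` of our choosing, the exponential is summed as a Taylor series, and only the upper-left block, where the truncation is harmless for small $\lambda$, is compared. All helper names are hypothetical.

```python
import math


def matmul(A, B):
    """Product of two square matrices stored as lists of rows."""
    cols = list(zip(*B))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in A]


def annihilation(dim):
    """Matrix of a in the truncated number basis: <n-1| a |n> = sqrt(n)."""
    a = [[0.0] * dim for _ in range(dim)]
    for n in range(1, dim):
        a[n - 1][n] = math.sqrt(n)
    return a


def expm(A, terms=60):
    """exp(A) by its Taylor series (adequate here because the norm of A is small)."""
    dim = len(A)
    result = [[float(i == j) for j in range(dim)] for i in range(dim)]
    term = [row[:] for row in result]
    for k in range(1, terms):
        term = [[v / k for v in row] for row in matmul(term, A)]
        result = [[r + t for r, t in zip(rr, tr)] for rr, tr in zip(result, term)]
    return result


def max_deviation(lam=0.3, dim=30, block=8):
    """Largest entry of U a U^dagger - (a - lam) in the upper-left block."""
    a = annihilation(dim)
    # lam * (a^dagger - a): the matrix of a is real, so a^dagger is its transpose
    generator = [[lam * (a[j][i] - a[i][j]) for j in range(dim)] for i in range(dim)]
    U = expm(generator)                        # U(lam) = exp(-lam (a - a^dagger)), cf. (24)
    U_dagger = [list(col) for col in zip(*U)]  # real orthogonal matrix
    a_tilde = matmul(matmul(U, a), U_dagger)
    return max(
        abs(a_tilde[i][j] - (a[i][j] - (lam if i == j else 0.0)))
        for i in range(block)
        for j in range(block)
    )


if __name__ == "__main__":
    print(max_deviation())
```

Entries close to the truncation edge are not expected to satisfy the identity, which is why the comparison is restricted to the first few levels.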


References 

The elastically bound electron: see references of complement A_XIII.
Complement G_V

COHERENT "QUASI-CLASSICAL" STATES
OF THE HARMONIC OSCILLATOR


1. Quasi-classical states

a. Introducing the parameter $\alpha_0$ to characterize a classical motion

b. Conditions defining quasi-classical states

c. Quasi-classical states are eigenvectors of the operator $a$

2. Properties of the $|\alpha\rangle$ states

a. Expansion of $|\alpha\rangle$ on the basis of the stationary states $|\varphi_n\rangle$

b. Possible values of the energy in an $|\alpha\rangle$ state

c. Calculation of $\langle X\rangle$, $\langle P\rangle$, $\Delta X$ and $\Delta P$ in an $|\alpha\rangle$ state

d. The operator $D(\alpha)$; the wave functions $\psi_\alpha(x)$

e. The scalar product of two $|\alpha\rangle$ states. Closure relation

3. Time evolution of a quasi-classical state

a. A quasi-classical state always remains an eigenvector of $a$

b. Evolution of physical properties

c. Motion of the wave packet

4. Example: quantum mechanical treatment of a macroscopic oscillator


The properties of the stationary states $|\varphi_n\rangle$ of the harmonic oscillator were
studied in chapter V; for example, in §D, we saw that the mean values $\langle X\rangle$
and $\langle P\rangle$ of the position and the momentum of the oscillator are zero in such
a state. Now, in classical mechanics, it is well known that the position $x$ and the
momentum $p$ are oscillating functions of time, which always remain zero only
if the energy of the motion is also zero [cf. chap. V, relations (A-5) and (A-8)].
Furthermore, we know that quantum mechanics must yield the same results as
classical mechanics in the limiting case where the harmonic oscillator has an
energy much greater than the quantum $\hbar\omega$ (limit of large quantum numbers).

Thus, we may ask the following question: is it possible to construct quantum
mechanical states leading to physical predictions which are almost identical
to the classical ones, at least for a macroscopic oscillator? We shall see in this
complement that such quantum states exist: they are coherent linear
superpositions of all the states $|\varphi_n\rangle$. We shall call them "quasi-classical states" or
"coherent states of the harmonic oscillator".

The problem we are considering here is of great general interest in quantum
mechanics. As we saw in the introduction to chapter V and in complement A_V,
many physical systems can be likened to a harmonic oscillator, at least to a first
approximation. For all these systems, it is important to understand, in the
framework of quantum mechanics, how to move gradually from the case in which
the results given by the classical approximation are sufficient to the case in
which quantum effects are preponderant. Electromagnetic radiation is a very
important example of such a system. Depending on the experiment, it either
reveals its quantum mechanical nature (as is the case in the experiment discussed
in §A-2-a of chapter I, in which the light intensity is very low) or else can be
treated classically. "Coherent states" of electromagnetic radiation were recently
introduced by Glauber and are now in current use in the domain of quantum optics.

The position, the momentum, and the energy of a harmonic oscillator are
described in quantum mechanics by operators which do not commute; they are
incompatible physical quantities. It is not possible, therefore, to construct a state
in which they are all perfectly well-defined. We shall confine ourselves to looking for
a state vector such that, for all $t$, the mean values $\langle X\rangle$, $\langle P\rangle$ and $\langle H\rangle$ are as
close as possible to the corresponding classical values. This will lead us to a
compromise in which none of these three observables is perfectly known. We
shall see, nevertheless, that the root-mean-square deviations $\Delta X$, $\Delta P$ and $\Delta H$ are,
in the macroscopic limit, completely negligible.


1. Quasi-classical states 


a. INTRODUCING THE PARAMETER $\alpha_0$
TO CHARACTERIZE A CLASSICAL MOTION


The classical equations of motion of a one-dimensional harmonic oscillator,
of mass $m$ and angular frequency $\omega$, are written:

$$\frac{d}{dt}\,x(t) = \frac{1}{m}\,p(t) \tag{1-a}$$

$$\frac{d}{dt}\,p(t) = -\,m\omega^{2}\,x(t) \tag{1-b}$$

The quantum mechanical calculations we shall perform later will be simplified by
the introduction of the dimensionless quantities:

$$\hat x(t) = \beta\,x(t) \qquad \hat p(t) = \frac{1}{\hbar\beta}\,p(t) \tag{2}$$

where:

$$\beta = \sqrt{\frac{m\omega}{\hbar}} \tag{3}$$

Equations (1) can then be written:

$$\frac{d}{dt}\,\hat x(t) = \omega\,\hat p(t) \tag{4-a}$$

$$\frac{d}{dt}\,\hat p(t) = -\,\omega\,\hat x(t) \tag{4-b}$$

The classical state of the harmonic oscillator is determined at the time $t$ when
we know its position $x(t)$ and its momentum $p(t)$, that is, $\hat x(t)$ and $\hat p(t)$. We shall
therefore combine these two real numbers into a single dimensionless complex
number $\alpha(t)$ defined by:

$$\alpha(t) = \frac{1}{\sqrt 2}\left[\hat x(t) + i\,\hat p(t)\right] \tag{5}$$

The set of two equations (4) is equivalent to the single equation:

$$\frac{d}{dt}\,\alpha(t) = -\,i\omega\,\alpha(t) \tag{6}$$

whose solution is:

$$\alpha(t) = \alpha_0\,e^{-i\omega t} \tag{7}$$

where we have set:

$$\alpha_0 = \alpha(0) = \frac{1}{\sqrt 2}\left[\hat x(0) + i\,\hat p(0)\right] \tag{8}$$

Now consider the points $M_0$ and $M$ in the complex plane that correspond
to the complex numbers $\alpha_0$ and $\alpha(t)$ [fig. 1]. $M$ is at $M_0$ at $t = 0$ and describes,
with an angular velocity $-\omega$, a circle centered at $O$ of radius $OM_0$.

FIGURE 1

The point $M(t)$, which corresponds to the complex
number $\alpha(t)$, characterizes the state of the harmonic
oscillator at each instant. $M$ moves in a circle with an
angular velocity $-\omega$. The abscissa and ordinate of $M$
give the position and momentum of the oscillator.


Since, according to (5), the coordinates of $M$ are equal to $\hat x(t)/\sqrt 2$ and $\hat p(t)/\sqrt 2$,
we thus obtain a very simple geometrical representation of the time evolution
of the state of the system. Every possible motion corresponding to given initial
conditions is entirely characterized by the point $M_0$, that is, by the complex
number $\alpha_0$ (the modulus of $\alpha_0$ gives the amplitude of the oscillation and the
argument of $\alpha_0$, its phase). According to (5) and (7), we have:

$$\hat x(t) = \frac{1}{\sqrt 2}\left[\alpha_0\,e^{-i\omega t} + \alpha_0^{*}\,e^{i\omega t}\right] \tag{9-a}$$

$$\hat p(t) = -\frac{i}{\sqrt 2}\left[\alpha_0\,e^{-i\omega t} - \alpha_0^{*}\,e^{i\omega t}\right] \tag{9-b}$$

As for the classical energy $\mathscr{H}$ of the system, it is constant in time and equal to:

$$\mathscr{H} = \frac{1}{2m}\,[p(0)]^{2} + \frac{1}{2}\,m\omega^{2}\,[x(0)]^{2} = \frac{\hbar\omega}{2}\left\{[\hat x(0)]^{2} + [\hat p(0)]^{2}\right\} \tag{10}$$

which yields, taking (8) into account:

$$\mathscr{H} = \hbar\omega\,|\alpha_0|^{2} \tag{11}$$

For a macroscopic oscillator, the energy $\mathscr{H}$ is much greater than the quantum
$\hbar\omega$, so:

$$|\alpha_0| \gg 1 \tag{12}$$
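The classical picture of this paragraph is easy to reproduce numerically. The sketch below is a small illustration with hypothetical function names: it evaluates $\alpha(t) = \alpha_0\,e^{-i\omega t}$, recovers the reduced position and momentum from its real and imaginary parts as in (5), and expresses the energy in units of $\hbar\omega$ as in (11).

```python
import cmath
import math


def alpha(t, alpha0, omega):
    """Classical complex amplitude alpha(t) = alpha0 * exp(-i*omega*t), relation (7)."""
    return alpha0 * cmath.exp(-1j * omega * t)


def reduced_coordinates(a):
    """Reduced position and momentum recovered from alpha, by inverting relation (5)."""
    return math.sqrt(2.0) * a.real, math.sqrt(2.0) * a.imag


def energy_in_quanta(a):
    """Classical energy divided by hbar*omega, relation (11)."""
    return abs(a) ** 2


if __name__ == "__main__":
    alpha0, omega = 3.0 + 4.0j, 2.0
    for t in (0.0, 0.4, 1.3):
        a = alpha(t, alpha0, omega)
        x_hat, p_hat = reduced_coordinates(a)
        print(t, x_hat, p_hat, energy_in_quanta(a))
```

The printed energy stays equal to $|\alpha_0|^{2}$ while the point $(\hat x/\sqrt 2,\ \hat p/\sqrt 2)$ moves around the circle of radius $|\alpha_0|$, in agreement with the geometrical description of figure 1.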


b. CONDITIONS DEFINING QUASI-CLASSICAL STATES 


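We are looking for a quantum mechanical state for which at every instant
the mean values $\langle X\rangle$, $\langle P\rangle$ and $\langle H\rangle$ are practically equal to the values $x$, $p$
and $\mathscr{H}$ which correspond to a given classical motion.

To calculate $\langle X\rangle$, $\langle P\rangle$ and $\langle H\rangle$, we use the expressions:

$$\hat X = \beta X = \frac{1}{\sqrt 2}\,(a + a^{\dagger}) \qquad \hat P = \frac{1}{\hbar\beta}\,P = -\frac{i}{\sqrt 2}\,(a - a^{\dagger}) \tag{13}$$

and:

$$H = \hbar\omega\left(a^{\dagger}a + \frac{1}{2}\right) \tag{14}$$

For an arbitrary state $|\psi(t)\rangle$, the time evolution of the matrix element
$\langle a\rangle(t) = \langle\psi(t)|\,a\,|\psi(t)\rangle$ is given by (cf. §D-1-d of chapter III):

$$i\hbar\,\frac{d}{dt}\,\langle a\rangle(t) = \langle\,[a, H]\,\rangle(t) \tag{15}$$

Now:

$$[a, H] = \hbar\omega\,[a, a^{\dagger}a] = \hbar\omega\,a \tag{16}$$

which implies:

$$i\,\frac{d}{dt}\,\langle a\rangle(t) = \omega\,\langle a\rangle(t) \tag{17}$$

that is:

$$\langle a\rangle(t) = \langle a\rangle(0)\,e^{-i\omega t} \tag{18}$$

The evolution of $\langle a^{\dagger}\rangle(t) = \langle\psi(t)|\,a^{\dagger}\,|\psi(t)\rangle$ obeys the complex conjugate
equation:

$$\langle a^{\dagger}\rangle(t) = \langle a^{\dagger}\rangle(0)\,e^{i\omega t} = \langle a\rangle^{*}(0)\,e^{i\omega t} \tag{19}$$

(18) and (19) are analogous to the classical equation (7).

Substituting (18) and (19) into (13), we obtain:

$$\langle\hat X\rangle(t) = \frac{1}{\sqrt 2}\left[\langle a\rangle(0)\,e^{-i\omega t} + \langle a\rangle^{*}(0)\,e^{i\omega t}\right] \qquad \langle\hat P\rangle(t) = -\frac{i}{\sqrt 2}\left[\langle a\rangle(0)\,e^{-i\omega t} - \langle a\rangle^{*}(0)\,e^{i\omega t}\right] \tag{20}$$

Comparing these results with (9), we see that, in order to have at all times $t$:

$$\langle\hat X\rangle(t) = \hat x(t) \qquad \langle\hat P\rangle(t) = \hat p(t) \tag{21}$$

it is necessary and sufficient to set, at the instant $t = 0$, the condition:

$$\langle a\rangle(0) = \alpha_0 \tag{22}$$

where $\alpha_0$ is the complex parameter characterizing the classical motion which we are
trying to reproduce quantum mechanically. The normalized state vector $|\psi(t)\rangle$
of the oscillator must therefore satisfy the condition:

$$\langle\psi(0)|\,a\,|\psi(0)\rangle = \alpha_0 \tag{23}$$

We must now require the mean value of the energy:

$$\langle H\rangle = \hbar\omega\,\langle a^{\dagger}a\rangle(0) + \frac{\hbar\omega}{2} \tag{24}$$

to be equal to the classical energy $\mathscr{H}$ given by (11). Since, for a classical oscillator,
$|\alpha_0|$ is much greater than 1 [cf. (12)], we shall neglect the term $\hbar\omega/2$ (of purely
quantum mechanical origin; see §D-2 of chapter V) with respect to $\hbar\omega\,|\alpha_0|^{2}$.
The second condition on the state vector can now be written:

$$\langle a^{\dagger}a\rangle(0) = |\alpha_0|^{2} \tag{25}$$

that is:

$$\langle\psi(0)|\,a^{\dagger}a\,|\psi(0)\rangle = |\alpha_0|^{2} \tag{26}$$

We shall see that conditions (23) and (26) are sufficient to determine the
normalized state vector $|\psi(0)\rangle$ (to within a constant phase factor).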


c. QUASI-CLASSICAL STATES ARE EIGENVECTORS OF THE OPERATOR a 

We introduce the operator $b(\alpha_0)$ defined by: 

$$b(\alpha_0) = a - \alpha_0 \tag{27}$$

We then have: 

$$b^\dagger(\alpha_0)\, b(\alpha_0) = a^\dagger a - \alpha_0\, a^\dagger - \alpha_0^*\, a + \alpha_0^* \alpha_0 \tag{28}$$

and the square of the norm of the ket $b(\alpha_0)\, |\psi(0)\rangle$ is: 

$$\langle \psi(0) | b^\dagger(\alpha_0)\, b(\alpha_0) | \psi(0) \rangle = \langle \psi(0) | a^\dagger a | \psi(0) \rangle - \alpha_0 \langle \psi(0) | a^\dagger | \psi(0) \rangle - \alpha_0^* \langle \psi(0) | a | \psi(0) \rangle + \alpha_0^* \alpha_0 \tag{29}$$

Substituting into this relation conditions (23) and (26), we obtain: 

$$\langle \psi(0) | b^\dagger(\alpha_0)\, b(\alpha_0) | \psi(0) \rangle = \alpha_0^* \alpha_0 - \alpha_0 \alpha_0^* - \alpha_0^* \alpha_0 + \alpha_0^* \alpha_0 = 0 \tag{30}$$

The ket $b(\alpha_0)\, |\psi(0)\rangle$, whose norm is zero, is therefore zero: 

$$b(\alpha_0)\, |\psi(0)\rangle = 0 \tag{31}$$

that is: 

$$a\, |\psi(0)\rangle = \alpha_0\, |\psi(0)\rangle \tag{32}$$

Conversely, if the normalized vector $|\psi(0)\rangle$ satisfies this relation, it is 
obvious that conditions (23) and (26) are satisfied. 

We therefore arrive at the following result: the quasi-classical state, associated 
with a classical motion characterized by the parameter $\alpha_0$, is such that 
$|\psi(0)\rangle$ is an eigenvector of the operator $a$ with the eigenvalue $\alpha_0$. 

In what follows, we shall denote the eigenvector of $a$ with eigenvalue $\alpha$ by $|\alpha\rangle$: 

$$a\, |\alpha\rangle = \alpha\, |\alpha\rangle \tag{33}$$

[we shall show later that the solution of (33) is unique to within a constant 
factor]. 


2. Properties of the $|\alpha\rangle$ states 

a. EXPANSION OF $|\alpha\rangle$ ON THE BASIS OF THE STATIONARY STATES $|\varphi_n\rangle$ 

Let us determine the ket $|\alpha\rangle$ which is a solution of (33) by using an 
expansion on the states $|\varphi_n\rangle$: 

$$|\alpha\rangle = \sum_n c_n(\alpha)\, |\varphi_n\rangle \tag{34}$$

We then have: 

$$a\, |\alpha\rangle = \sum_n c_n(\alpha)\, \sqrt{n}\; |\varphi_{n-1}\rangle \tag{35}$$

and, substituting this relation into (33), we obtain: 

$$c_{n+1}(\alpha) = \frac{\alpha}{\sqrt{n+1}}\; c_n(\alpha) \tag{36}$$

This relation enables us to determine by recurrence all the coefficients $c_n(\alpha)$ in 
terms of $c_0(\alpha)$: 

$$c_n(\alpha) = \frac{\alpha^n}{\sqrt{n!}}\; c_0(\alpha) \tag{37}$$

It follows that, when $c_0(\alpha)$ is fixed, all the $c_n(\alpha)$ are also fixed. The vector $|\alpha\rangle$ 
is therefore unique to within a multiplicative factor. We shall choose $c_0(\alpha)$ real and 
positive and normalize the ket $|\alpha\rangle$, which determines it completely. In this case, 
the coefficients $c_n(\alpha)$ satisfy: 

$$\sum_n |c_n(\alpha)|^2 = 1 \tag{38}$$

that is: 

$$|c_0(\alpha)|^2 \sum_n \frac{|\alpha|^{2n}}{n!} = |c_0(\alpha)|^2\, e^{|\alpha|^2} = 1 \tag{39}$$

With the convention we have chosen: 

$$c_0(\alpha) = e^{-|\alpha|^2/2} \tag{40}$$

and, finally: 

$$|\alpha\rangle = e^{-|\alpha|^2/2} \sum_n \frac{\alpha^n}{\sqrt{n!}}\; |\varphi_n\rangle \tag{41}$$

b. POSSIBLE VALUES OF THE ENERGY IN AN | a > STATE 

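Let us consider an oscillator in the state $|\alpha\rangle$. We see from (41) that a 
measurement of the energy can yield the result $E_n = (n + 1/2)\,\hbar\omega$ with the 
probability: 

$$\mathcal{P}_n(\alpha) = |c_n(\alpha)|^2 = \frac{|\alpha|^{2n}}{n!}\; e^{-|\alpha|^2} \tag{42}$$

The probability distribution obtained, $\mathcal{P}_n(\alpha)$, is therefore a Poisson distribution. 
Since: 

$$\mathcal{P}_n(\alpha) = \frac{|\alpha|^2}{n}\; \mathcal{P}_{n-1}(\alpha) \tag{43}$$

it is easy to verify that $\mathcal{P}_n(\alpha)$ reaches its maximum value when: 

$$n = \text{the integral part of } |\alpha|^2 \tag{44}$$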
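
The Poisson weights of (42) and the position of their maximum (44) can be checked numerically. The following is only a minimal sketch: the function name `poisson_weights`, the cut-off `n_max` and the value of `alpha` are illustrative assumptions, not part of the text.

```python
import math

def poisson_weights(alpha, n_max):
    """Return P_n(alpha) = |alpha|^(2n) e^{-|alpha|^2} / n! for n = 0..n_max."""
    mean = abs(alpha) ** 2
    weights = [math.exp(-mean)]
    for n in range(1, n_max + 1):
        # Recurrence (43): P_n = (|alpha|^2 / n) * P_{n-1}
        weights.append(weights[-1] * mean / n)
    return weights

alpha = 2.3 + 1.1j                 # illustrative value, |alpha|^2 = 6.5
weights = poisson_weights(alpha, n_max=80)

total = sum(weights)                                          # close to 1 once n_max >> |alpha|^2
most_probable_n = max(range(len(weights)), key=weights.__getitem__)
mean_n = sum(n * p for n, p in enumerate(weights))            # close to |alpha|^2
```

Using the recurrence (43) instead of evaluating the factorial directly keeps every intermediate number of moderate size, which matters if one wants to try larger values of $|\alpha|$.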

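To calculate the mean value $\langle H \rangle_\alpha$ of the energy, we can use (42) and 
the expression: 

$$\langle H \rangle_\alpha = \sum_n \mathcal{P}_n(\alpha) \left( n + \frac{1}{2} \right) \hbar\omega \tag{45}$$

Nevertheless, it is quicker to notice that, since the adjoint relation of (33) is: 

$$\langle \alpha |\, a^\dagger = \alpha^* \langle \alpha | \tag{46}$$

we have: 

$$\langle \alpha | a^\dagger a | \alpha \rangle = \alpha^* \alpha \tag{47}$$

and therefore: 

$$\langle H \rangle_\alpha = \hbar\omega\, \langle \alpha | \left[ a^\dagger a + \frac{1}{2} \right] | \alpha \rangle = \hbar\omega \left[ |\alpha|^2 + \frac{1}{2} \right] \tag{48}$$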




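Comparing this result to (44), we see that, when $|\alpha| \gg 1$, $\langle H \rangle_\alpha$ is not very 
different, in relative value, from the energy $E_n$ which corresponds to the maximum 
value of $\mathcal{P}_n(\alpha)$. 

Let us calculate the mean value $\langle H^2 \rangle_\alpha$: 

$$\langle H^2 \rangle_\alpha = \hbar^2\omega^2\, \langle \alpha | \left[ a^\dagger a + \frac{1}{2} \right]^2 | \alpha \rangle \tag{49}$$

Using (33) and the fact that $[a, a^\dagger] = 1$, we easily obtain: 

$$\langle H^2 \rangle_\alpha = \hbar^2\omega^2 \left[ |\alpha|^4 + 2\,|\alpha|^2 + \frac{1}{4} \right] \tag{50}$$

from which we get: 

$$\Delta H_\alpha = \hbar\omega\, |\alpha| \tag{51}$$

If we compare (48) and (51), we see that, if $|\alpha|$ is very large, we have: 

$$\frac{\Delta H_\alpha}{\langle H \rangle_\alpha} \simeq \frac{1}{|\alpha|} \ll 1 \tag{52}$$

The relative value of the energy of the state $|\alpha\rangle$ is very well-defined. 

COMMENT: 

Since: 

$$N = a^\dagger a = \frac{1}{\hbar\omega} \left( H - \frac{\hbar\omega}{2} \right) \tag{53}$$

we immediately obtain from (48) and (51): 

$$\langle N \rangle_\alpha = |\alpha|^2, \qquad \Delta N_\alpha = |\alpha| \tag{54}$$

Thus we see that, to obtain a quasi-classical state, we must linearly 
superpose a very large number of states $|\varphi_n\rangle$ since $\Delta N_\alpha \gg 1$. However, 
the relative value of the dispersion over $N$ is very small: 

$$\frac{\Delta N_\alpha}{\langle N \rangle_\alpha} = \frac{1}{|\alpha|} \ll 1 \tag{55}$$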


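c. CALCULATION OF $\langle X \rangle$, $\langle P \rangle$, $\Delta X$ AND $\Delta P$ IN AN $|\alpha\rangle$ STATE 

The mean values $\langle X \rangle$ and $\langle P \rangle$ can be obtained by expressing $X$ and $P$ in 
terms of $a$ and $a^\dagger$ [formula (13)] and using (33) and (46). We obtain: 

$$\langle X \rangle_\alpha = \langle \alpha | X | \alpha \rangle = \sqrt{\frac{2\hbar}{m\omega}}\; \mathrm{Re}\,(\alpha), \qquad \langle P \rangle_\alpha = \langle \alpha | P | \alpha \rangle = \sqrt{2 m \hbar\omega}\; \mathrm{Im}\,(\alpha) \tag{56}$$

An analogous calculation yields: 

$$\langle X^2 \rangle_\alpha = \frac{\hbar}{2m\omega} \left[ (\alpha + \alpha^*)^2 + 1 \right], \qquad \langle P^2 \rangle_\alpha = \frac{m\hbar\omega}{2} \left[ 1 - (\alpha - \alpha^*)^2 \right] \tag{57}$$

and therefore: 

$$\Delta X_\alpha = \sqrt{\frac{\hbar}{2m\omega}}, \qquad \Delta P_\alpha = \sqrt{\frac{m\hbar\omega}{2}} \tag{58}$$

Neither $\Delta X_\alpha$ nor $\Delta P_\alpha$ depends on $\alpha$. Note also that $\Delta X_\alpha \cdot \Delta P_\alpha$ takes on its minimum 
value: 

$$\Delta X_\alpha \cdot \Delta P_\alpha = \hbar/2 \tag{59}$$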
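
As a numerical cross-check of (56) to (59), one can build the coefficients of (41) on a truncated basis and evaluate the moments directly. This is a sketch under stated assumptions: units with $\hbar = m = \omega = 1$, a truncation at `n_max`, and helper names (`coherent_coefficients`, `moments`) that are purely illustrative.

```python
import math

def coherent_coefficients(alpha, n_max):
    """Coefficients c_n = e^{-|alpha|^2/2} alpha^n / sqrt(n!) of (41), n = 0..n_max."""
    coeffs = [complex(math.exp(-abs(alpha) ** 2 / 2))]
    for n in range(1, n_max + 1):
        coeffs.append(coeffs[-1] * alpha / math.sqrt(n))
    return coeffs

def moments(coeffs):
    """Return <a>, <a^2> and <N> for a state given by its coefficients on |phi_n>."""
    mean_a = sum(coeffs[n].conjugate() * coeffs[n + 1] * math.sqrt(n + 1)
                 for n in range(len(coeffs) - 1))
    mean_a2 = sum(coeffs[n].conjugate() * coeffs[n + 2] * math.sqrt((n + 1) * (n + 2))
                  for n in range(len(coeffs) - 2))
    mean_n = sum(n * abs(c) ** 2 for n, c in enumerate(coeffs))
    return mean_a, mean_a2, mean_n

alpha = 1.5 - 0.8j                      # illustrative value
mean_a, mean_a2, mean_n = moments(coherent_coefficients(alpha, n_max=80))

# With hbar = m = omega = 1: X = (a + a^dagger)/sqrt(2), P = -i (a - a^dagger)/sqrt(2)
mean_x = math.sqrt(2) * mean_a.real
mean_p = math.sqrt(2) * mean_a.imag
mean_x2 = mean_a2.real + mean_n + 0.5
mean_p2 = -mean_a2.real + mean_n + 0.5
delta_x = math.sqrt(mean_x2 - mean_x ** 2)
delta_p = math.sqrt(mean_p2 - mean_p ** 2)
```

In these units the product `delta_x * delta_p` should come out close to 1/2 whatever value of `alpha` is chosen, provided `n_max` is well above $|\alpha|^2$.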


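d. THE OPERATOR $D(\alpha)$; THE WAVE FUNCTIONS $\psi_\alpha(x)$ 

Consider the operator $D(\alpha)$ defined by: 

$$D(\alpha) = e^{\alpha a^\dagger - \alpha^* a} \tag{60}$$

This operator is unitary, since: 

$$D^\dagger(\alpha) = e^{\alpha^* a - \alpha a^\dagger} \tag{61}$$

immediately implies: 

$$D(\alpha)\, D^\dagger(\alpha) = D^\dagger(\alpha)\, D(\alpha) = 1 \tag{62}$$

Since the commutator of the operators $\alpha a^\dagger$ and $-\alpha^* a$ is equal to $\alpha^* \alpha$, which is a 
number, we can use relation (63) of complement B$_{II}$ to write: 

$$D(\alpha) = e^{-|\alpha|^2/2}\; e^{\alpha a^\dagger}\; e^{-\alpha^* a} \tag{63}$$

Now let us calculate the ket $D(\alpha)\, |\varphi_0\rangle$; since: 

$$e^{-\alpha^* a}\, |\varphi_0\rangle = \left[ 1 - \alpha^* a + \frac{\alpha^{*2}}{2!}\, a^2 + \dots \right] |\varphi_0\rangle = |\varphi_0\rangle \tag{64}$$

then: 

$$D(\alpha)\, |\varphi_0\rangle = e^{-|\alpha|^2/2}\; e^{\alpha a^\dagger}\, |\varphi_0\rangle = e^{-|\alpha|^2/2} \sum_n \frac{(\alpha a^\dagger)^n}{n!}\, |\varphi_0\rangle = e^{-|\alpha|^2/2} \sum_n \frac{\alpha^n}{\sqrt{n!}}\, |\varphi_n\rangle \tag{65}$$

Comparing (41) and (65), we see that: 

$$|\alpha\rangle = D(\alpha)\, |\varphi_0\rangle \tag{66}$$

$D(\alpha)$ is therefore the unitary transformation which transforms the ground state 
$|\varphi_0\rangle$ into the quasi-classical state $|\alpha\rangle$. 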

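Formula (66) will enable us to obtain the wave function: 

$$\psi_\alpha(x) = \langle x | \alpha \rangle \tag{67}$$

which characterizes the quasi-classical state $|\alpha\rangle$ in the $\{ |x\rangle \}$ representation. 
To calculate: 

$$\psi_\alpha(x) = \langle x | D(\alpha) | \varphi_0 \rangle \tag{68}$$

we shall write the operator $\alpha a^\dagger - \alpha^* a$ in terms of $X$ and $P$: 

$$\alpha a^\dagger - \alpha^* a = \sqrt{\frac{m\omega}{\hbar}} \left( \frac{\alpha - \alpha^*}{\sqrt{2}} \right) X - \frac{i}{\sqrt{m\hbar\omega}} \left( \frac{\alpha + \alpha^*}{\sqrt{2}} \right) P \tag{69}$$

Using formula (63) of complement B$_{II}$ again, we obtain: 

$$D(\alpha) = e^{\alpha a^\dagger - \alpha^* a} = e^{\sqrt{\frac{m\omega}{\hbar}}\, \frac{\alpha - \alpha^*}{\sqrt{2}}\, X}\; e^{-\frac{i}{\sqrt{m\hbar\omega}}\, \frac{\alpha + \alpha^*}{\sqrt{2}}\, P}\; e^{\frac{\alpha^{*2} - \alpha^2}{4}} \tag{70}$$

Substituting this result into (68), we find: 

$$\psi_\alpha(x) = e^{\frac{\alpha^{*2} - \alpha^2}{4}}\; \langle x |\, e^{\sqrt{\frac{m\omega}{\hbar}}\, \frac{\alpha - \alpha^*}{\sqrt{2}}\, X}\; e^{-\frac{i}{\sqrt{m\hbar\omega}}\, \frac{\alpha + \alpha^*}{\sqrt{2}}\, P}\, | \varphi_0 \rangle = e^{\frac{\alpha^{*2} - \alpha^2}{4}}\; e^{\sqrt{\frac{m\omega}{\hbar}}\, \frac{\alpha - \alpha^*}{\sqrt{2}}\, x}\; \langle x |\, e^{-\frac{i}{\sqrt{m\hbar\omega}}\, \frac{\alpha + \alpha^*}{\sqrt{2}}\, P}\, | \varphi_0 \rangle \tag{71}$$

Now, the operator $e^{-i\lambda P/\hbar}$ is the translation operator of $\lambda$ along $Ox$ (cf. 
complement E$_{II}$): 

$$\langle x |\, e^{-\frac{i}{\sqrt{m\hbar\omega}}\, \frac{\alpha + \alpha^*}{\sqrt{2}}\, P} = \left\langle x - \sqrt{\frac{\hbar}{2m\omega}}\, (\alpha + \alpha^*) \right| \tag{72}$$

Relation (71) therefore yields: 

$$\psi_\alpha(x) = e^{\frac{\alpha^{*2} - \alpha^2}{4}}\; e^{\sqrt{\frac{m\omega}{\hbar}}\, \frac{\alpha - \alpha^*}{\sqrt{2}}\, x}\; \varphi_0\!\left( x - \sqrt{\frac{\hbar}{2m\omega}}\, (\alpha + \alpha^*) \right) \tag{73}$$

If we write $\alpha$ and $\alpha^*$ in terms of $\langle X \rangle_\alpha$ and $\langle P \rangle_\alpha$ [formulas (56)], $\psi_\alpha(x)$ becomes: 

$$\psi_\alpha(x) = e^{i\theta_\alpha}\; e^{i \langle P \rangle_\alpha x/\hbar}\; \varphi_0\!\left( x - \langle X \rangle_\alpha \right) \tag{74}$$

where the global phase factor $e^{i\theta_\alpha}$ is defined by: 

$$e^{i\theta_\alpha} = e^{\frac{\alpha^{*2} - \alpha^2}{4}} \tag{75}$$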
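Relation (74) shows that $\psi_\alpha(x)$ can easily be obtained from the wave function $\varphi_0(x)$ 
of the ground state of the oscillator: it is sufficient to translate this function 
along $Ox$ by the quantity $\langle X \rangle_\alpha$ and to multiply it by the oscillating exponential 
$e^{i \langle P \rangle_\alpha x/\hbar}$ (since the factor $e^{i\theta_\alpha}$ plays no physical role, it can be omitted)*. 

If we replace $\varphi_0$ in (74) by its explicit expression, we obtain, finally: 

$$\psi_\alpha(x) = e^{i\theta_\alpha} \left( \frac{m\omega}{\pi\hbar} \right)^{1/4} \exp\left\{ -\left[ \frac{x - \langle X \rangle_\alpha}{2\, \Delta X_\alpha} \right]^2 + i\, \langle P \rangle_\alpha\, \frac{x}{\hbar} \right\} \tag{76}$$

The form of the wave packet associated with the $|\alpha\rangle$ state is therefore given by: 

$$|\psi_\alpha(x)|^2 = \sqrt{\frac{m\omega}{\pi\hbar}}\; \exp\left\{ -\frac{1}{2} \left[ \frac{x - \langle X \rangle_\alpha}{\Delta X_\alpha} \right]^2 \right\} \tag{77}$$

For any $|\alpha\rangle$ state, we obtain a Gaussian wave packet. This result should be 
compared with the fact that the product $\Delta X_\alpha \cdot \Delta P_\alpha$ is always minimal (cf. complement 
C$_{III}$). 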


e. THE SCALAR PRODUCT OF TWO $|\alpha\rangle$ STATES. CLOSURE RELATION 

The $|\alpha\rangle$ states are eigenvectors of the non-Hermitian operator $a$. There is therefore 
no obvious reason for these states to satisfy orthogonality and closure relations. In this section, 
we shall investigate this question. 

First, we shall consider two eigenkets $|\alpha\rangle$ and $|\alpha'\rangle$ of the operator $a$. Relation (41) 
gives their scalar product, since: 

$$\langle \alpha | \alpha' \rangle = \sum_n c_n^*(\alpha)\, c_n(\alpha') \tag{78}$$

We therefore have: 

$$\langle \alpha | \alpha' \rangle = e^{-|\alpha|^2/2}\; e^{-|\alpha'|^2/2} \sum_n \frac{(\alpha^* \alpha')^n}{n!} = e^{-|\alpha|^2/2}\; e^{-|\alpha'|^2/2}\; e^{\alpha^* \alpha'} \tag{79}$$

from which we conclude: 

$$|\langle \alpha | \alpha' \rangle|^2 = e^{-|\alpha - \alpha'|^2} \tag{80}$$

This scalar product is therefore never zero. 
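
A quick numerical illustration of (78) to (80): the overlap of two $|\alpha\rangle$ states is summed term by term and compared with the closed forms. This is a hedged sketch; the helper names, the truncation `n_max` and the two sample values are assumptions made for the example only.

```python
import cmath
import math

def coherent_coefficients(alpha, n_max):
    """Coefficients c_n(alpha) of formula (41) for n = 0..n_max (truncated expansion)."""
    coeffs = [complex(math.exp(-abs(alpha) ** 2 / 2))]
    for n in range(1, n_max + 1):
        coeffs.append(coeffs[-1] * alpha / math.sqrt(n))
    return coeffs

def overlap(alpha, alpha_prime, n_max=80):
    """Scalar product <alpha|alpha'> summed term by term as in (78)."""
    bra = coherent_coefficients(alpha, n_max)
    ket = coherent_coefficients(alpha_prime, n_max)
    return sum(b.conjugate() * k for b, k in zip(bra, ket))

alpha, alpha_prime = 1.0 + 0.5j, -0.3 + 1.2j          # illustrative values
numerical = overlap(alpha, alpha_prime)
closed_form = cmath.exp(-abs(alpha) ** 2 / 2 - abs(alpha_prime) ** 2 / 2
                        + alpha.conjugate() * alpha_prime)            # formula (79)
squared_modulus = abs(numerical) ** 2
expected_squared_modulus = math.exp(-abs(alpha - alpha_prime) ** 2)   # formula (80)
```

The squared modulus decreases as a Gaussian of the distance $|\alpha - \alpha'|$ in the complex plane, so two states with well-separated parameters are nearly, but never exactly, orthogonal.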

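However, we shall show that the $|\alpha\rangle$ states do satisfy a closure relation, which is 
written: 

$$\frac{1}{\pi} \iint |\alpha\rangle \langle \alpha|\; d\{\mathrm{Re}\, \alpha\}\; d\{\mathrm{Im}\, \alpha\} = 1 \tag{81}$$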

* The exponential $e^{i \langle P \rangle_\alpha x/\hbar}$ is obviously not a global phase factor since its value depends on $x$. 
The presence of this exponential in (74) ensures that the mean value of $P$ in the state described by $\psi_\alpha(x)$ 
is equal to $\langle P \rangle_\alpha$. 
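To do so, we replace $|\alpha\rangle$, on the left-hand side of (81), by its expression (41). This yields: 

$$\frac{1}{\pi} \iint e^{-|\alpha|^2} \sum_n \frac{\alpha^n}{\sqrt{n!}}\, |\varphi_n\rangle \sum_m \frac{\alpha^{*m}}{\sqrt{m!}}\, \langle \varphi_m |\; d\{\mathrm{Re}\, \alpha\}\; d\{\mathrm{Im}\, \alpha\} \tag{82}$$

that is, going into polar coordinates in the complex $\alpha$ plane (setting $\alpha = \rho\, e^{i\phi}$): 

$$\frac{1}{\pi} \int_0^\infty \rho\, d\rho \int_0^{2\pi} d\phi\; e^{-\rho^2} \sum_{n,m} e^{i(n-m)\phi}\, \frac{\rho^{n+m}}{\sqrt{n!\, m!}}\; |\varphi_n\rangle \langle \varphi_m| \tag{83}$$

The integral over $\phi$ is easily calculated: 

$$\int_0^{2\pi} e^{i(n-m)\phi}\, d\phi = 2\pi\, \delta_{nm} \tag{84}$$

which yields for (83): 

$$\sum_n \frac{1}{n!}\; I_n\; |\varphi_n\rangle \langle \varphi_n| \tag{85}$$

with: 

$$I_n = 2 \int_0^\infty \rho\, d\rho\; e^{-\rho^2} \rho^{2n} = \int_0^\infty du\; e^{-u}\, u^n \tag{86}$$

Integrating by parts, we find a recurrence relation for the $I_n$: 

$$I_n = n\, I_{n-1} \tag{87}$$

whose solution is: 

$$I_n = n!\; I_0 = n! \tag{88}$$

Substituting this result into (85), we see that the left-hand side of formula (81) can finally be 
written: 

$$\sum_n |\varphi_n\rangle \langle \varphi_n| = 1 \tag{89}$$

which proves that formula. 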
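
The integration by parts behind the recurrence (87) is short enough to be written out. The following is a sketch of that step, using the variable $u = \rho^2$ of (86); the boundary term vanishes for $n \geq 1$.

```latex
\begin{aligned}
I_n &= \int_0^\infty u^n\, e^{-u}\, du
     = \Big[ -u^n e^{-u} \Big]_0^\infty + n \int_0^\infty u^{n-1}\, e^{-u}\, du
     = n\, I_{n-1} \qquad (n \geq 1), \\
I_0 &= \int_0^\infty e^{-u}\, du = 1 .
\end{aligned}
```

Iterating the first line down to $I_0$ gives $I_n = n!$, which cancels the factor $1/n!$ in (85).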


3. Time evolution of a quasi-classical state 


Consider a harmonic oscillator which, at the instant $t = 0$, is in a particular 
$|\alpha\rangle$ state: 

$$|\psi(0)\rangle = |\alpha_0\rangle \tag{90}$$

How do its physical properties evolve over time? We already know (cf. §1-b) that 
the mean values $\langle X \rangle(t)$ and $\langle P \rangle(t)$ always remain equal to the corresponding 
classical values. We shall now study other interesting properties of the state 
vector $|\psi(t)\rangle$. 
a. A QUASI-CLASSICAL STATE ALWAYS REMAINS AN EIGENVECTOR OF $a$ 

Starting with (41), we can use the general rule to obtain $|\psi(t)\rangle$ when the 
Hamiltonian is not time-dependent (cf. chap. III, §D-2-a): 

$$|\psi(t)\rangle = e^{-|\alpha_0|^2/2} \sum_n \frac{\alpha_0^n}{\sqrt{n!}}\; e^{-i E_n t/\hbar}\; |\varphi_n\rangle = e^{-i\omega t/2}\; e^{-|\alpha_0|^2/2} \sum_n \frac{\alpha_0^n\, e^{-i n \omega t}}{\sqrt{n!}}\; |\varphi_n\rangle \tag{91}$$

If we compare this result with (41), we see that, to go from $|\psi(0)\rangle = |\alpha_0\rangle$ to $|\psi(t)\rangle$, 
all we must do is change $\alpha_0$ to $\alpha_0\, e^{-i\omega t}$ and multiply the ket obtained by $e^{-i\omega t/2}$ 
(which is a global phase factor with no physical consequences): 

$$|\psi(t)\rangle = e^{-i\omega t/2}\; |\alpha = \alpha_0\, e^{-i\omega t}\rangle \tag{92}$$

In other words, we see that a quasi-classical state remains an eigenvector of $a$ for 
all time, with an eigenvalue $\alpha_0\, e^{-i\omega t}$ which is nothing more than the parameter $\alpha(t)$ 
of figure 1 (corresponding to the point $M$), which characterizes at all times the 
classical oscillator whose motion is reproduced by the state $|\psi(t)\rangle$. 
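
Relation (92) can be verified component by component on a truncated basis: each coefficient of (41) is multiplied by its phase $e^{-iE_n t/\hbar}$ and compared with the coefficients of the state $|\alpha_0 e^{-i\omega t}\rangle$. The sketch below assumes $\hbar = 1$; the helper name, the truncation and all numerical values are illustrative.

```python
import cmath
import math

def coherent_coefficients(alpha, n_max):
    """Coefficients c_n(alpha) of formula (41) for n = 0..n_max (truncated expansion)."""
    coeffs = [complex(math.exp(-abs(alpha) ** 2 / 2))]
    for n in range(1, n_max + 1):
        coeffs.append(coeffs[-1] * alpha / math.sqrt(n))
    return coeffs

omega, t = 2.0, 0.7                 # illustrative angular frequency and time
alpha_0 = 1.2 + 0.4j                # illustrative initial parameter
n_max = 60

initial = coherent_coefficients(alpha_0, n_max)
# Each stationary component picks up exp(-i E_n t / hbar), with E_n = (n + 1/2) hbar omega
evolved = [c * cmath.exp(-1j * (n + 0.5) * omega * t) for n, c in enumerate(initial)]

# Prediction of (92): global phase times the coherent state of parameter alpha_0 exp(-i omega t)
alpha_t = alpha_0 * cmath.exp(-1j * omega * t)
predicted = [cmath.exp(-0.5j * omega * t) * c
             for c in coherent_coefficients(alpha_t, n_max)]

max_difference = max(abs(u - v) for u, v in zip(evolved, predicted))
```

The two lists agree to rounding error for any `t`, since the factor $e^{-in\omega t}$ can be absorbed exactly into $\alpha_0^n$.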


b. EVOLUTION OF PHYSICAL PROPERTIES 


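Using (92) and changing $\alpha$ to $\alpha_0\, e^{-i\omega t}$ in (56), we immediately obtain: 

$$\langle X \rangle(t) = \sqrt{\frac{2\hbar}{m\omega}}\; \mathrm{Re}\left[ \alpha_0\, e^{-i\omega t} \right], \qquad \langle P \rangle(t) = \sqrt{2 m \hbar\omega}\; \mathrm{Im}\left[ \alpha_0\, e^{-i\omega t} \right] \tag{93}$$

As predicted, these equations are similar to the classical equations (9). 

The average energy of the oscillator is time-independent: 

$$\langle H \rangle = \hbar\omega \left[ |\alpha_0|^2 + \frac{1}{2} \right] \tag{94}$$

According to (51) and (58), the root-mean-square deviations $\Delta H$, $\Delta X$ and $\Delta P$ are 
equal to: 

$$\Delta H = \hbar\omega\, |\alpha_0| \tag{95}$$

and: 

$$\Delta X = \sqrt{\frac{\hbar}{2m\omega}}, \qquad \Delta P = \sqrt{\frac{m\hbar\omega}{2}} \tag{96}$$

$\Delta X$ and $\Delta P$ are not time-dependent; the wave packet remains a minimum wave 
packet for all time. 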
c. MOTION OF THE WAVE PACKET 

Let us calculate the wave function at time $t$: 

$$\psi(x, t) = \langle x | \psi(t) \rangle \tag{97}$$

where $|\psi(t)\rangle$ is given by (92). From (76), we obtain: 

$$\psi(x, t) = e^{i\theta_\alpha} \left( \frac{m\omega}{\pi\hbar} \right)^{1/4} e^{-i\omega t/2}\; e^{i x \langle P \rangle(t)/\hbar}\; e^{-\left[ \frac{x - \langle X \rangle(t)}{2\, \Delta X} \right]^2} \tag{98}$$

At $t$, the wave packet is still Gaussian. Its form does not vary with time, since: 

$$|\psi(x, t)|^2 = \left| \varphi_0\left[ x - \langle X \rangle(t) \right] \right|^2 \tag{99}$$

Thus, it remains "minimum" for all time [cf. (96)]. 

Figure 2 shows the motion of the wave packet, which performs a periodic 
oscillation ($T = 2\pi/\omega$) along the $Ox$ axis, without becoming distorted. We saw 
in complement G$_I$ that a Gaussian wave packet, when it is free, becomes distorted 
as it propagates, since its width varies ("spreading" of the wave packet). We see 
here that nothing of the sort occurs when a wave packet is subject to the influence 
of a parabolic potential. Physically, this result arises from the fact that the tendency 
of the wave packet to spread is compensated by the potential, whose effect is to 
push the wave packet towards the origin from regions where $V(x)$ is large. 

What happens to these results when $|\alpha|$ is very large? The root-mean-square 
deviations $\Delta X$ and $\Delta P$ do not change, as is shown by (96). On the other hand, the 
oscillation amplitudes of $\langle X \rangle(t)$ and $\langle P \rangle(t)$ become much larger than $\Delta X$ 
and $\Delta P$. By choosing a sufficiently large value of $|\alpha|$, one can obtain a quantum 
mechanical motion for which the position and momentum of the oscillator are, 
in relative value, as well-defined as might be desired. Therefore when $|\alpha| \gg 1$, 
an $|\alpha\rangle$ state describes very well the motion of a macroscopic harmonic oscillator, 
for which the position, the momentum and the energy can be considered to be 
classical quantities. 

FIGURE 2 

Motion of the Gaussian wave packet associated with an $|\alpha\rangle$ state: under 
the effect of the parabolic potential $V(x)$, the wave packet oscillates without 
becoming distorted. 


4. Example: quantum mechanical treatment of a macroscopic oscillator 

Let us consider a concrete example: a macroscopic body of mass $m = 1$ kg 
suspended from a rope of length $l = 0.1$ m and placed in the gravitational 
field ($g \simeq 10$ m/s$^2$). We know that, for small oscillations, the period $T$ of the 
motion is given by: 

$$T = 2\pi \sqrt{\frac{l}{g}} \tag{100}$$

In our case, we obtain: 

$$T \simeq 0.63 \text{ s}, \qquad \omega = 10 \text{ rad/s} \tag{101}$$


Let us now assume that this oscillator performs a periodic motion of 
amplitude $x_M = 1$ cm. What is the quantum mechanical state which best represents 
its oscillation? 

We have seen that this state is an $|\alpha\rangle$ state in which, according to (93), 
$\alpha$ satisfies the relation: 

$$|\alpha| = \sqrt{\frac{m\omega}{2\hbar}}\; x_M \tag{102}$$

that is, in our case: 

$$|\alpha| \simeq \sqrt{5} \times 10^{15} \simeq 2.2 \times 10^{15} \gg 1 \tag{103}$$

(the argument of $\alpha$ is determined by the initial phase of the motion). 
The root-mean-square deviations $\Delta X$ and $\Delta P$ are then: 

$$\Delta X = \sqrt{\frac{\hbar}{2m\omega}} \simeq 2.2 \times 10^{-18} \text{ m} \ll x_M, \qquad \Delta P = \sqrt{\frac{m\hbar\omega}{2}} \simeq 2.2 \times 10^{-17} \text{ kg m/s} \tag{104}$$

The root-mean-square deviation $\Delta V$ of the velocity is equal to: 

$$\Delta V \simeq 2.2 \times 10^{-17} \text{ m/s} \tag{105}$$

Since the maximum velocity of the oscillator is 0.1 m/s, we see that the 
uncertainties of its position and its velocity are completely negligible compared 
to the quantities involved in the problem. For example, $\Delta X$ is less than a 
Fermi ($10^{-15}$ m), that is, the approximate size of a nucleus. Measuring a macroscopic 
length with this accuracy is obviously out of the question. 

Finally, the energy of the oscillator is known with an excellent relative 
accuracy, since, according to (52): 

$$\frac{\Delta H}{\langle H \rangle} \simeq \frac{1}{|\alpha|} \simeq 0.4 \times 10^{-15} \ll 1 \tag{106}$$

The laws of classical mechanics are therefore quite adequate for the study of the 
evolution of the macroscopic oscillator. 
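
The orders of magnitude of this example can be reproduced in a few lines. The sketch below uses the value of $\hbar$ quoted in the table of constants at the front of the book rather than the rounded $10^{-34}$ J s, so the last digits differ slightly from (103) to (106); the variable names are illustrative.

```python
import math

hbar = 1.054589e-34          # J s, value quoted in the book's table of constants
m, length, g = 1.0, 0.1, 10.0   # kg, m, m/s^2 (values of the example)
x_max = 1.0e-2               # oscillation amplitude x_M = 1 cm, in metres

omega = math.sqrt(g / length)                       # angular frequency, 10 rad/s
period = 2 * math.pi / omega                        # formula (100), about 0.63 s
abs_alpha = math.sqrt(m * omega / (2 * hbar)) * x_max   # formula (102), about 2.2e15
delta_x = math.sqrt(hbar / (2 * m * omega))         # formula (104), a few 1e-18 m
delta_p = math.sqrt(m * hbar * omega / 2)           # formula (104), a few 1e-17 kg m/s
delta_v = delta_p / m                               # formula (105)
relative_energy_spread = 1 / abs_alpha              # formula (106), a few 1e-16
```

Whatever the amplitude, `delta_x * delta_p` stays equal to $\hbar/2$; only `abs_alpha`, and with it the relative sharpness of the energy, depends on $x_M$.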


References and suggestions for further reading: 

Glauber’s lectures in (15.2). 
Complement H$_V$ 

NORMAL VIBRATIONAL MODES 

OF TWO COUPLED HARMONIC OSCILLATORS 


1. Vibrations of the two particles in classical mechanics 

a. Solving the equations of motion 

b. The physical meaning of each of the modes 

c. Motion of the system in the general case 

2. Vibrational states of the system in quantum mechanics 

a. Commutation relations 

b. Transformation of the Hamiltonian operator 

c. Stationary states of the system 

d. Evolution of the mean values 


This complement is devoted to the study of the motion of two coupled 
(one-dimensional) harmonic oscillators. Such a study is of interest because it permits 
the introduction, in a very simple case, of a physically important concept: that 
of normal vibrational modes. This concept, encountered in quantum mechanics as 
well as in classical mechanics, appears in numerous problems: for example, in 
the study of atomic vibrations of a crystal (cf. complement J$_V$) and of the 
vibrations of electromagnetic radiation (cf. complement K$_V$). 

Let us therefore consider two particles (1) and (2), of the same mass $m$, 
moving along the $Ox$ axis, with abscissas $x_1$ and $x_2$. To begin, we assume their 
potential energy to be: 

$$U_0(x_1, x_2) = \frac{1}{2}\, m\omega^2 (x_1 - a)^2 + \frac{1}{2}\, m\omega^2 (x_2 + a)^2 \tag{1}$$

When $x_1 = a$ and $x_2 = -a$, the potential energy $U_0(x_1, x_2)$ is minimal, and the 
two particles are in stable equilibrium. If the particles move from these 
equilibrium positions, they are subject to the forces $F_1$ and $F_2$, respectively: 

$$F_1 = -\frac{\partial}{\partial x_1}\, U_0(x_1, x_2) = -m\omega^2 (x_1 - a), \qquad F_2 = -\frac{\partial}{\partial x_2}\, U_0(x_1, x_2) = -m\omega^2 (x_2 + a) \tag{2}$$

and their motion is given by: 

$$m\, \frac{d^2}{dt^2}\, x_1(t) = -m\omega^2 (x_1 - a), \qquad m\, \frac{d^2}{dt^2}\, x_2(t) = -m\omega^2 (x_2 + a) \tag{3}$$
Each particle is therefore in independent sinusoidal motion, centered at its 
equilibrium position. The amplitude of the motion of each particle is arbitrary* 
and can be fixed by a suitable choice of the initial conditions. 

Now let us assume the potential energy $U(x_1, x_2)$ of the two particles to be: 

$$U(x_1, x_2) = U_0(x_1, x_2) + V(x_1, x_2) \tag{4}$$

with: 

$$V(x_1, x_2) = \lambda\, m\omega^2 (x_1 - x_2)^2 \tag{5}$$

(where $\lambda$ is a dimensionless positive constant which we shall call the coupling 
constant). To the forces $F_1$ and $F_2$ written in (2), we must add, respectively, the 
forces $F'_1$ and $F'_2$ given by: 

$$F'_1 = -\frac{\partial}{\partial x_1}\, V(x_1, x_2) = 2\lambda\, m\omega^2 (x_2 - x_1), \qquad F'_2 = -\frac{\partial}{\partial x_2}\, V(x_1, x_2) = 2\lambda\, m\omega^2 (x_1 - x_2) \tag{6}$$

We see that the introduction of $V(x_1, x_2)$ takes into account an attractive 
force between the particles which is proportional to the distance between them. 
The two particles (1) and (2) are therefore no longer independent; what is their 
motion now? Before attacking this problem from a quantum mechanical point 
of view, we shall recall the results given by classical mechanics. 


1. Vibrations of the two particles in classical mechanics 

a. SOLVING THE EQUATIONS OF MOTION 

In the presence of the coupling $V(x_1, x_2)$, we must replace (3) by the system 
of coupled differential equations: 

$$m\, \frac{d^2}{dt^2}\, x_1(t) = -m\omega^2 (x_1 - a) + 2\lambda\, m\omega^2 (x_2 - x_1)$$

$$m\, \frac{d^2}{dt^2}\, x_2(t) = -m\omega^2 (x_2 + a) + 2\lambda\, m\omega^2 (x_1 - x_2) \tag{7}$$

We know how to solve such a system (cf., for example, chapter IV, §C-3-a). 
We diagonalize the matrix $K$ of the coefficients appearing on the right-hand side 
of (7): 

$$K = -m\omega^2 \begin{pmatrix} 1 + 2\lambda & -2\lambda \\ -2\lambda & 1 + 2\lambda \end{pmatrix} \tag{8}$$

We then replace $x_1(t)$ and $x_2(t)$ by linear combinations of these two functions 
(given by the eigenvectors of $K$) whose time dependence obeys uncoupled linear 
differential equations (with coefficients which are the eigenvalues of $K$). 


* Of course, the choice of the potential (1) implies that we are not taking into account the 
collisions that could occur if sufficiently large amplitudes were chosen. 
In this case, these linear combinations are: 

$$x_G(t) = \frac{1}{2} \left[ x_1(t) + x_2(t) \right] \tag{9}$$

(the position of the center of mass of the two particles) and: 

$$x_R(t) = x_1(t) - x_2(t) \tag{10}$$

(the abscissa of the "relative particle"). Substituting (9) and (10) into (7), we 
obtain (taking the sum and the difference): 

$$\frac{d^2}{dt^2}\, x_G(t) = -\omega^2\, x_G(t)$$

$$\frac{d^2}{dt^2}\, x_R(t) = -\omega^2 \left[ x_R(t) - 2a \right] - 4\lambda\, \omega^2\, x_R(t) \tag{11}$$

These equations can be integrated immediately: 

$$x_G(t) = x_G^0 \cos(\omega_G t + \theta_G)$$

$$x_R(t) = \frac{2a}{1 + 4\lambda} + x_R^0 \cos(\omega_R t + \theta_R) \tag{12}$$

with: 

$$\omega_G = \omega, \qquad \omega_R = \omega \sqrt{1 + 4\lambda} \tag{13}$$

$x_G^0$, $x_R^0$, $\theta_G$ and $\theta_R$ are integration constants fixed by the initial conditions. To 
obtain the motion of particles (1) and (2), all we must do is invert formulas (9) 
and (10): 

$$x_1(t) = x_G(t) + \frac{1}{2}\, x_R(t), \qquad x_2(t) = x_G(t) - \frac{1}{2}\, x_R(t) \tag{14}$$

and substitute (12) into these equations. 
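
The frequencies (13) can be recovered by diagonalizing the $2 \times 2$ matrix of (8) numerically. The sketch below works with $-K/m$ (so that the eigenvalues are the squared angular frequencies) and uses the closed-form eigenvalues of a symmetric $2 \times 2$ matrix; the function name and the values of `omega` and `lam` are illustrative assumptions.

```python
import math

def symmetric_2x2_eigenvalues(a, b, d):
    """Eigenvalues of the symmetric matrix [[a, b], [b, d]], smallest first."""
    mean = (a + d) / 2
    radius = math.hypot((a - d) / 2, b)
    return mean - radius, mean + radius

omega, lam = 3.0, 0.25          # illustrative values; lam stands for the coupling constant

# -K/m = omega^2 [[1 + 2 lam, -2 lam], [-2 lam, 1 + 2 lam]], cf. (8)
w_diag = omega ** 2 * (1 + 2 * lam)
w_off = -2 * lam * omega ** 2
low, high = symmetric_2x2_eigenvalues(w_diag, w_off, w_diag)

omega_g = math.sqrt(low)        # eigenvector (1, 1): center-of-mass mode
omega_r = math.sqrt(high)       # eigenvector (1, -1): relative mode
```

The smaller eigenvalue belongs to the symmetric combination $x_1 + x_2$, on which the coupling has no effect, and the larger one to the antisymmetric combination $x_1 - x_2$, in agreement with (9) and (10).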


b. THE PHYSICAL MEANING OF EACH OF THE MODES 

Through the change of functions performed in (9) and (10), we have been 
able to find the motion of the two interacting particles by associating with them 
two fictitious particles $(G)$ and $(R)$, of abscissas $x_G(t)$ and $x_R(t)$. These fictitious 
particles do not interact; their motions are independent, so their amplitudes and 
phases can be fixed arbitrarily by a suitable choice of the initial conditions. For 
example, it is possible to require one of the two fictitious particles to be motionless 
without this being the case for the other one: we then say that a vibrational mode 
of the system is excited. It must be understood that, in a vibrational mode, the 
real particles (1) and (2) are both in motion with the same angular frequency ($\omega_R$ 
or $\omega_G$, depending on the mode). No solution of the equations of motion exists for 
which one of the two real particles (1) or (2) remains motionless while the other 
one vibrates. If, at the instant $t = 0$, one were to give an initial velocity to only 
one of the two particles, (1) or (2), the coupling force would set the other one 
in motion (cf. discussion of §c below). 

The simplest case is of course the one in which neither of the two modes is 
excited. In formulas (12) such a situation corresponds to $x_G^0 = x_R^0 = 0$; $x_G(t)$ 
and $x_R(t)$ then always remain equal to zero and $2a/(1 + 4\lambda)$ respectively, which, 
according to (14), yields: 

$$x_1 = -x_2 = \frac{a}{1 + 4\lambda} \tag{15}$$

The system does not oscillate and the two particles (1) and (2) remain motionless 
in their new equilibrium positions given by (15) (it can be verified that, for these 
values of $x_1$ and $x_2$, the forces exerted on the particles are zero; the fact that 
these equilibrium positions are closer in the presence of the coupling than 
when $\lambda = 0$ is due to their mutual attraction). 

To excite only the mode corresponding to $x_G(t)$, one places the two 
particles (1) and (2) at the initial instant at the same distance $2a/(1 + 4\lambda)$ as in the 
preceding case, and one gives them equal velocities. One then finds that $x_R(t)$ 
remains equal to $2a/(1 + 4\lambda)$ (the initial conditions require $x_R^0$ to be zero). The 
two particles move "in unison", performing the same motion without the distance 
between them varying. For this mode, the two-particle system can be treated like 
a single undeformable particle of mass $2m$ on which is exerted the 
force $F_1 + F_2 = -2m\omega^2\, x_G(t)$. We then see why the angular frequency of this 
mode is $\omega_G = \omega$ [cf. formula (A-3) of chapter V]. 

To excite only the mode corresponding to $x_R(t)$, one chooses an initial state 
in which the positions and initial velocities of the two particles are opposite. One 
then finds that, at every subsequent instant, $x_G(t) = 0$, and the two particles move 
symmetrically with respect to the origin $O$. For this mode, the distance $(x_2 - x_1)$ 
varies and the attractive force between the two particles enters into the equations 
of motion; this is the reason why the angular frequency of this mode is not $\omega$ 
but $\omega_R = \omega \sqrt{1 + 4\lambda}$. 

The dynamical variables $x_G(t)$ and $x_R(t)$ associated with the independent modes, 
that is, with the fictitious particles $(G)$ and $(R)$, are called normal variables. 


c. MOTION OF THE SYSTEM IN THE GENERAL CASE 

In the general case, both modes are excited and the positions $x_1(t)$ and $x_2(t)$ 
are both given by the superposition of two oscillations of different frequencies $\omega_G$ 
and $\omega_R$ [cf. formulas (14)]. The motion of the system is not periodic, except 
in the case in which the ratio $\omega_G/\omega_R$ is rational*. 

Let us investigate, for example, what happens if, at the initial time $t_0$, 
particle (1) is motionless at its equilibrium position $x_1 = a/(1 + 4\lambda)$, while 
particle (2) has a non-zero velocity (this is, in classical mechanics, the analogue 
of the problem studied in §C-3-b of chapter IV). In the absence of coupling, 


* If $\omega_G/\omega_R = 1/\sqrt{1 + 4\lambda}$ is equal to an irreducible rational fraction $p_1/p_2$, the period of the 
motion is: $T = 2\pi p_1/\omega_G = 2\pi p_2/\omega_R$. 
particle (2) would oscillate alone and particle (1) would remain motionless. 
We shall show that the coupling sets particle (1) in motion. Two different angular 
frequencies $\omega_G$ and $\omega_R$ appear in the time evolution of $x_1(t)$ and $x_2(t)$. The two 
corresponding oscillations give rise to a beat phenomenon (fig. 1), whose 
frequency is: 

$$\nu = \frac{\omega_R - \omega_G}{2\pi} = \frac{\omega}{2\pi} \left[ \sqrt{1 + 4\lambda} - 1 \right] \tag{16}$$

FIGURE 1 

Oscillations of the position of particle (1), assumed to be motionless 
at its equilibrium position at $t = 0$, particle (2) having an initial 
velocity. A beat phenomenon is produced between the two modes, 
and the amplitude of the oscillations of particle (1) varies over 
time. 

If the coupling is weak ($\lambda \ll 1$), this frequency $\nu \simeq \lambda\omega/\pi$ is negligible with respect 
to $\omega_R$ and $\omega_G$. In this case, as long as $(t - t_0) \ll 1/\nu$, particle (2) is practically the 
only one to oscillate; the vibrational energy is then slowly transferred to particle (1), 
whose amplitude of oscillation increases, while that of (2) decreases. After a certain 
time, the original situation is inverted: particle (1) oscillates strongly while 
particle (2) is practically motionless. Then the amplitude of (1) slowly decreases 
and that of (2) increases until the energy is again almost entirely localized in 
oscillator (2). The same process is repeated indefinitely. The effect of a weak 
coupling is to cause the energy of the oscillator associated with particle (1) to be 
constantly transferred to the one associated with particle (2) and vice versa, with 
a frequency proportional to the intensity of the coupling. 
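
The energy exchange described above can be illustrated with the explicit solution (12) to (14). The sketch below measures displacements from the equilibrium positions (15), takes $t_0 = 0$ with both particles at equilibrium and only particle (2) moving, and ignores the small coupling energy when attributing an energy to each oscillator; the function names and numerical values are assumptions made for the example.

```python
import math

def displacements(t, omega, lam, v0):
    """Displacements and velocities of particles (1) and (2) from equilibrium.

    Initial conditions at t = 0: both particles at equilibrium,
    particle (1) at rest, particle (2) with velocity v0.
    """
    omega_g = omega
    omega_r = omega * math.sqrt(1 + 4 * lam)
    # Normal variables: u_G = (u1 + u2)/2 starts with velocity v0/2, u_R = u1 - u2 with -v0
    u_g = v0 / (2 * omega_g) * math.sin(omega_g * t)
    u_r = -v0 / omega_r * math.sin(omega_r * t)
    v_g = v0 / 2 * math.cos(omega_g * t)
    v_r = -v0 * math.cos(omega_r * t)
    return (u_g + u_r / 2, u_g - u_r / 2), (v_g + v_r / 2, v_g - v_r / 2)

def uncoupled_energies(t, omega, lam, v0, m=1.0):
    """Energy of each oscillator, leaving out the (small) coupling energy."""
    (u1, u2), (v1, v2) = displacements(t, omega, lam, v0)
    return (0.5 * m * v1 ** 2 + 0.5 * m * omega ** 2 * u1 ** 2,
            0.5 * m * v2 ** 2 + 0.5 * m * omega ** 2 * u2 ** 2)

omega, lam, v0 = 1.0, 0.01, 1.0        # illustrative weak coupling
beat_frequency = omega / (2 * math.pi) * (math.sqrt(1 + 4 * lam) - 1)   # formula (16)

e1_start, e2_start = uncoupled_energies(0.0, omega, lam, v0)
# Half a beat period later the roles of the two particles are exchanged
e1_half, e2_half = uncoupled_energies(1 / (2 * beat_frequency), omega, lam, v0)
```

With these values the energy sits entirely in oscillator (2) at the start and almost entirely in oscillator (1) after a time $1/(2\nu)$; a full cycle back to oscillator (2) takes $1/\nu$.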

COMMENTS: 

(i) If $p_1$ and $p_2$ are the respective momenta of particles (1) and (2), the classical 
Hamiltonian of the system under study can be written: 

$$\mathcal{H}(x_1, x_2, p_1, p_2) = \frac{p_1^2}{2m} + \frac{p_2^2}{2m} + U_0(x_1, x_2) + V(x_1, x_2) \tag{17}$$

If we set: 

$$p_G(t) = p_1(t) + p_2(t), \qquad p_R(t) = \frac{1}{2} \left[ p_1(t) - p_2(t) \right] \tag{18}$$

and: 

$$\mu_G = 2m, \qquad \mu_R = \frac{m}{2} \tag{19}$$

it can be verified that $\mathcal{H}$ becomes: 

$$\mathcal{H} = \frac{p_G^2}{2\mu_G} + \frac{1}{2}\, \mu_G\, \omega_G^2\, x_G^2 + \frac{p_R^2}{2\mu_R} + \frac{1}{2}\, \mu_R\, \omega_R^2 \left( x_R - \frac{2a}{1 + 4\lambda} \right)^2 + m\omega^2 a^2\, \frac{4\lambda}{1 + 4\lambda} \tag{20}$$

By a suitable change in the energy origin, one can eliminate the last term 
of this expression, which is constant. $\mathcal{H}$ can then be seen to be the sum of two 
energies, each of which corresponds to one of the modes. Unlike the situation 
in (17), in which the terms in $x_1 x_2$ of $V(x_1, x_2)$ are responsible for a coupling 
between the particles, there is no coupling term in (20) between the modes: they 
are indeed independent. 

(ii) We have assumed, for simplicity, the masses $m_1$ and $m_2$ of particles (1) and (2) 
to be equal. It is easy to eliminate this hypothesis by replacing (9), (10), (18) 
and (19) by: 

$$x_G(t) = \frac{m_1 x_1(t) + m_2 x_2(t)}{m_1 + m_2}, \qquad p_G(t) = p_1(t) + p_2(t), \qquad \mu_G = m_1 + m_2 \tag{21}$$

(the position, momentum and mass associated with the center of mass) and: 

$$x_R(t) = x_1(t) - x_2(t), \qquad p_R(t) = \frac{m_2\, p_1(t) - m_1\, p_2(t)}{m_1 + m_2}, \qquad \mu_R = \frac{m_1 m_2}{m_1 + m_2} \tag{22}$$

(the position, momentum and mass of the "relative particle"). One then finds 
a result analogous to (20). 


(iii) In the absence of coupling, the two modes have the same angular frequency $\omega$; 
in the presence of coupling, two different angular frequencies $\omega_G$ and $\omega_R$ 
appear. This is an example of a result which is often found in physics: the 
effect of a coupling between two oscillations is, in most cases, to separate 
their normal frequencies (the same phenomenon would occur here if the 
two oscillators originally had different angular frequencies). If, instead of 
two, we have an infinite number of oscillators (which, if isolated, would have 
the same frequency), we shall see in complement J$_V$ that the effect of the 
coupling is to create an infinite number of different frequencies for the modes. 
2. Vibrational states of the system in quantum mechanics 

Let us now reconsider the problem from a quantum mechanical point of view. 
We must now replace the positions $x_1(t)$, $x_2(t)$ and the momenta $p_1(t)$, $p_2(t)$ of the 
particles by operators, which we shall denote, respectively, by $X_1$, $X_2$, $P_1$, $P_2$. We 
then introduce, by analogy with (9), (10) and (18), the observables: 

$$X_G = \frac{1}{2}\, (X_1 + X_2), \qquad P_G = P_1 + P_2 \tag{23}$$

$$X_R = X_1 - X_2, \qquad P_R = \frac{1}{2}\, (P_1 - P_2) \tag{24}$$

To see if the operator $H$, the Hamiltonian of the system, can be put into a form 
analogous to (20), we shall begin by examining the commutation relations of $X_G$, 
$P_G$, $X_R$ and $P_R$. 

a. COMMUTATION RELATIONS 

Since all the observables concerning only particle (1) commute with those 
concerning particle (2), the only non-zero commutators involving $X_1$, $X_2$, $P_1$ 
and $P_2$ are: 

$$[X_1, P_1] = i\hbar, \qquad [X_2, P_2] = i\hbar \tag{25}$$

In particular, $X_1$ commutes with $X_2$, and we see immediately that: 

$$[X_G, X_R] = 0 \tag{26}$$

Similarly: 

$$[P_G, P_R] = 0 \tag{27}$$

Calculating the commutator $[X_G, P_G]$, we obtain: 

$$[X_G, P_G] = \frac{1}{2} \left\{ [X_1, P_1] + [X_1, P_2] + [X_2, P_1] + [X_2, P_2] \right\} = \frac{1}{2} \left\{ i\hbar + i\hbar \right\} = i\hbar \tag{28}$$

Similarly, one finds: 

$$[X_R, P_R] = i\hbar \tag{29}$$

The two commutators $[X_G, P_R]$ and $[X_R, P_G]$ remain to be examined; they are 
equal to: 

$$[X_G, P_R] = \frac{1}{4} \left\{ [X_1, P_1] - [X_1, P_2] + [X_2, P_1] - [X_2, P_2] \right\} = \frac{1}{4} \left\{ i\hbar - i\hbar \right\} = 0 \tag{30}$$

and, similarly: 

$$[X_R, P_G] = 0 \tag{31}$$

We can thus consider $X_G$, $P_G$ and $X_R$, $P_R$ to be the position and momentum 
operators of two distinct particles. Formulas (28) and (29) are the canonical 
commutation relations for each of these particles. Moreover, relations (26), (27), 
(30) and (31) express the fact that all the observables concerning one of them 
commute with all those which concern the other one. 
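
Because all four new observables are linear in $X_1$, $X_2$, $P_1$, $P_2$, the commutators (26) to (31) reduce to bookkeeping on coefficients, which can be sketched as follows. The tuple representation and the function name are illustrative choices; results are expressed in units of $i\hbar$.

```python
from fractions import Fraction

def commutator_over_i_hbar(a, b):
    """[A, B]/(i hbar) for A, B linear in (X1, X2, P1, P2), given as coefficient tuples."""
    ax1, ax2, ap1, ap2 = a
    bx1, bx2, bp1, bp2 = b
    # Only [X1, P1] = [X2, P2] = i hbar (and their opposites) are non-zero, cf. (25)
    return ax1 * bp1 + ax2 * bp2 - ap1 * bx1 - ap2 * bx2

half = Fraction(1, 2)
X_G = (half, half, 0, 0)        # (23)
P_G = (0, 0, 1, 1)
X_R = (1, -1, 0, 0)             # (24)
P_R = (0, 0, half, -half)

table = {
    "[X_G, X_R]": commutator_over_i_hbar(X_G, X_R),
    "[P_G, P_R]": commutator_over_i_hbar(P_G, P_R),
    "[X_G, P_G]": commutator_over_i_hbar(X_G, P_G),
    "[X_R, P_R]": commutator_over_i_hbar(X_R, P_R),
    "[X_G, P_R]": commutator_over_i_hbar(X_G, P_R),
    "[X_R, P_G]": commutator_over_i_hbar(X_R, P_G),
}
```

The same function can be reused to test other candidate changes of variables, for instance the unequal-mass definitions of comment (ii).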


b. TRANSFORMATION OF THE HAMILTONIAN OPERATOR 

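In the presence of the coupling $V(X_1, X_2)$, we have: 

$$H = T + U \tag{32}$$

with: 

$$T = \frac{1}{2m} \left( P_1^2 + P_2^2 \right) \tag{33}$$

(the kinetic energy operator) and: 

$$U = \frac{1}{2}\, m\omega^2 \left[ (X_1 - a)^2 + (X_2 + a)^2 + 2\lambda\, (X_1 - X_2)^2 \right] \tag{34}$$

(the potential energy operator). Since $P_1$ and $P_2$ commute, (33) can be transformed 
as if these operators were numbers; we find: 

$$T = \frac{P_G^2}{2\mu_G} + \frac{P_R^2}{2\mu_R} \tag{35}$$

where $\mu_G$ and $\mu_R$ are defined in (19). Similarly, since $X_1$ and $X_2$ commute, we have, 
as above [formula (20)]: 

$$U = \frac{1}{2}\, \mu_G\, \omega_G^2\, X_G^2 + \frac{1}{2}\, \mu_R\, \omega_R^2 \left( X_R - \frac{2a}{1 + 4\lambda} \right)^2 + m\omega^2 a^2\, \frac{4\lambda}{1 + 4\lambda} \tag{36}$$

where $\omega_G$ and $\omega_R$ are given by (13). 

Thus we see that $H$ can be put into a form which is analogous to (20), in 
which there is no coupling term: 

$$H = H_G + H_R + m\omega^2 a^2\, \frac{4\lambda}{1 + 4\lambda} \tag{37}$$

with: 

$$H_G = \frac{P_G^2}{2\mu_G} + \frac{1}{2}\, \mu_G\, \omega_G^2\, X_G^2, \qquad H_R = \frac{P_R^2}{2\mu_R} + \frac{1}{2}\, \mu_R\, \omega_R^2 \left( X_R - \frac{2a}{1 + 4\lambda} \right)^2 \tag{38}$$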
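c. STATIONARY STATES OF THE SYSTEM 

The state space of the system is the tensor product $\mathcal{E}(1) \otimes \mathcal{E}(2)$ of the state 
spaces of particles (1) and (2); it is also the tensor product $\mathcal{E}(G) \otimes \mathcal{E}(R)$ of the 
state spaces of the fictitious particles, the "center of mass" and the "relative particle" 
associated with each of the two modes. Since $H$ is the sum of two operators $H_G$ 
and $H_R$ which act only in $\mathcal{E}(G)$ and $\mathcal{E}(R)$ respectively (the constant $m\omega^2 a^2\, \frac{4\lambda}{1 + 4\lambda}$ 
merely introduces a shift in the energy origin), we know (chap. II, §F) that we can 
find a basis of eigenvectors of $H$ in the form: 

$$|\varphi\rangle = |\varphi^G\rangle\, |\varphi^R\rangle \tag{39}$$

where $|\varphi^G\rangle$ and $|\varphi^R\rangle$ are, respectively, eigenvectors of $H_G$ and $H_R$ in $\mathcal{E}(G)$ and $\mathcal{E}(R)$. 
Now, $H_G$ and $H_R$ are Hamiltonians of one-dimensional harmonic oscillators, whose 
eigenvectors and eigenvalues we know. If the operators $a_G$ and $a_R$ are defined by: 

$$a_G = \frac{1}{\sqrt{2}} \left[ \sqrt{\frac{\mu_G \omega_G}{\hbar}}\; X_G + i\, \frac{P_G}{\sqrt{\mu_G \hbar \omega_G}} \right], \qquad a_R = \frac{1}{\sqrt{2}} \left[ \sqrt{\frac{\mu_R \omega_R}{\hbar}}\; X'_R + i\, \frac{P_R}{\sqrt{\mu_R \hbar \omega_R}} \right] \tag{40-a}$$

with: 

$$X'_R = X_R - \frac{2a}{1 + 4\lambda} \tag{40-b}$$

and if $|\varphi_0^G\rangle$ and $|\varphi_0^R\rangle$ denote respectively the ground states of $H_G$ and $H_R$, the 
eigenvectors of $H_G$ are the vectors: 

$$|\varphi_n^G\rangle = \frac{1}{\sqrt{n!}}\, (a_G^\dagger)^n\, |\varphi_0^G\rangle \tag{41}$$

whose eigenvalues are: 

$$E_n^G = \left( n + \frac{1}{2} \right) \hbar\omega_G \tag{42}$$

those of $H_R$ being: 

$$|\varphi_p^R\rangle = \frac{1}{\sqrt{p!}}\, (a_R^\dagger)^p\, |\varphi_0^R\rangle \tag{43}$$

with the eigenvalues: 

$$E_p^R = \left( p + \frac{1}{2} \right) \hbar\omega_R \tag{44}$$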
Thus we have here a situation which is analogous to the one encountered
in the study of a two-dimensional anisotropic (since $\omega_G \neq \omega_R$) harmonic oscillator.
The stationary states of the system are given by:

$$|\varphi_{n,p}\rangle = |\varphi_n^G\rangle\,|\varphi_p^R\rangle = \frac{(a_G^\dagger)^n (a_R^\dagger)^p}{\sqrt{n!\,p!}}\,|\varphi_{0,0}\rangle \qquad (45)$$

and their energies are:

$$E_{n,p} = E_n^G + E_p^R + m\omega^2a^2\,\frac{4\lambda}{1+4\lambda} = \left( n + \frac{1}{2} \right)\hbar\omega_G + \left( p + \frac{1}{2} \right)\hbar\omega_R + m\omega^2a^2\,\frac{4\lambda}{1+4\lambda} \qquad (46)$$

The operators $a_G$ and $a_G^\dagger$ ($a_R$ and $a_R^\dagger$) can thus be seen to be destruction
or creation operators of an energy quantum in the mode corresponding
to $(G)$ [or $(R)$]. We see from (45) that, through the repeated action of $a_G^\dagger$ and $a_R^\dagger$,
we can obtain stationary states of the system in which the number of quanta in
each mode is arbitrary. The action of $a_G^\dagger$, $a_G$, $a_R^\dagger$ or $a_R$ on the stationary states $|\varphi_{n,p}\rangle$
is very simple:

$$a_G^\dagger\,|\varphi_{n,p}\rangle = \sqrt{n+1}\;|\varphi_{n+1,p}\rangle$$

$$a_G\,|\varphi_{n,p}\rangle = \sqrt{n}\;|\varphi_{n-1,p}\rangle$$

$$a_R^\dagger\,|\varphi_{n,p}\rangle = \sqrt{p+1}\;|\varphi_{n,p+1}\rangle$$

$$a_R\,|\varphi_{n,p}\rangle = \sqrt{p}\;|\varphi_{n,p-1}\rangle \qquad (47)$$

In general, there are no degenerate levels since there do not exist two
different pairs of integers $\{n, p\}$ and $\{n', p'\}$ such that:

$$n\omega_G + p\omega_R = n'\omega_G + p'\omega_R \qquad (48)$$

(except when the ratio $\omega_R/\omega_G = \sqrt{1+4\lambda}$ is rational).
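Condition (48) is easy to explore numerically. The following sketch lists, for a given $\lambda$, the pairs of distinct labels $(n, p)$ whose energies (46) coincide; energies are measured in units of $\hbar\omega_G$ and the common constant shift is dropped. The function names, the cutoff `n_max` and the tolerance `tol` are illustrative assumptions, and a floating-point comparison can of course only suggest, not prove, the absence of degeneracy.

```python
import math

def energy(n, p, lam, hbar_omega_G=1.0):
    """E_{n,p} of (46) without the constant shift, using omega_R/omega_G = sqrt(1 + 4*lam)."""
    ratio = math.sqrt(1.0 + 4.0 * lam)
    return hbar_omega_G * ((n + 0.5) + (p + 0.5) * ratio)

def degenerate_pairs(lam, n_max, tol=1e-9):
    """Pairs of distinct labels (n, p), 0 <= n, p < n_max, with equal energy."""
    labels = [(n, p) for n in range(n_max) for p in range(n_max)]
    pairs = []
    for i, a in enumerate(labels):
        for b in labels[i + 1:]:
            if abs(energy(*a, lam) - energy(*b, lam)) < tol:
                pairs.append((a, b))
    return pairs

# lam = 3/4 gives omega_R = 2*omega_G (rational ratio): degeneracies appear.
print(degenerate_pairs(0.75, 4))
# lam = 0.1 gives an irrational ratio: no coincidence among these levels.
print(degenerate_pairs(0.1, 6))
```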


d. EVOLUTION OF THE MEAN VALUES 

The most general state of the system is a linear superposition of stationary
states $|\varphi_{n,p}\rangle$:

$$|\psi(t)\rangle = \sum_{n,p} c_{n,p}(t)\,|\varphi_{n,p}\rangle \qquad (49)$$

with:

$$c_{n,p}(t) = c_{n,p}(0)\,e^{-iE_{n,p}t/\hbar} \qquad (50)$$

According to relations (40) and their adjoints, $X_G$ ($X'_R$) is a linear combination
of $a_G$ and $a_G^\dagger$ (of $a_R$ and $a_R^\dagger$). We then see, by using (47), that $X_G$ has non-zero
matrix elements between two states $|\varphi_{n,p}\rangle$ and $|\varphi_{n',p'}\rangle$ only when $n - n' = \pm 1$,
$p = p'$ (for $X'_R$, we would have $n = n'$, $p - p' = \pm 1$). From this we deduce that
the only Bohr frequencies which can appear in the time evolution of $\langle X_G \rangle(t)$
and $\langle X_R \rangle(t)$ are, respectively*:

$$\frac{E_{n\pm1,p} - E_{n,p}}{\hbar} = \pm\,\omega_G$$

$$\frac{E_{n,p\pm1} - E_{n,p}}{\hbar} = \pm\,\omega_R \qquad (51)$$

Thus we again find that $\langle X_G \rangle(t)$ and $\langle X_R \rangle(t)$ oscillate at angular frequencies
of $\omega_G$ and $\omega_R$, which recalls the classical result obtained in §1-a.


References and suggestions for further reading : 

Coupling between two classical oscillators: Berkeley 3 (7.1), §§1.4 and 3.3; 
Alonso and Finn (6.1), Vol. I, § 12.10. 


* For these frequencies actually to appear, at least one of the products $c^*_{n\pm1,p}\,c_{n,p}$ or $c^*_{n,p\pm1}\,c_{n,p}$
must be different from zero.

Complement J_V

VIBRATIONAL MODES OF AN INFINITE LINEAR CHAIN 
OF COUPLED HARMONIC OSCILLATORS ; PHONONS 


1. Classical treatment 

a. Equations of motion 

b. Simple solutions of the equations of motion 

c. Normal variables 

d. Total energy and energy of each mode 

2. Quantum mechanical treatment 

a. Stationary states in the absence of coupling 

b. Effects of coupling 

c. Normal operators. Commutation relations 

d. Stationary states in the presence of coupling 

3. Application to the study of vibrations in a crystal: phonons 

a. Outline of the problem 

b. Normal modes. The speed of sound in the crystal 


In complement H_V, we studied the motion of a system of two coupled harmonic
oscillators. We concluded, in essence, that, while the individual dynamical variables 
of each oscillator do not evolve independently, it is possible to introduce linear 
combinations of them (normal variables) which possess the important property 
of being uncoupled. Such variables describe vibrational normal modes of well- 
defined frequencies. Expressed in terms of these normal variables, the Hamiltonian 
of the system appears in the form of a sum of Hamiltonians of independent 
harmonic oscillators, thus making quantization simple. 

In this complement we shall show that these ideas are also applicable to a 
system formed by an infinite series of identical harmonic oscillators, regularly spaced 
along an axis, each one coupled to its neighbors. 

To do this, we shall determine the various vibrational normal modes of the 
system and show that each one corresponds to a collective vibration of the system 
of particles characterized by an angular frequency $\Omega$ and a wave vector $k$. The
process of finding the eigenstates and eigenvalues of the quantum mechanical 
Hamiltonian is then greatly simplified by the fact that the total energy of the 
system is the sum of the energies associated with each vibrational normal mode. 

The results obtained will enable us to indicate how vibrations propagate 
in a crystal and to introduce the concept of a phonon, a central idea in solid state 
physics. Of course, in this complement, we shall emphasize the introduction and 
quantization of the normal modes, and not the detailed properties of phonons, which 
would be treated in a solid state physics course. 
1. Classical treatment

a. EQUATIONS OF MOTION

Let us consider an infinite chain of identical one-dimensional harmonic
oscillators, each one labeled by an integer $q$ (positive, negative or zero). The
particle $M_q$ of mass $m$ which constitutes oscillator $(q)$ has its equilibrium position
at the point whose abscissa is $ql$ (fig. 1), where $l$ is the unit distance of the oscillator
chain. We denote by $x_q$ the (algebraic) displacement of oscillator $(q)$ with respect to its
equilibrium position. The state of the system at the instant $t$ is defined by specifying
the dynamical variables $x_q(t)$ and their time derivatives $\dot{x}_q(t)$ at this instant.


FIGURE 1

Infinite chain of oscillators; the displacement of the $q$th particle with respect to its equilibrium
position $ql$ is denoted by $x_q$.


In the absence of interactions between the various particles, the potential
energy of the system is:

$$U(\ldots, x_{-1}, x_0, x_{+1}, \ldots) = \sum_{q=-\infty}^{+\infty} \frac{1}{2}\,m\omega^2 x_q^2 \qquad (1)$$

where $\omega$ is the angular frequency of each oscillator. The evolution of the system is
then given by the equations:

$$m\,\frac{d^2}{dt^2}x_q(t) = -\,m\omega^2 x_q(t) \qquad (2)$$

whose solutions are:

$$x_q(t) = x_q^M \cos(\omega t - \varphi_q) \qquad (3)$$

where the integration constants $x_q^M$ and $\varphi_q$ are fixed by the initial conditions of the
motion. The oscillators therefore vibrate independently.

Now imagine that these particles are interacting. For simplicity, we shall
assume that one need take into account only the forces exerted on a particle by its
two nearest neighbors and that these forces are attractive and proportional to the
distance. Thus, particle $(q)$ is subjected to two new attractive forces exerted by
particles $(q+1)$ and $(q-1)$. These forces are proportional to $|M_qM_{q+1}|$
and $|M_qM_{q-1}|$ (the coefficient of proportionality being the same in both cases). The
total force $F_q$ to which particle $(q)$ is subjected can therefore be written:

$$\begin{aligned} F_q &= -\,m\omega^2 x_q - m\omega_1^2\,[\,ql + x_q - (q+1)l - x_{q+1}\,] - m\omega_1^2\,[\,ql + x_q - (q-1)l - x_{q-1}\,] \\ &= -\,m\omega^2 x_q - m\omega_1^2\,(x_q - x_{q+1}) - m\omega_1^2\,(x_q - x_{q-1}) \end{aligned} \qquad (4)$$

where $\omega_1$ is a constant [having the dimensions of an inverse time] which characterizes
the intensity of the coupling. Equations (2) must now be replaced by:

$$m\,\frac{d^2}{dt^2}x_q(t) = -\,m\omega^2 x_q(t) - m\omega_1^2\,[\,2x_q(t) - x_{q+1}(t) - x_{q-1}(t)\,] \qquad (5)$$

It can easily be verified that the interaction forces [terms in $\omega_1^2$ of (4)] are derived
from the potential energy of the coupling $V$ given by:

$$V(\ldots, x_{-1}, x_0, x_{+1}, \ldots) = \frac{1}{2}\,m\omega_1^2 \sum_{q=-\infty}^{+\infty} (x_q - x_{q+1})^2 \qquad (6)$$

According to (5), the evolution of $x_q$ depends on $x_{q+1}$ and $x_{q-1}$. Therefore,
we must solve an infinite system of coupled differential equations. Before we introduce
new variables which allow these equations to be uncoupled, it is interesting to try
to find simple solutions of equations (5) and investigate their physical significance.


b. SIMPLE SOLUTIONS OF THE EQUATIONS OF MOTION 

α. Existence of simple solutions

The infinite chain of coupled oscillators which we are studying is analogous
to an infinite macroscopic spring. Now, we know that progressive longitudinal
waves (corresponding to expansions and compressions) can propagate along this
spring. Under the influence of a sinusoidal wave of this type, of wave vector $k$ and
angular frequency $\Omega$, the point of the spring whose abscissa is $x$ at equilibrium
is found at time $t$, at $x + u(x, t)$, with:

$$u(x, t) = \mu\,e^{i(kx - \Omega t)} + \mu^*\,e^{-i(kx - \Omega t)} \qquad (7)$$

Such solutions of the equations of motion (5) do indeed exist. However, since
the oscillator chain is not a continuous medium, the effects of the wave are observed
only at a series of points, corresponding to the abscissas $x = ql$; $u(ql, t)$ thus
represents the displacement of oscillator $(q)$ at time $t$:

$$x_q(t) = u(ql, t) = \mu\,e^{i(kql - \Omega t)} + \mu^*\,e^{-i(kql - \Omega t)} \qquad (8)$$

It is easy to verify that this expression is a solution of equations (5) if $\Omega$ and $k$ satisfy:

$$-\,m\Omega^2 = -\,m\omega^2 - m\omega_1^2\,[\,2 - e^{ikl} - e^{-ikl}\,] \qquad (9)$$

$\Omega$ is therefore related to $k$ by the “dispersion relation”:

$$\Omega(k) = \sqrt{\omega^2 + 4\omega_1^2 \sin^2\!\left( \frac{kl}{2} \right)} \qquad (10)$$

which we shall discuss in detail later (§1-b-δ).
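The verification mentioned above can also be carried out numerically. The sketch below takes the wave (8) with a real amplitude $\mu$ (an assumption made only to keep the example short), evaluates both sides of (5) divided by $m$, and returns their difference, which should vanish to rounding error when $\Omega$ is given by (10). The function names and the sample values of $k$, $\omega$, $\omega_1$ are illustrative.

```python
import math

def Omega(k, omega, omega1, l=1.0):
    """Dispersion relation (10)."""
    return math.sqrt(omega**2 + 4.0 * omega1**2 * math.sin(k * l / 2.0)**2)

def x(q, t, k, omega, omega1, l=1.0, mu=0.5):
    """Wave (8) for real mu: x_q(t) = 2*mu*cos(k*q*l - Omega*t)."""
    return 2.0 * mu * math.cos(k * q * l - Omega(k, omega, omega1, l) * t)

def residual(q, t, k, omega, omega1, l=1.0):
    """Second time derivative of x_q minus the right-hand side of (5), both divided by m."""
    Om = Omega(k, omega, omega1, l)
    xq = x(q, t, k, omega, omega1, l)
    acceleration = -Om**2 * xq  # exact second derivative of the cosine in t
    force_over_m = (-omega**2 * xq
                    - omega1**2 * (2.0 * xq
                                   - x(q + 1, t, k, omega, omega1, l)
                                   - x(q - 1, t, k, omega, omega1, l)))
    return acceleration - force_over_m

print(residual(q=3, t=0.7, k=1.3, omega=2.0, omega1=1.5))
```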

β. Physical interpretation

In the solution (8) of the equations of motion, all the oscillators are vibrating
at the same frequency $\Omega/2\pi$, with the same amplitude $|2\mu|$, but with a phase which
depends periodically on their rest positions. It is as if the displacements of the
various oscillators were determined by a progressive sinusoidal wave of wave
vector $k$ and phase velocity:

$$v_\varphi(k) = \frac{\Omega(k)}{k} \qquad (11)$$

This is easy to show. Using (8), we see that:

$$x_{q_1+q_2}(t) = x_{q_1}\!\left( t - \frac{q_2\,l}{v_\varphi} \right) \qquad (12)$$

Thus, oscillator $(q_1 + q_2)$ performs the same motion as oscillator $(q_1)$, shifted by
the time taken by the wave to travel, at a velocity $v_\varphi$, the distance $q_2 l$ separating
the two oscillators. Since all the oscillators are then in motion, solutions (8) are
called “collective modes” of vibration of the system.

γ. Possible values of the wave vector k

Consider two values of the wave vector, $k$ and $k'$, which differ by an integral
number of $2\pi/l$:

$$k' = k + \frac{2n\pi}{l} \quad \text{with } n \text{ an integer (positive or negative)} \qquad (13)$$

We have, obviously:

$$e^{ik'ql} = e^{ikql}$$

$$\Omega(k') = \Omega(k) \qquad (14)$$

where the second relation follows directly from (10).

We see from (8) that the two progressive waves $k$ and $k'$ lead to the same
motion for the oscillators and are, consequently, physically indistinguishable.
Therefore in the problem we are studying here, it suffices to let $k$ vary over an
interval of $2\pi/l$. For reasons of symmetry, we choose:

$$-\frac{\pi}{l} < k < \frac{\pi}{l} \qquad (15)$$

The corresponding interval is often called the “first Brillouin zone”.
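A short numerical illustration of (13) and (14): the displacements produced by $k$ and by $k' = k + 2n\pi/l$ coincide at every site. In the sketch below, `fold_to_first_zone` is a hypothetical helper that brings any wave vector back into the zone; the half-open convention $[-\pi/l, \pi/l)$ it uses at the end points, the real amplitude `mu` and the sample parameters are assumptions of the example.

```python
import math

def fold_to_first_zone(k, l=1.0):
    """Map k to the equivalent wave vector in [-pi/l, pi/l)."""
    width = 2.0 * math.pi / l
    return (k + math.pi / l) % width - math.pi / l

def displacement(q, t, k, omega, omega1, l=1.0, mu=0.5):
    """Wave (8) for real mu, with Omega(k) taken from (10)."""
    Om = math.sqrt(omega**2 + 4.0 * omega1**2 * math.sin(k * l / 2.0)**2)
    return 2.0 * mu * math.cos(k * q * l - Om * t)

k = 0.9
k_prime = k + 3 * 2.0 * math.pi  # n = 3 in (13), with l = 1

# Largest difference between the two motions over a range of sites.
worst = max(abs(displacement(q, 0.4, k, 1.0, 0.8)
                - displacement(q, 0.4, k_prime, 1.0, 0.8))
            for q in range(-10, 11))
print(worst, fold_to_first_zone(k_prime))
```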

δ. Dispersion relation

The dispersion relation (10) which gives the angular frequency $\Omega(k)$ associated
with each value of $k$ enables us to study the propagation of vibrations in the system.
If, for example, we form a “wave packet” by superposing waves with different wave
vectors, we know that it has a group velocity given by:

$$v_G(k) = \frac{d\Omega(k)}{dk} \qquad (16)$$

which is different from $v_\varphi$.

Figure 2 shows the form of the variation of $\Omega(k)$ with respect to $k$, where $k$
varies within the first Brillouin zone.



FIGURE 2

Dispersion relation giving the variation of the angular frequency of the vibrational normal modes
with respect to the wave number $k$ in the first Brillouin zone $[-\pi/l, +\pi/l]$. The dashed line
corresponds to the case $\omega = 0$.


It is clear from this figure that $\Omega(k)$ cannot take on arbitrary values: a vibration
of frequency $\nu$ can propagate freely in the medium only if $\nu$ falls within the
“allowed band”:

$$\frac{\omega}{2\pi} \le \nu \le \frac{\sqrt{\omega^2 + 4\omega_1^2}}{2\pi} \qquad (17)$$

The other values of $\nu$ correspond to “forbidden bands”. The two limiting frequencies
of interval (17) are often called “cut-off frequencies”.
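To make (11), (16) and (17) concrete, the sketch below evaluates the phase velocity, the group velocity and the two cut-off frequencies for sample parameters. The closed form used for $v_G$, namely $\omega_1^2\, l \sin(kl)/\Omega(k)$, is our own differentiation of (10), not a formula quoted from the text; the function names and numerical values are illustrative.

```python
import math

def Omega(k, omega, omega1, l=1.0):
    """Dispersion relation (10)."""
    return math.sqrt(omega**2 + 4.0 * omega1**2 * math.sin(k * l / 2.0)**2)

def phase_velocity(k, omega, omega1, l=1.0):
    """Relation (11); undefined at k = 0."""
    return Omega(k, omega, omega1, l) / k

def group_velocity(k, omega, omega1, l=1.0):
    """Relation (16), obtained by differentiating (10) with respect to k."""
    return omega1**2 * l * math.sin(k * l) / Omega(k, omega, omega1, l)

def allowed_band(omega, omega1):
    """Lower and upper cut-off frequencies of (17)."""
    return omega / (2.0 * math.pi), math.sqrt(omega**2 + 4.0 * omega1**2) / (2.0 * math.pi)

omega, omega1, k = 1.0, 0.7, 0.8
print(phase_velocity(k, omega, omega1), group_velocity(k, omega, omega1))
print(allowed_band(omega, omega1))
```

With these formulas the group velocity vanishes at $k = 0$ and at the zone boundary $k = \pm\pi/l$, where $\Omega(k)$ reaches the two ends of the allowed band.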

The mode of lowest angular frequency, $\Omega(0) = \omega$, has a zero wave vector $k$;
it corresponds to an in-phase vibration of all the oscillators, whose particles are
moving “together” without changing their relative distances (fig. 3). This explains
why the angular frequency of this mode is the same as in the absence of
coupling (cf. complement H_V, §1-b).


FIGURE 3

The lowest frequency mode ($k = 0$, $\Omega = \omega$) corresponds to a displacement of the system of
oscillators “as a whole”. This is why its frequency does not depend on the coupling $V$.
As for the mode of highest angular frequency, $\Omega(\pm\pi/l) = \sqrt{\omega^2 + 4\omega_1^2}$, in the
corresponding vibration of the system, two adjacent oscillators are completely
out of phase (fig. 4); the effect of the attractive forces due to the coupling $V$ is then
maximal.


FIGURE 4

The modes $k = \pm\pi/l$ are those in which two neighboring oscillators are completely out of phase;
the coupling $V$ strongly modifies their frequency.


c. NORMAL VARIABLES 


α. Obtaining uncoupled equations

Returning to the equations of motion (5), let us introduce new dynamical
variables (linear combinations of the $x_q$) which evolve independently.

To do so, we multiply both sides of equation (5) by the quantity $e^{-ikql}$ and
sum over $q$. If we notice that:

$$\sum_{q=-\infty}^{+\infty} x_{q\pm1}\,e^{-iqkl} = e^{\pm ikl} \sum_{q=-\infty}^{+\infty} x_{q\pm1}\,e^{-i(q\pm1)kl} = e^{\pm ikl} \sum_{q=-\infty}^{+\infty} x_q\,e^{-iqkl} \qquad (18)$$

and if we set:

$$\xi(k, t) = \sum_{q=-\infty}^{+\infty} x_q(t)\,e^{-iqkl} \qquad (19)$$

we see that (5) becomes:

$$\frac{d^2}{dt^2}\,\xi(k, t) = -\left[ \omega^2 + \omega_1^2\left( 2 - e^{ikl} - e^{-ikl} \right) \right]\xi(k, t) \qquad (20)$$

that is, taking (9) into account:

$$\frac{d^2}{dt^2}\,\xi(k, t) = -\,\Omega^2(k)\,\xi(k, t) \qquad (21)$$

This relation shows that the time evolution of $\xi(k, t)$ is independent of that of $\xi(k', t)$
for $k'$ different from $k$. The quantities $\xi(k, t)$ introduced in (19) are therefore
completely uncoupled and have a remarkably simple equation of motion.
comments:

(i) Equations (5) are easy to uncouple because the problem is invariant
under a translation of the system of oscillators by a quantity $\pm l$ (replacement
of $q$ by $q \pm 1$). This invariance is itself due to the fact that the chain
is regular and infinite.

(ii) In reality, every chain is of course finite, even if it contains a very large number $\mathcal{N}$ of
oscillators. To find its vibrational normal modes, one must thus take into account the
boundary conditions at the two ends of the chain, and the problem becomes much more
complicated (edge effects). Instead of obtaining, as we do here, a continuous infinity of
vibrational normal modes which correspond to the various values of $k$ in the first
Brillouin zone, one finds a finite number of eigenmodes, equal to the number $\mathcal{N}$ of
oscillators. When one is concerned only with the behavior of the chain far from the ends,
one often introduces artificial boundary conditions which are different from the real
boundary conditions but which have the advantage of simplifying the calculations while
conserving the essential physical properties. Thus, one requires the two end oscillators
to have the same motion (“periodic” boundary conditions, also called “Born-Von Karman
conditions”). Since we shall have the opportunity to return to this question in connection
with the study of other periodic structures (cf. complement F_XI; see also §1-c of
complement C_XIV), we shall not dwell any further on periodic boundary conditions, but shall
continue our discussion, confining ourselves to the simple case of an infinite chain.
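Although the text does not pursue periodic boundary conditions, their effect is simple to illustrate. The sketch below assumes a ring of `N` oscillators (oscillator `N` identified with oscillator 0), for which the allowed wave vectors are $k_j = 2\pi j/(Nl)$, and checks that each plane wave $e^{ik_jql}$ is reproduced, up to the factor $\Omega^2(k_j)$ of (10), by the right-hand side of (5). The function names and the choice of one representative $k_j$ per class modulo $2\pi/l$ are assumptions of this example.

```python
import cmath
import math

def ring_modes(N, omega, omega1, l=1.0):
    """Allowed (k, Omega) pairs for a ring of N oscillators, one k per class modulo 2*pi/l."""
    modes = []
    for j in range(-(N // 2), N - N // 2):
        k = 2.0 * math.pi * j / (N * l)
        Om = math.sqrt(omega**2 + 4.0 * omega1**2 * math.sin(k * l / 2.0)**2)
        modes.append((k, Om))
    return modes

def apply_dynamical_matrix(x, omega, omega1):
    """Minus the right-hand side of (5) divided by m, with periodic neighbors."""
    N = len(x)
    return [omega**2 * x[q]
            + omega1**2 * (2.0 * x[q] - x[(q + 1) % N] - x[(q - 1) % N])
            for q in range(N)]

def max_eigen_residual(N, omega, omega1, l=1.0):
    """Largest deviation of D x from Omega^2 x over all ring modes."""
    worst = 0.0
    for k, Om in ring_modes(N, omega, omega1, l):
        x = [cmath.exp(1j * k * q * l) for q in range(N)]
        Dx = apply_dynamical_matrix(x, omega, omega1)
        worst = max(worst, max(abs(Dx[q] - Om**2 * x[q]) for q in range(N)))
    return worst

print(len(ring_modes(8, 1.0, 0.5)), max_eigen_residual(8, 1.0, 0.5))
```

The ring has exactly `N` eigenmodes, in agreement with the count stated in comment (ii), and their frequencies sample the curve (10).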

The function $\xi(k, t)$ introduced in (19) is, by definition, the sum of a Fourier
series whose coefficients are the displacements $x_q(t)$. It is a periodic function,
of period $2\pi/l$, which is therefore perfectly well-defined when its values in the
interval $-\pi/l < k < \pi/l$ are specified [this is the first Brillouin zone, defined in (15)].
$\xi(k, t)$ depends on the positions of all the oscillators at time $t$. Conversely, these
positions are unambiguously defined when the values of $\xi$ in the interval (15) are
given for time $t$. This is true because it is possible to invert relation (19) since, using:

$$\int_{-\pi/l}^{+\pi/l} dk\; e^{i(q-q')kl} = \frac{2\pi}{l}\,\delta_{qq'} \qquad (22)$$

we obtain:

$$x_q(t) = \frac{l}{2\pi} \int_{-\pi/l}^{+\pi/l} dk\; \xi(k, t)\,e^{iqkl} \qquad (23)$$

Note also that, since the displacements $x_q(t)$ are real, the function $\xi(k, t)$ satisfies:

$$\xi(-k, t) = \xi^*(k, t) \qquad (24)$$

Similarly, one can, using the momenta $p_q(t) = m\dot{x}_q(t)$, define the function:

$$\pi(k, t) = \sum_q p_q(t)\,e^{-iqkl} \qquad (25)$$

and the $p_q(t)$ can be expressed as:

$$p_q(t) = \frac{l}{2\pi} \int_{-\pi/l}^{+\pi/l} \pi(k, t)\,e^{ikql}\,dk \qquad (26)$$
The fact that the $p_q(t)$ are real implies that:

$$\pi(-k, t) = \pi^*(k, t) \qquad (27)$$

Differentiating both sides of (19) term by term, and using (25) and then (21), we
finally obtain:

$$m\,\frac{d}{dt}\,\xi(k, t) = \pi(k, t) \qquad (28\text{-a})$$

$$\frac{d}{dt}\,\pi(k, t) = -\,m\Omega^2(k)\,\xi(k, t) \qquad (28\text{-b})$$

At time $t$, the dynamical state of the system is characterized by the
specification of the $x_q(t)$ and $p_q(t)$ for all integers $q$ (positive, negative or zero) as well
as by that of the “normal variables” $\xi(k, t)$ and $\pi(k, t)$ (where $k$ can take on any
value in the first Brillouin zone). The equations of motion (28) of the normal
variables corresponding to each value of $k$ describe the evolution of the position
and momentum of a harmonic oscillator of mass $m$ and angular frequency $\Omega(k)$;
however, $\xi$ and $\pi$ are complex. Thus we have reduced the study of an infinite but
discrete chain of coupled harmonic oscillators to that of a continuous system of fictitious
independent oscillators (labeled by the index $k$).

comment:

Rigorously, these fictitious oscillators are not completely independent
since, according to conditions (24) and (27), the initial values $\xi(k, 0)$
and $\pi(k, 0)$ must satisfy:

$$\xi(k, 0) = \xi^*(-k, 0)$$

$$\pi(k, 0) = \pi^*(-k, 0) \qquad (29)$$

β. Normal variables α(k, t) associated with the progressive waves

It is convenient (see also §1-a of complement G_V) to condense the two
normal variables $\xi(k, t)$ and $\pi(k, t)$ into one, $\alpha(k, t)$, defined by:

$$\alpha(k, t) = \frac{1}{\sqrt{2}}\left[\, \hat{\xi}(k, t) + i\,\hat{\pi}(k, t) \,\right] \qquad (30)$$

where $\hat{\xi}(k, t)$ and $\hat{\pi}(k, t)$ are dimensionless quantities proportional to $\xi(k, t)$
and $\pi(k, t)$:

$$\hat{\xi}(k, t) = \beta(k)\,\xi(k, t)$$

$$\hat{\pi}(k, t) = \frac{1}{\hbar\,\beta(k)}\,\pi(k, t) \qquad (31)$$

To simplify the quantum mechanical calculations presented later, we shall set:

$$\beta(k) = \sqrt{\frac{m\,\Omega(k)}{\hbar}} \qquad (32)$$

It is easy to show, using (30), that the two equations (28) are equivalent to the
single equation:

$$i\,\frac{d}{dt}\,\alpha(k, t) = \Omega(k)\,\alpha(k, t) \qquad (33)$$

which is of first order in $t$ [$\alpha(k, t)$ is completely defined by the specification
of $\alpha(k, 0)$, while $\xi(k, t)$ depends on $\xi(k, 0)$ and $\pi(k, 0)$]. The general solution of (33) is:

$$\alpha(k, t) = \alpha(k, 0)\,e^{-i\Omega(k)t} \qquad (34)$$
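The equivalence between (28) and (33) can be illustrated for a single value of $k$. In the sketch below, `evolve` is the exact solution of equations (28) for one fictitious oscillator, `alpha` implements (30) to (32), and the printed difference between $\alpha(t)$ and $\alpha(0)\,e^{-i\Omega t}$ should vanish to rounding error. Real initial values and $\hbar = 1$ are assumptions made for brevity; the function names are illustrative.

```python
import cmath
import math

def alpha(xi, pi_, m, Omega, hbar=1.0):
    """Normal variable (30) with beta = sqrt(m*Omega/hbar) as in (32)."""
    beta = math.sqrt(m * Omega / hbar)
    return (beta * xi + 1j * pi_ / (hbar * beta)) / math.sqrt(2.0)

def evolve(xi0, pi0, m, Omega, t):
    """Exact solution of (28): m*dxi/dt = pi, dpi/dt = -m*Omega**2*xi."""
    c, s = math.cos(Omega * t), math.sin(Omega * t)
    xi_t = xi0 * c + pi0 / (m * Omega) * s
    pi_t = pi0 * c - m * Omega * xi0 * s
    return xi_t, pi_t

xi0, pi0, m, Om, t = 0.3, -1.2, 2.0, 1.7, 0.9
xi_t, pi_t = evolve(xi0, pi0, m, Om, t)
difference = alpha(xi_t, pi_t, m, Om) - alpha(xi0, pi0, m, Om) * cmath.exp(-1j * Om * t)
print(abs(difference))
```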


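Using (19) and (25), we easily obtain the expression for $\alpha(k, t)$ in terms of
the $x_q(t)$ and $p_q(t)$:

$$\alpha(k, t) = \frac{1}{\sqrt{2}}\,\beta(k) \sum_q e^{-iqkl} \left[ x_q(t) + \frac{i}{m\,\Omega(k)}\,p_q(t) \right] \qquad (35)$$

Let us show, conversely, that the $x_q(t)$ and $p_q(t)$ can be simply expressed in terms
of the $\alpha(k, t)$. According to (24) and (27):

$$\alpha^*(-k, t) = \frac{1}{\sqrt{2}}\left[\, \hat{\xi}^*(-k, t) - i\,\hat{\pi}^*(-k, t) \,\right] = \frac{1}{\sqrt{2}}\left[\, \hat{\xi}(k, t) - i\,\hat{\pi}(k, t) \,\right] \qquad (36)$$

From this we deduce:

$$\hat{\xi}(k, t) = \frac{1}{\sqrt{2}}\left[\, \alpha(k, t) + \alpha^*(-k, t) \,\right] \qquad (37\text{-a})$$

$$\hat{\pi}(k, t) = -\frac{i}{\sqrt{2}}\left[\, \alpha(k, t) - \alpha^*(-k, t) \,\right] \qquad (37\text{-b})$$

which allows us to write formula (23) in the form:

$$x_q(t) = \frac{l}{2\pi\sqrt{2}} \left\{ \int_{-\pi/l}^{+\pi/l} \frac{\alpha(k, t)}{\beta(k)}\,e^{iqkl}\,dk + \int_{-\pi/l}^{+\pi/l} \frac{\alpha^*(-k, t)}{\beta(k)}\,e^{iqkl}\,dk \right\} \qquad (38)$$

Changing $k$ to $-k$ in the second integral, we finally obtain [$\beta(k)$ is an even
function of $k$]:

$$x_q(t) = \frac{l}{2\pi\sqrt{2}} \left\{ \int_{-\pi/l}^{+\pi/l} \frac{\alpha(k, t)}{\beta(k)}\,e^{iqkl}\,dk + \int_{-\pi/l}^{+\pi/l} \frac{\alpha^*(k, t)}{\beta(k)}\,e^{-iqkl}\,dk \right\} \qquad (39)$$

An analogous calculation, starting with (26), yields:

$$p_q(t) = \frac{\hbar\,l}{2\pi i\sqrt{2}} \left\{ \int_{-\pi/l}^{+\pi/l} \beta(k)\,\alpha(k, t)\,e^{iqkl}\,dk - \int_{-\pi/l}^{+\pi/l} \beta(k)\,\alpha^*(k, t)\,e^{-iqkl}\,dk \right\} \qquad (40)$$

The state of the system is therefore described by the $\alpha(k, t)$ as well as by the set
of $x_q(t)$ and $p_q(t)$.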
If we replace, in (39), $\alpha(k, t)$ by its general expression (34), $x_q(t)$ takes on the
form:

$$x_q(t) = \frac{l}{2\pi\sqrt{2}} \int_{-\pi/l}^{+\pi/l} dk \left[ \frac{\alpha(k, 0)}{\beta(k)}\,e^{i[qkl - \Omega(k)t]} + \text{c.c.} \right] \qquad (41)$$

The most general solution of the problem of the chain of coupled oscillators is
therefore a linear superposition of the progressive waves introduced in §1-b (where
the coefficients of this linear combination are $\frac{l}{2\pi\sqrt{2}}\,\frac{\alpha(k, 0)}{\beta(k)}$). These progressive waves
constitute the vibrational normal modes of the system*.

comment:

For each value of $k$, the two terms appearing on the right-hand sides
of (39) and (40) are complex conjugates of each other. This ensures the reality
of the $x_q(t)$ and $p_q(t)$ without the necessity of imposing an arbitrary condition
on the $\alpha(k, t)$. The $\alpha(k, t)$, consequently, are truly independent variables.


d. TOTAL ENERGY AND ENERGY OF EACH OF THE MODES 


The total energy of the system under consideration is the sum of the kinetic
energies of each particle $(q)$ and the potential energies (1) and (6):

$$\mathcal{H}(\ldots, x_{-1}, x_0, x_{+1}, \ldots;\ \ldots, p_{-1}, p_0, p_{+1}, \ldots) = \sum_{q=-\infty}^{+\infty} \left[ \frac{1}{2m}\,p_q^2 + \frac{1}{2}\,m\omega^2 x_q^2 + \frac{1}{2}\,m\omega_1^2\,(x_q - x_{q+1})^2 \right] \qquad (42)$$

We shall see in this section that this energy can be expressed very simply in
terms of the energies which can be associated with each of the modes.

Let us therefore calculate the various sums involved in (42). Since the
displacements $x_q$ are the coefficients of the Fourier series which defines the
function $\xi(k, t)$, Parseval’s relation [appendix I, relation (18)] immediately yields:

$$\sum_{q=-\infty}^{+\infty} (x_q)^2 = \frac{l}{2\pi} \int_{-\pi/l}^{+\pi/l} |\xi(k, t)|^2\,dk \qquad (43)$$

$$\sum_{q=-\infty}^{+\infty} (p_q)^2 = \frac{l}{2\pi} \int_{-\pi/l}^{+\pi/l} |\pi(k, t)|^2\,dk \qquad (44)$$


* We could also have introduced the modes corresponding to stationary waves in the system (sum 
of two progressive waves of the same frequency and opposite velocities). We would then have obtained 
equivalent results but the motion of the system would have been expanded on a different “basis”. 
An expansion of this type is used in complement K_V.
The sum which, in (42), corresponds to the coupling, now remains to be calculated.
To do so, notice as in (18) that, if the displacements $x_q$ are the coefficients of the
Fourier series of $\xi(k, t)$, the $x_{q+1}$ are those of $e^{ikl}\,\xi(k, t)$. The quantities $(x_q - x_{q+1})$
are therefore the coefficients of the Fourier series of $(1 - e^{ikl})\,\xi(k, t)$, and
Parseval’s relation yields:

$$\sum_{q=-\infty}^{+\infty} (x_q - x_{q+1})^2 = \frac{l}{2\pi} \int_{-\pi/l}^{+\pi/l} \left| (1 - e^{ikl})\,\xi(k, t) \right|^2 dk = \frac{l}{2\pi} \int_{-\pi/l}^{+\pi/l} 4 \sin^2\!\left( \frac{kl}{2} \right) |\xi(k, t)|^2\,dk \qquad (45)$$

Substituting (43), (44) and (45) into (42), we finally obtain:

$$\mathcal{H} = \frac{l}{2\pi} \int_{-\pi/l}^{+\pi/l} \left\{ \frac{m}{2} \left[ \omega^2 + 4\omega_1^2 \sin^2\!\left( \frac{kl}{2} \right) \right] |\xi(k, t)|^2 + \frac{1}{2m}\,|\pi(k, t)|^2 \right\} dk \qquad (46)$$

We shall write this result in the form:

$$\mathcal{H} = \frac{l}{2\pi} \int_{-\pi/l}^{+\pi/l} h(k)\,dk \qquad (47)$$

with:

$$h(k) = \frac{1}{2}\,m\Omega^2(k)\,|\xi(k, t)|^2 + \frac{1}{2m}\,|\pi(k, t)|^2 \qquad (48)$$

where $\Omega(k)$ is given by (10). $\mathcal{H}$ is thus the sum (in fact, the integral) of the energies
associated with the fictitious uncoupled harmonic oscillators for which $\xi(k, t)$ gives
the position and $\pi(k, t)$, the momentum.

We can also express $h(k)$ in terms of the variables $\alpha(k, t)$ associated with each
normal mode. Using (37), we transform expression (48) into:

$$h(k) = \frac{1}{2}\,\hbar\Omega(k) \left[\, \alpha(k, t)\,\alpha^*(k, t) + \alpha^*(-k, t)\,\alpha(-k, t) \,\right] \qquad (49)$$

that is, taking (34) into account:

$$h(k) = \frac{1}{2}\,\hbar\Omega(k) \left[\, \alpha(k, 0)\,\alpha^*(k, 0) + \alpha^*(-k, 0)\,\alpha(-k, 0) \,\right] \qquad (50)$$

$h(k)$ is therefore time-independent, which is not surprising since $h(k)$ is the energy
of a harmonic oscillator. Furthermore, we again find in (47) that the fictitious
oscillators are independent, since the total energy $\mathcal{H}$ is simply the sum of the
energies associated with each of them.
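A finite analogue of (42) to (48) can be checked directly. The sketch below assumes a ring of `N` oscillators with periodic conditions, so that the integral $\frac{l}{2\pi}\int dk$ is replaced by the average $\frac{1}{N}\sum_j$ over the wave vectors $k_j = 2\pi j/(Nl)$; this discretization, the function names and the sample data are assumptions of the example, not part of the text's infinite-chain treatment.

```python
import cmath
import math

def fourier_sum(values, k, l=1.0):
    """Finite version of (19) or (25): sum over q of values[q] * exp(-i*q*k*l)."""
    return sum(v * cmath.exp(-1j * q * k * l) for q, v in enumerate(values))

def energy_direct(xs, ps, m, omega, omega1):
    """Expression (42) restricted to a ring (site N identified with site 0)."""
    N = len(xs)
    return sum(ps[q]**2 / (2.0 * m)
               + 0.5 * m * omega**2 * xs[q]**2
               + 0.5 * m * omega1**2 * (xs[q] - xs[(q + 1) % N])**2
               for q in range(N))

def energy_by_modes(xs, ps, m, omega, omega1, l=1.0):
    """Average over the N allowed k of h(k) as defined in (48)."""
    N = len(xs)
    total = 0.0
    for j in range(N):
        k = 2.0 * math.pi * j / (N * l)
        Omega2 = omega**2 + 4.0 * omega1**2 * math.sin(k * l / 2.0)**2
        total += (0.5 * m * Omega2 * abs(fourier_sum(xs, k, l))**2
                  + abs(fourier_sum(ps, k, l))**2 / (2.0 * m))
    return total / N

xs = [0.1, -0.4, 0.25, 0.0, 0.3]
ps = [0.5, 0.2, -0.7, 0.1, 0.0]
print(energy_direct(xs, ps, 1.5, 1.0, 0.8), energy_by_modes(xs, ps, 1.5, 1.0, 0.8))
```

The two numbers agree: the coupling term of (42) is absorbed into the frequencies $\Omega(k)$ of the independent modes.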

Substituting expression (49) into (47), we obtain:

$$\mathcal{H} = \frac{l}{2\pi} \int_{-\pi/l}^{+\pi/l} dk\; \frac{1}{2}\,\hbar\Omega(k) \left[\, \alpha(k, t)\,\alpha^*(k, t) + \alpha(-k, t)\,\alpha^*(-k, t) \,\right] \qquad (51)$$

We can then change $k$ to $-k$ in the integral of the second term and consider $\mathcal{H}$
to be the sum of the energies $h'(k)$ associated with the normal modes characterized
by the $\alpha(k, t)$:

$$\mathcal{H} = \frac{l}{2\pi} \int_{-\pi/l}^{+\pi/l} dk\; h'(k) \qquad (52)$$

with:

$$h'(k) = \hbar\Omega(k)\,\alpha^*(k, t)\,\alpha(k, t) = \hbar\Omega(k)\,\alpha^*(k, 0)\,\alpha(k, 0) \qquad (53)$$

2. Quantum mechanical treatment 

The quantum mechanical treatment of the problem of the infinite chain of
coupled oscillators is based, in accordance with the general quantization rules,
on the replacement of the classical quantities $x_q(t)$ and $p_q(t)$ by the observables $X_q$
and $P_q$ which satisfy the canonical commutation relations:

$$[X_q, P_{q'}] = i\hbar\,\delta_{qq'} \qquad (54)$$


a. STATIONARY STATES IN THE ABSENCE OF COUPLING 

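In the absence of coupling ($\omega_1 = 0$), the Hamiltonian $H$ of the system can be
written:

$$H(\omega_1 = 0) = \sum_q \left[ \frac{P_q^2}{2m} + \frac{1}{2}\,m\omega^2 X_q^2 \right] = \sum_q H_q \qquad (55)$$

where $H_q$ is the Hamiltonian of a one-dimensional harmonic oscillator acting in the
state space of particle $(q)$.

We introduce the operator $a_q$ defined by:

$$a_q = \frac{1}{\sqrt{2}} \left[ \sqrt{\frac{m\omega}{\hbar}}\,X_q + \frac{i}{\sqrt{m\hbar\omega}}\,P_q \right] \qquad (56)$$

$H_q$ can then be written:

$$H_q = \frac{1}{2}\left( a_q a_q^\dagger + a_q^\dagger a_q \right)\hbar\omega = \left( a_q^\dagger a_q + \frac{1}{2} \right)\hbar\omega \qquad (57)$$

$a_q^\dagger$ and $a_q$ are the creation and annihilation operators of an energy quantum for
oscillator $(q)$. We know (chap. V, §C-1-a) that the eigenstates of $H_q$ are given by:

$$|\varphi^q_{n_q}\rangle = \frac{1}{\sqrt{(n_q)!}}\,(a_q^\dagger)^{n_q}\,|\varphi^q_0\rangle \qquad (58)$$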
where $|\varphi^q_0\rangle$ is the ground state of oscillator $(q)$ and $n_q$ is a positive integer or zero.
If we choose as the energy origin the energy of the ground state [which amounts
to omitting, in (57), the term 1/2], we obtain for the energy $E^q_{n_q}$ of the state $|\varphi^q_{n_q}\rangle$:

$$E^q_{n_q} = n_q\,\hbar\omega \qquad (59)$$

In the absence of coupling, the stationary states of the global system are tensor
products of the form:

$$\cdots \otimes |\varphi^{-1}_{n_{-1}}\rangle \otimes |\varphi^{0}_{n_0}\rangle \otimes |\varphi^{1}_{n_1}\rangle \otimes \cdots \qquad (60)$$

and their energies are*:

$$E = \sum_q E^q_{n_q} = [\, \cdots + n_{-1} + n_0 + n_1 + \cdots \,]\,\hbar\omega \qquad (61)$$

The ground state, whose energy was chosen for the origin, is not degenerate
since $E = 0$ is obtained, in (61), only for:

$$n_q = 0 \quad \text{for all } q \qquad (62)$$

The corresponding state (60) is therefore unique. On the other hand, all the other
levels are infinitely degenerate. For example, to the first level, of energy $\hbar\omega$,
correspond all states (60) for which all the numbers $n_q$ are zero except for one,
which is itself equal to one. All but one of the oscillators are then in their ground
states. It is because the excitation can be localized in any one of the oscillators
that the level $E = \hbar\omega$ is infinitely degenerate.
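The growth of this degeneracy can be made quantitative for a chain of finite length. The sketch below assumes a chain of `n_sites` uncoupled oscillators (a hypothetical finite truncation of the infinite chain) and counts the sets of occupation numbers $n_q$ whose sum is `n_quanta`; the closed formula is the standard count of ways to distribute identical quanta among distinguishable sites, and a brute-force enumeration is included as a cross-check.

```python
from itertools import product
from math import comb

def degeneracy(n_quanta, n_sites):
    """Number of sets (n_q) with sum n_quanta on n_sites oscillators."""
    return comb(n_quanta + n_sites - 1, n_quanta)

def degeneracy_brute_force(n_quanta, n_sites):
    """Same count by direct enumeration of all occupation sets."""
    return sum(1 for occ in product(range(n_quanta + 1), repeat=n_sites)
               if sum(occ) == n_quanta)

for sites in (10, 100, 1000):
    # The first excited level E = hbar*omega has one state per site.
    print(sites, degeneracy(1, sites), degeneracy(2, sites))
```

The degeneracy of the level $E = \hbar\omega$ equals the number of sites and therefore grows without bound as the chain is lengthened, while the ground level remains non-degenerate.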


b. EFFECTS OF THE COUPLING 

When the coupling is not zero, the Hamiltonian operator becomes:

$$H = H(\omega_1 = 0) + V \qquad (63)$$

with:

$$V = \frac{1}{2}\,m\omega_1^2 \sum_q (X_q - X_{q+1})^2 \qquad (64)$$

The states (60) are no longer, in this case, the stationary states of the system
as they are eigenstates of $H(\omega_1 = 0)$, but not of $V$. To see this, we write $V$ in terms
of the operators $a_q$ and $a_q^\dagger$:

$$V = \frac{\hbar\omega_1^2}{4\omega} \sum_q \left( a_q + a_q^\dagger - a_{q+1} - a_{q+1}^\dagger \right)^2 \qquad (65)$$

Now, it is clear that the action of $V$ on a state of type (60) changes the state: the
numbers $n_q$ are no longer “good quantum numbers” since, for example, $V$ can
transfer an excitation from site $(q)$ to site $(q+1)$ (term in $a_{q+1}^\dagger a_q$).

* If we had not changed the energy origin of each oscillator by omitting the term 1/2 in (57), we
would have found an infinite energy for the system, whatever the quantum numbers $n_q$. This difficulty
does not arise if, instead of an infinite chain, one considers a chain formed by a very large but finite
number of oscillators. However, problems related to “edge effects” then appear.

To find the stationary states of the system in the presence of the coupling, it is
useful, as in classical mechanics, to introduce “normal variables”, that is, operators
associated with the normal modes of the system.


c. NORMAL OPERATORS. COMMUTATION RELATIONS 


To the normal variables $\xi(k, t)$ and $\pi(k, t)$ correspond the operators $\Xi(k)$
and $\Pi(k)$ defined by:

$$\Xi(k) = \sum_q X_q\,e^{-iqkl} \qquad (66\text{-a})$$

$$\Pi(k) = \sum_q P_q\,e^{-iqkl} \qquad (66\text{-b})$$

The domain of variation of the continuous parameter $k$ is again limited to the
first Brillouin zone (15). Note that since the normal variables $\xi(k, t)$ and $\pi(k, t)$ are
complex, the associated operators $\Xi(k)$ and $\Pi(k)$ are not Hermitian, unlike $X_q$ and $P_q$.
The relations which correspond to (24) and (27) are here:

$$\Xi(-k) = \Xi^\dagger(k) \qquad (67\text{-a})$$

$$\Pi(-k) = \Pi^\dagger(k) \qquad (67\text{-b})$$

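The canonical commutation relations (54) enable us to calculate the commutators
of $\Xi(k)$ and $\Pi(k)$. We immediately see that $\Xi(k)$ and $\Xi(k')$ commute,
as do $\Pi(k)$ and $\Pi(k')$. As for the commutator $[\Xi(k), \Pi^\dagger(k')]$, it can be written:

$$[\Xi(k), \Pi^\dagger(k')] = \sum_q \sum_{q'} [X_q, P_{q'}]\,e^{-iqkl}\,e^{+iq'k'l} = i\hbar \sum_q e^{-iq(k-k')l} \qquad (68)$$

Using formula (31) of appendix II and the fact that $k$ and $k'$ both belong to
interval (15), we obtain:

$$[\Xi(k), \Pi^\dagger(k')] = i\hbar\,\frac{2\pi}{l}\,\delta(k - k') \qquad (69)$$

We saw in §1-c-β that it is convenient to condense the two normal
variables $\xi(k, t)$ and $\pi(k, t)$ into one, $\alpha(k, t)$ [formula (30)]. The operator associated
with $\alpha(k, t)$ will be:

$$a(k) = \frac{1}{\sqrt{2}} \left[ \beta(k)\,\Xi(k) + \frac{i}{\hbar\,\beta(k)}\,\Pi(k) \right] \qquad (70)$$

where $\beta(k)$ is defined in (32). Note that the adjoint of $a(k)$ can be written:

$$a^\dagger(k) = \frac{1}{\sqrt{2}} \left[ \beta(k)\,\Xi^\dagger(k) - \frac{i}{\hbar\,\beta(k)}\,\Pi^\dagger(k) \right] \qquad (71)$$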
Using (69) and (67), we find without difficulty that:

$$[a(k), a(k')] = [a^\dagger(k), a^\dagger(k')] = 0 \qquad (72\text{-a})$$

$$[a(k), a^\dagger(k')] = \frac{2\pi}{l}\,\delta(k - k') \qquad (72\text{-b})$$

To the classical quantity $h(k)$ defined in (48) corresponds the operator:

$$H(k) = \frac{1}{2m}\,\Pi(k)\,\Pi^\dagger(k) + \frac{1}{2}\,m\Omega^2(k)\,\Xi(k)\,\Xi^\dagger(k) \qquad (73)$$

since $\Xi(k)$ and $\Xi^\dagger(k)$ commute, as do $\Pi(k)$ and $\Pi^\dagger(k)$. To obtain the equivalent
of the classical formula (49), one must take into consideration the fact that $a(k)$
and $a^\dagger(k)$ do not commute; the order in which these operators appear must therefore
be retained throughout the calculation. Relations (37), taking (67) into account, can
be written here:

$$\beta(k)\,\Xi(k) = \frac{1}{\sqrt{2}} \left[\, a(k) + a^\dagger(-k) \,\right] \qquad (74\text{-a})$$

$$\frac{1}{\hbar\,\beta(k)}\,\Pi(k) = -\frac{i}{\sqrt{2}} \left[\, a(k) - a^\dagger(-k) \,\right] \qquad (74\text{-b})$$

Substituting these expressions into (73), we find:

$$H(k) = \frac{1}{2}\,\hbar\Omega(k) \left[\, a(k)\,a^\dagger(k) + a^\dagger(-k)\,a(-k) \,\right] \qquad (75)$$

As in (52), we can put the total Hamiltonian $H$ of the system in the form:

$$H = \frac{l}{2\pi} \int_{-\pi/l}^{+\pi/l} dk\; H'(k) \qquad (76)$$

with:

$$H'(k) = \frac{1}{2}\,\hbar\Omega(k) \left[\, a(k)\,a^\dagger(k) + a^\dagger(k)\,a(k) \,\right] \qquad (77)$$

$a(k)$ and $a^\dagger(k)$ thus can be seen to be annihilation and creation operators analogous
to those of a harmonic oscillator. However, since $k$ is a continuous index, the
commutation relations (72) involve $\delta(k - k')$ instead of a Kronecker delta, so $H'(k)$
must remain in the symmetrical form (77). It can easily be shown that the various
operators $H'(k)$ commute:

$$[H'(k), H'(k')] = 0 \qquad (78)$$


d. STATIONARY STATES IN THE PRESENCE OF COUPLING 

According to formulas (76) and (77), the ground state $|0\rangle$ of the system of
coupled oscillators is defined by the condition:

$$a(k)\,|0\rangle = 0 \qquad (79)$$

for all values of $k$. The other stationary states can be obtained from the state $|0\rangle$ by
the action of the operators $a^\dagger(k)$; each of the energies is the integral of the energies
associated with each of the modes. A certain number of difficulties arise because
of the continuous infinity of normal modes; in particular, the energy of the ground
state that can be deduced from (76) and (77) is infinite. We shall not discuss these
difficulties here; in any case, they do not arise for a real chain, that is, a finite
one (cf. note of §2-a).

Formula (10) gives the value of the energy quantum $\hbar\Omega(k)$ associated with
each of the modes. It therefore indicates what energy quanta the system can absorb
or emit: they must correspond to frequencies situated within the allowed band (17).


3. Application to the study of vibrations in a crystal: phonons

a. OUTLINE OF THE PROBLEM 

Consider a solid body, composed of a large number of atoms (or ions) whose 
equilibrium positions are regularly arranged in space, forming a crystalline lattice. 
For simplicity, we shall assume that this lattice is one-dimensional and can be 
treated like an infinite chain of atoms. We intend to use here the results of the 
preceding section to study the motion of the nuclei of these atoms about their 
equilibrium positions. 

With this object in view, we shall make use of the same approximation as in
the study of molecular vibrations (Born-Oppenheimer approximation; cf. complement
$A_V$, comment of § 1-a). We shall assume that the motion of the electrons can be
calculated by considering the positions of the nuclei to be fixed parameters $x_q$. Thus,
we shall solve the corresponding Schrödinger equation (actually, this equation is
too complex to be solved exactly; in practice, one must again settle for approximations).
We shall then denote the energy of the electronic system in its ground state
by $E_{el}(\ldots, x_{-1}, x_0, x_1, \ldots)$, where $x_q$ is the displacement of nucleus $(q)$ from its
equilibrium position. It can be shown that it is then possible to calculate the
motion of the nuclei, to a good approximation, by assuming that they possess
a total potential energy $U_N(\ldots, x_{-1}, x_0, x_1, \ldots)$ equal to the sum of their electrostatic
interaction energy and $E_{el}(\ldots, x_{-1}, x_0, x_1, \ldots)$.

In fact, we shall further simplify the problem by making some reasonable
hypotheses about $U_N$ (this is indispensable, since we do not know $E_{el}$). We shall
assume that $U_N$ describes essentially the interactions of each of the nuclei with
its nearest neighbors (in an infinite linear chain, each nucleus has two such neighbors),
that is, that the forces exerted between non-adjacent nuclei can be neglected. In
addition, we shall grant that, in the range of values that the displacements $x_q$ can
attain, $U_N$ is well represented by an expression of the form:

$$U_N \simeq \frac{1}{2}\,m\,\omega_1^2 \sum_q \left(x_q - x_{q+1}\right)^2 \tag{80}$$

where $m$ is the mass of a nucleus and $\omega_1$ characterizes the intensity of its interaction
with its neighbors. We shall not, therefore, take into account terms of higher order
in $(x_q - x_{q+1})$, that is, the anharmonicity of the potential.

Since expression (80) is identical to (6), we can apply the results of the preceding
sections to the simple model of a solid body that we have just defined. Note,
nevertheless, that we must choose $\omega = 0$ because $U_N$ is the total potential
energy of the system of the nuclei, which interact with their neighbors, but are not
elastically bound to their equilibrium positions*.


b. NORMAL MODES. SPEED OF SOUND IN THE CRYSTAL 

Each of the vibrational normal modes of the crystal is characterized by
a wave vector $k$ and an angular frequency $\Omega(k)$. In solid state physics, the energy
quantum associated with a mode is called a "phonon". The phonons can be considered
to be particles of energy $\hbar\Omega(k)$ and momentum $\hbar k$. Actually, a phonon is not
a true particle, since its existence involves a state of collective vibration of the real
particles which constitute the crystal. The phonons are sometimes said to be
"quasi-particles": they are entirely analogous to the fictitious particles, of position $x_G(t)$
and $x_R(t)$, introduced in complement $H_V$. In addition, a phonon can be created
or destroyed by giving to or taking from the crystal the corresponding vibrational
energy, while (at least in the non-relativistic domain to which we are restricting
ourselves), a particle such as an electron cannot be created or destroyed. In this
connection, note that, since the number of phonons in a given mode is arbitrary,
phonons are bosons (chap. XIV).



FIGURE 5

Dispersion relation for phonons (curve of figure 2 for $\omega = 0$); the slope of the curve at the origin
gives the speed of sound in the crystal.


* The Einstein model, which we described in complement $A_V$, is based on a different hypothesis:
each nucleus is assumed to "see" an average potential, due to its interactions with the other nuclei, but
practically independent of the exact positions of these other nuclei. To a first approximation, this average
potential is assumed to be parabolic, and one has a system of independent harmonic oscillators.
Here, on the other hand, we are studying a somewhat more elaborate model, in which we explicitly
(although approximately) take into account interactions between the nuclei.
The dispersion relation giving the function $\Omega(k)$ differs, for phonons, from
the one discussed in §1-b-β, since the angular frequency $\omega$ is zero here. In this
case, choosing $\omega = 0$ in (10), we obtain:

$$\Omega(k) = 2\,\omega_1 \left|\sin\frac{kl}{2}\right| \tag{81}$$


The curve which represents $\Omega(k)$ is given in figure 5; it is composed of two half-arcs
of a sinusoid. Unlike what happens for $\omega$ not equal to zero, $\Omega(k)$ now goes to zero
for $k = 0$ and varies linearly when $k$ is very small, since, as long as:

$$|k| \ll \frac{1}{l} \tag{82}$$

we have:

$$\Omega(k) \simeq \omega_1\,|kl| = v_s\,|k| \tag{83}$$

where:

$$v_s = \omega_1\,l \tag{84}$$

Condition (82) means that the wavelength $2\pi/|k|$ associated with the mode
being considered must be much greater than the separation between nuclei. For
such wavelengths, the discontinuous structure of the chain is negligible, and the
medium is not dispersive: the phase velocity $\Omega(k)/|k| \simeq v_s$ is independent of $k$,
which implies that a wave packet involving only small values of $k$ (of the same sign)
propagates without being deformed at the velocity $v_s$. Since acoustical wavelengths
satisfy (82), $v_s$ is the speed of sound in the crystal.

When $|k|$ is of the order of $1/l$, the discontinuous structure of the chain
becomes important, and the angular frequency $\Omega(k)$ increases less rapidly with $|k|$
than formula (83) would indicate (in figure 5, the curve deviates from the straight
dashed lines which are its tangents at the origin). The medium is then dispersive,
and a wave packet moves with a group velocity:

$$v_G = \frac{d\Omega(k)}{dk} \tag{85}$$

Finally, when $k$ approaches the edges of the first Brillouin zone ($k \to \pm\pi/l$),
we see from figure 5 that the group velocity approaches zero. As in an electromagnetic
waveguide, the propagation velocity goes to zero when the cut-off frequency
(here $\omega_1/\pi$, since the maximum of $\Omega(k)$ is $2\omega_1$) is attained.
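The behavior of the dispersion relation at small $k$ and at the zone edge can be checked numerically. The sketch below evaluates $\Omega(k) = 2\omega_1|\sin(kl/2)|$ and its derivative; the numerical values of $\omega_1$ and of the lattice spacing are illustrative assumptions, not values taken from the text, and the function names are hypothetical.

```python
import math

def omega(k, omega1, spacing):
    """Phonon angular frequency 2*omega1*|sin(k*l/2)| for a chain of spacing l."""
    return 2.0 * omega1 * abs(math.sin(0.5 * k * spacing))

def group_velocity(k, omega1, spacing):
    """Derivative dOmega/dk inside the first Brillouin zone.

    The derivative has a kink at k = 0; the value returned there is the
    right-hand slope, which equals the speed of sound.
    """
    sign = 1.0 if k >= 0 else -1.0
    return sign * omega1 * spacing * math.cos(0.5 * k * spacing)

# Assumed, purely illustrative parameters.
omega1 = 1.0e13        # rad/s
spacing = 3.0e-10      # m
v_s = omega1 * spacing # speed of sound, v_s = omega1 * l

k_small = 1.0e-3 / spacing   # |k| << 1/l: linear regime
k_edge = math.pi / spacing   # edge of the first Brillouin zone

print("phase velocity / v_s at small k:", omega(k_small, omega1, spacing) / (k_small * v_s))
print("group velocity at zone edge    :", group_velocity(k_edge, omega1, spacing))
print("maximum angular frequency / omega1:", omega(k_edge, omega1, spacing) / omega1)
```

With these assumptions the phase velocity at small $k$ is very close to $v_s$, the group velocity at $k = \pi/l$ is numerically zero, and the maximum angular frequency is $2\omega_1$.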

Figure 5 can also be seen as giving the spectrum of possible phonon
energies $\hbar\Omega(k)$ in terms of their momentum $\hbar k$. Knowledge of such a spectrum, for
a real crystal, is very important. It gives the precise energies and momenta that the
crystal can supply or absorb when it interacts with another system. For example,
the inelastic scattering of light by a crystal (the Brillouin effect) can be interpreted
as the result of the destruction or creation of a phonon, with a change in the
energy and momentum of the incident photon (total energy and momentum being
conserved throughout the process).


COMMENT : 

The simple one-dimensional model described here has allowed us to 
present some important physical concepts, which remain valid for a real 
crystal: energy quanta associated with the normal modes, dispersion of the 
medium, allowed and forbidden frequency (and, therefore, energy) bands.
In reality, the crystalline lattice is three-dimensional, and a normal mode is
characterized by a true wave vector $\mathbf{k}$; $\Omega$ then depends, in general, not only
on the absolute value of $\mathbf{k}$, but also on its direction. Also, the situation may
arise (as is the case for an ionic crystal) in which the vertices of the lattice
are not all occupied by identical particles, but rather, for example, by two
different types of particles in regular alternation*. Then, for each wave
vector $\mathbf{k}$, several angular frequencies $\Omega(\mathbf{k})$ appear. Some of them, which go
to zero when $|\mathbf{k}| \to 0$, constitute "acoustic branches" like the one encountered
above; the others belong to what are called "optical branches"**,
in which a phonon of zero momentum has a non-zero energy. It would be out
of the question to study all these problems here, although they are of
primordial importance in solid state physics.


References and suggestions for further reading: 

Chains of coupled classical oscillators : Berkeley 3 (7.1), §§2.4 and 3.5. 
See section 13 of the bibliography, particularly Kittel (13.2), chap. 5. 
Other examples of collective oscillations: Feynman III (1.2), chap. 15. 


* A real crystal also contains impurities and imperfections distributed at random. Here we are 
speaking only about perfect crystals. 

** This name arises from the fact that, in an ionic crystal, the "optical" phonons are coupled to
electromagnetic waves like those in the visible domain, whose wavelengths are much larger than the
atomic separation.




Complement K v 

VIBRATIONAL MODES OF A CONTINUOUS PHYSICAL SYSTEM. 
APPLICATION TO RADIATION ; PHOTONS 


1. Outline of the problem 

2. Vibrational modes of a continuous mechanical system: example of a vibrating string 

a. Notation. Dynamical variables of the system

b. Classical equations of motion 

c. Introduction of the normal variables 

d. Classical Hamiltonian 

e. Quantization 

3. Vibrational modes of radiation : photons 

a. Notation. Equations of motion 

b. Introduction of the normal variables 

c. Classical Hamiltonian 

d. Quantization 


1. Outline of the problem 

In complements $H_V$ and $J_V$, we introduced the idea of normal variables for
a system of two or a countable infinity of coupled harmonic oscillators. The aim
of this complement is to show that the same ideas can also be applied to the
electromagnetic field, which is a continuous physical system (there is no natural
lower bound for the wavelength of radiation).

This study raises a certain number of delicate problems. Therefore, before
beginning, and in order to make a smooth transition between this complement and
the preceding ones, $H_V$ and $J_V$, we shall start by studying in §2 the vibrational modes
of a continuous mechanical system: a vibrating string. It is obvious that, on the
atomic scale, such a system is not continuous: the string is composed of a very
large number of atoms. However, we shall ignore this atomic structure and treat
the string as if it were really continuous, since the fundamental aim of the
calculation is to show how normal variables can be introduced for a continuous
system. Since, moreover, we are dealing with a mechanical system, we can,
without difficulty, define the conjugate momenta of the normal variables,
calculate the Hamiltonian of the system, and show that it indeed appears in
the form of a sum of Hamiltonians of independent one-dimensional harmonic
oscillators. We shall also discuss in detail the quantization of such a system.

The results obtained in §2 will enable us to attack, in §3, the problem
of the vibrational modes of radiation. We shall show that the study of radiation
confined to a parallelepiped cavity leads to equations which are very similar
to those of the vibrating string. The same transformations allow the introduction
of completely uncoupled normal variables for the radiation (associated with the
standing waves which can exist inside the cavity). Then we shall generalize the
results obtained in §2 to derive simply the concept of a photon (as it is not
feasible here to show rigorously how one can introduce, for a non-mechanical
system like the electromagnetic field, conjugate momenta, a Lagrangian and a
Hamiltonian).


2. Vibrational modes of a continuous mechanical system: 
example of a vibrating string 

a. NOTATION. DYNAMICAL VARIABLES OF THE SYSTEM 



FIGURE 1 

Vibrating string passing through two fixed points O 
and P and having tension F; u(x, t) denotes the
deviation with respect to the equilibrium position 
of the point of the string situated at a distance x 
from O . 


The string is fixed at a point O (fig. 1). It passes through a very small hole P
pierced in a plate, and a weight exerts a tension $F$ on it. For simplicity, we
assume that the string always remains in the same plane, passing through O and P.
Its state is defined at time $t$ when we know at this time the displacement $u(x, t)$
of the various points (labeled by their abscissas $x$ on OP), as well as the corresponding
velocities $\partial u(x, t)/\partial t$. The constraints imposed at O and P are expressed by the
boundary conditions:

$$u(0, t) = u(L, t) = 0 \tag{1}$$

where 0 and $L$ are the abscissas of the points O and P.

It is important to remember that, in this problem, the dynamical variables 
are the displacements u(x, t) at each point of abscissa x: there is a continuous 
infinity of dynamical variables. Consequently, x is not a dynamical variable, but 
rather a continuous index which labels the dynamical variable with which we are 
concerned ($x$ plays the same role as indices 1 and 2 of complement $H_V$ or as
the index $q$ of complement $J_V$).


b. CLASSICAL EQUATIONS OF MOTION 


Let $\mu$ be the mass per unit length of the string. If we assume the string
to be perfectly flexible and if we confine ourselves to small displacements, a
classical calculation enables us to obtain the partial differential equation satisfied
by $u$. We find:

$$\left(\frac{1}{v^2}\frac{\partial^2}{\partial t^2} - \frac{\partial^2}{\partial x^2}\right) u(x, t) = 0 \tag{2}$$

where:

$$v = \sqrt{\frac{F}{\mu}} \tag{3}$$

is the propagation velocity of a perturbation along the string.

Such an equation expresses the fact that the evolution of the variable $u$
corresponding to the point $x$ depends on the variables $u$ at infinitely near points
(via $\partial^2 u/\partial x^2$). Thus the variables $u(x, t)$ are all coupled to each other. We can
then ask the following question: is it possible, as in complements $H_V$ and $J_V$, to
introduce new, uncoupled variables which are linear combinations of the
variables $u(x, t)$ associated with the various points $x$?


c. INTRODUCTION OF THE NORMAL VARIABLES


Consider the set of functions of $x$:

$$f_k(x) = \sqrt{\frac{2}{L}}\,\sin\left(\frac{k\pi x}{L}\right) \tag{4}$$

where $k$ is a positive integer: $k = 1, 2, 3, \ldots$ The $f_k(x)$ satisfy the same boundary
conditions as $u(x, t)$:

$$f_k(0) = f_k(L) = 0 \tag{5}$$

In addition, it is easy to verify the relations:

$$\int_0^L f_k(x)\,f_{k'}(x)\,dx = \delta_{kk'} \tag{6}$$

(orthonormalization relation) and:

$$\left(\frac{d^2}{dx^2} + \frac{k^2\pi^2}{L^2}\right) f_k(x) = 0 \tag{7}$$

It can be shown that any function which goes to zero at $x = 0$ and $x = L$
[as is the case for $u(x, t)$] can be expanded in one and only one way in terms of
the $f_k(x)$. We can therefore write:

$$u(x, t) = \sum_{k=1}^{\infty} q_k(t)\,f_k(x) \tag{8}$$

$q_k(t)$ can be obtained easily, using (6):

$$q_k(t) = \int_0^L u(x, t)\,f_k(x)\,dx \tag{9}$$

The state of the string at the instant $t$ can be defined by the set of values
$\{u(x, t), \partial u(x, t)/\partial t\}$ corresponding to the various points $x$ as well as by the set of
numbers $\{q_k(t), \dot{q}_k(t)\}$. The new variables $q_k(t)$ just introduced are linear combinations
of the old $u(x, t)$, as can be seen from (9). The converse is obviously also
true [cf. formula (8)].
To obtain the equation satisfied by the $q_k(t)$, we substitute expansion (8)
into the equation of motion (2). Using (7), we obtain, after a simple calculation:

$$\sum_{k=1}^{\infty} f_k(x)\left[\frac{1}{v^2}\frac{d^2}{dt^2} + \frac{k^2\pi^2}{L^2}\right] q_k(t) = 0 \tag{10}$$

that is, since the $f_k(x)$ are linearly independent:

$$\left[\frac{d^2}{dt^2} + \omega_k^2\right] q_k(t) = 0 \tag{11}$$

with:

$$\omega_k = \frac{k\pi v}{L} \tag{12}$$


Thus we see that the new variables $q_k(t)$, also called normal variables, evolve
independently: they are uncoupled. Moreover, equation (11) is identical to that of
a one-dimensional harmonic oscillator of angular frequency $\omega_k$, so that:

$$q_k(t) = A_k \cos(\omega_k t - \varphi_k) \tag{13}$$

Each of the terms $q_k(t)\,f_k(x)$ appearing on the right-hand side of (8) consequently
represents a standing wave of frequency $\omega_k/2\pi$ and half-wavelength $L/k$. Each
normal variable $q_k$ is therefore associated with a vibrational normal mode of the
string, the most general motion of the string being a linear superposition of
these normal modes.
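The orthonormalization of the mode functions and the projection that yields the normal variables can be illustrated numerically. The sketch below uses $f_k(x) = \sqrt{2/L}\,\sin(k\pi x/L)$ and a simple trapezoidal rule; the string length, the test profile and all function names are assumptions chosen for illustration only.

```python
import math

def f(k, x, L):
    """Mode function sqrt(2/L) * sin(k*pi*x/L), with k = 1, 2, 3, ..."""
    return math.sqrt(2.0 / L) * math.sin(k * math.pi * x / L)

def integrate(g, a, b, n=2000):
    """Composite trapezoidal rule on [a, b] with n sub-intervals."""
    h = (b - a) / n
    total = 0.5 * (g(a) + g(b))
    for i in range(1, n):
        total += g(a + i * h)
    return total * h

def overlap(k, kp, L):
    """Integral of f_k * f_k' over the string; expected to be delta_{kk'}."""
    return integrate(lambda x: f(k, x, L) * f(kp, x, L), 0.0, L)

def normal_coordinate(u, k, L):
    """Projection of a displacement profile u(x) on the mode f_k."""
    return integrate(lambda x: u(x) * f(k, x, L), 0.0, L)

L = 1.0  # assumed string length, arbitrary units

def profile(x):
    """Assumed test profile: modes 1 and 3 with amplitudes 0.5 and -0.2."""
    return 0.5 * f(1, x, L) - 0.2 * f(3, x, L)

print("overlaps (2,2) and (2,5):", overlap(2, 2, L), overlap(2, 5, L))
print("q_1, q_2, q_3:", [normal_coordinate(profile, k, L) for k in (1, 2, 3)])
```

The projection recovers the amplitudes that were put into the profile, and gives zero for a mode that is absent from it, which is what the uniqueness of the expansion requires.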


COMMENT:

In complement $J_V$, we started with an infinite discrete set of harmonic
oscillators and introduced a continuous infinity of normal variables. Here
we find ourselves in the opposite situation: the $u(x, t)$ form a continuous
set with respect to the index $x$, while, because of the boundary conditions,
the normal variables $q_k(t)$ are labeled by a discrete index $k$.


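d. CLASSICAL HAMILTONIAN

α. Kinetic energy

The kinetic energy of the segment of string included between $x$ and $x + dx$
is $\frac{1}{2}\,\mu\,dx\left(\frac{\partial u}{\partial t}\right)^2$. It follows that the total kinetic energy $\mathcal{T}$ of the string is
equal to:

$$\mathcal{T} = \frac{\mu}{2}\int_0^L \left[\frac{\partial u(x, t)}{\partial t}\right]^2 dx \tag{14}$$

$\mathcal{T}$ can be expressed simply in terms of the $q_k$, using (8):

$$\mathcal{T} = \frac{\mu}{2}\sum_k \sum_{k'} \frac{dq_k}{dt}\,\frac{dq_{k'}}{dt}\int_0^L f_k(x)\,f_{k'}(x)\,dx \tag{15}$$

which can also be written, taking (6) into account:

$$\mathcal{T} = \frac{\mu}{2}\sum_k \left(\frac{dq_k}{dt}\right)^2 \tag{16}$$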


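β. Potential energy

Consider the segment of string included between the abscissas $x$ and $x + dx$.
It makes an angle $\theta$ with the $Ox$ axis such that:

$$\tan\theta = \frac{\partial u}{\partial x} \tag{17}$$

Its length is therefore equal to:

$$\frac{dx}{\cos\theta} = dx\,\sqrt{1 + \tan^2\theta} \tag{18}$$

Since the displacements are small, $\theta$ is very small, and we can write:

$$dx\,\sqrt{1 + \tan^2\theta} \simeq dx\left[1 + \frac{1}{2}\left(\frac{\partial u}{\partial x}\right)^2\right] \tag{19}$$

From this we deduce that the total increase in the length of the string with
respect to its equilibrium position (which corresponds to $u = 0$ for all $x$) is
equal to:

$$\Delta L = \frac{1}{2}\int_0^L \left(\frac{\partial u}{\partial x}\right)^2 dx \tag{20}$$

Now, $\Delta L$ represents the distance over which the weight stretching the string has
risen. The potential energy $\mathcal{V}$ of the string, with respect to the value corresponding
to the equilibrium position, is therefore equal to:

$$\mathcal{V} = F\,\Delta L = \frac{F}{2}\int_0^L \left[\frac{\partial u(x, t)}{\partial x}\right]^2 dx \tag{21}$$

$\mathcal{V}$ can also be expressed in terms of the normal variables $q_k$. A simple
calculation, using (8) and (4), yields:

$$\mathcal{V} = \frac{F}{2}\sum_k \frac{k^2\pi^2}{L^2}\,q_k^2 = \frac{\mu}{2}\sum_k \omega_k^2\,q_k^2 \tag{22}$$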


γ. Conjugate momenta of the $q_k$; classical Hamiltonian

The Lagrangian $\mathcal{L}$ of the system (cf. appendix III) can be written:

$$\mathcal{L} = \mathcal{T} - \mathcal{V} = \frac{\mu}{2}\sum_k \left[\dot{q}_k^2 - \omega_k^2\,q_k^2\right] \tag{23}$$

From this, we deduce the expression for the conjugate momentum $p_k$ of $q_k$:

$$p_k = \frac{\partial\mathcal{L}}{\partial\dot{q}_k} = \mu\,\dot{q}_k \tag{24}$$

so that we finally obtain, for the Hamiltonian $\mathcal{H}(q_k, p_k)$ of the system, the
expression:

$$\mathcal{H} = \mathcal{T} + \mathcal{V} = \sum_k \left[\frac{p_k^2}{2\mu} + \frac{1}{2}\,\mu\,\omega_k^2\,q_k^2\right] \tag{25}$$

that is:

$$\mathcal{H} = \sum_k h_k \tag{26}$$

with:

$$h_k = \frac{p_k^2}{2\mu} + \frac{1}{2}\,\mu\,\omega_k^2\,q_k^2 \tag{27}$$

Since $p_k$ and $q_k$ are conjugate variables, we recognize $h_k$ to be the Hamiltonian
of a one-dimensional harmonic oscillator of angular frequency $\omega_k$. $\mathcal{H}$ is therefore a
sum of Hamiltonians of independent one-dimensional harmonic oscillators
(independent, since the normal variables are uncoupled).
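That the energy of the string splits into a sum of independent oscillator energies can be checked numerically. The sketch below computes the total energy twice: once by integrating the kinetic and potential energy densities, $\frac{\mu}{2}(\partial u/\partial t)^2$ and $\frac{F}{2}(\partial u/\partial x)^2$, along the string, and once as $\sum_k \left[p_k^2/2\mu + \frac{1}{2}\mu\omega_k^2 q_k^2\right]$ with $p_k = \mu\dot{q}_k$ and $\omega_k = k\pi v/L$. All numerical values and function names are illustrative assumptions.

```python
import math

def mode(k, x, L):
    """Mode function f_k(x) = sqrt(2/L) * sin(k*pi*x/L)."""
    return math.sqrt(2.0 / L) * math.sin(k * math.pi * x / L)

def mode_dx(k, x, L):
    """Spatial derivative of f_k(x)."""
    return math.sqrt(2.0 / L) * (k * math.pi / L) * math.cos(k * math.pi * x / L)

def trapz(g, a, b, n=4000):
    """Composite trapezoidal rule on [a, b] with n sub-intervals."""
    h = (b - a) / n
    return h * (0.5 * (g(a) + g(b)) + sum(g(a + i * h) for i in range(1, n)))

def energy_from_field(q, qdot, mu, F, L):
    """Kinetic plus potential energy, integrated along the string."""
    u_t = lambda x: sum(qdot[k] * mode(k, x, L) for k in qdot)
    u_x = lambda x: sum(q[k] * mode_dx(k, x, L) for k in q)
    kinetic = 0.5 * mu * trapz(lambda x: u_t(x) ** 2, 0.0, L)
    potential = 0.5 * F * trapz(lambda x: u_x(x) ** 2, 0.0, L)
    return kinetic + potential

def energy_from_modes(q, qdot, mu, F, L):
    """Sum over modes of p_k^2/(2 mu) + mu * omega_k^2 * q_k^2 / 2."""
    v = math.sqrt(F / mu)
    total = 0.0
    for k in q:
        omega_k = k * math.pi * v / L
        p_k = mu * qdot[k]
        total += p_k ** 2 / (2.0 * mu) + 0.5 * mu * omega_k ** 2 * q[k] ** 2
    return total

# Assumed illustrative values (SI units), not taken from the text.
mu, F, L = 1.0e-3, 10.0, 1.0
q = {1: 2.0e-3, 2: -1.0e-3, 4: 5.0e-4}   # normal coordinates q_k
qdot = {1: 0.1, 2: 0.3, 4: -0.2}         # their time derivatives

e_field = energy_from_field(q, qdot, mu, F, L)
e_modes = energy_from_modes(q, qdot, mu, F, L)
print(e_field, e_modes)
```

The two numbers agree to within rounding error, with no cross terms between different modes, which is the content of the decomposition into independent oscillators.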

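It is useful to introduce, as in complements $B_V$ and $C_V$, the dimensionless
variables:

$$\hat{q}_k = \beta_k\,q_k \tag{28-a}$$

$$\hat{p}_k = \frac{1}{\hbar\,\beta_k}\,p_k \tag{28-b}$$

where:

$$\beta_k = \sqrt{\frac{\mu\,\omega_k}{\hbar}} \tag{29}$$

is a (dimensional) constant. $\mathcal{H}$ can then be written:

$$\mathcal{H} = \sum_k \frac{1}{2}\,\hbar\omega_k\left[\hat{q}_k^2 + \hat{p}_k^2\right] \tag{30}$$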


e. QUANTIZATION 

α. Preliminary comment

The calculations performed in this section are, of course, not intended
to reveal typically quantum effects in the motion of a macroscopic vibrating
string. The vibrational frequencies $\omega_k/2\pi$ which can be excited on such a string
are so low (of the order of a kilohertz at most) and the elementary energies $\hbar\omega_k$
so much smaller than the macroscopic energy of the string, that a classical
description is quite sufficient. It might be thought that $\omega_k$ could be as large as
we like because $k$ has no upper bound in formula (12). In fact, for sufficiently
small wavelengths $2L/k$, the rigidity of the string can no longer be neglected,
and equation (2) is no longer valid. Furthermore, as we pointed out in the
introduction, the string is not really a continuous system, and it would make no
sense to consider wavelengths smaller than the interatomic distance.

The calculations which will be presented here must be considered as a
simple first approach to problems posed by the quantum mechanical description
of radiation. Now, radiation constitutes a truly continuous system (no natural
lower bound exists for the wavelength) and satisfies an equation which is analogous
to (2), whatever frequencies and wavelengths may be involved*.

β. Eigenstates and eigenvalues of the quantum mechanical Hamiltonian H

We quantize each oscillator by associating with $\hat{q}_k$ and $\hat{p}_k$ [see formula (28)]
observables $\hat{Q}_k$ and $\hat{P}_k$ such that:

$$[\hat{Q}_k, \hat{P}_k] = i \tag{31}$$

Since the normal variables are uncoupled, we also assume that the operators
relating to two different oscillators commute. We therefore have:

$$[\hat{Q}_k, \hat{P}_{k'}] = i\,\delta_{kk'} \tag{32}$$

Let:

$$H_k = \frac{1}{2}\,\hbar\omega_k\left(\hat{Q}_k^2 + \hat{P}_k^2\right) \tag{33}$$

be the quantum mechanical Hamiltonian of oscillator $k$. From the results of
chapter V, we know its eigenstates and eigenvalues:

$$H_k\,|n_k\rangle = \left(n_k + \frac{1}{2}\right)\hbar\omega_k\,|n_k\rangle \tag{34}$$

where $n_k$ is a non-negative integer (to simplify the notation, we shall write $|n_k\rangle$
instead of $|\varphi_{n_k}\rangle$).

Since the $H_k$ commute, we can choose the eigenstates of $H$ to be in the
form of tensor products of the $|n_k\rangle$:

$$|n_1\rangle\,|n_2\rangle \ldots |n_k\rangle \ldots = |n_1, n_2, \ldots, n_k, \ldots\rangle \tag{35}$$

The ground or "vacuum" state corresponds to all the $n_k$ equal to zero:

$$|0, 0, \ldots, 0, \ldots\rangle = |0\rangle \tag{36}$$

When we choose the energy origin to be the energy of the state $|0\rangle$, the energy
of state (35) is equal to:

$$E_{n_1, n_2, \ldots, n_k, \ldots} = \sum_k n_k\,\hbar\omega_k \tag{37}$$

A state such as (35) can be considered to represent a set of $n_1$ energy quanta
$\hbar\omega_1$, ..., $n_k$ energy quanta $\hbar\omega_k$, ... These vibrational quanta are analogous to the
phonons studied in complement $J_V$.
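The energy of such a state above the vacuum, $\sum_k n_k\hbar\omega_k$ with $\omega_k = k\pi v/L$, is easy to evaluate from a list of occupation numbers. The sketch below is only an illustration: the propagation velocity, the string length and the function names are assumptions, and the value of $\hbar$ is rounded.

```python
import math

HBAR = 1.054589e-34  # J*s, rounded value of the reduced Planck constant

def mode_angular_frequency(k, v, L):
    """Angular frequency omega_k = k*pi*v/L of mode k."""
    return k * math.pi * v / L

def excitation_energy(occupations, v, L):
    """Energy of a state above the vacuum: sum over k of n_k * hbar * omega_k.

    occupations maps the mode index k to its occupation number n_k;
    modes that are absent from the dictionary are taken to be empty.
    """
    for k, n in occupations.items():
        if k < 1 or n < 0:
            raise ValueError("need k >= 1 and n_k >= 0")
    return sum(n * HBAR * mode_angular_frequency(k, v, L)
               for k, n in occupations.items())

# Assumed illustrative string: v = 100 m/s, L = 0.5 m.
state = {1: 2, 3: 1}   # two quanta in mode 1, one quantum in mode 3
quantum_1 = HBAR * mode_angular_frequency(1, 100.0, 0.5)
print("energy in units of hbar*omega_1:",
      excitation_energy(state, 100.0, 0.5) / quantum_1)
```

For a macroscopic string the resulting energies are many orders of magnitude below any mechanically relevant energy, in line with the preliminary comment that a classical description is quite sufficient in that case.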

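Finally, using $\hat{P}_k$ and $\hat{Q}_k$, we can introduce, as in §B of chapter V, creation
and annihilation operators for an energy quantum $\hbar\omega_k$:

$$a_k = \frac{1}{\sqrt{2}}\left(\hat{Q}_k + i\hat{P}_k\right), \qquad a_k^\dagger = \frac{1}{\sqrt{2}}\left(\hat{Q}_k - i\hat{P}_k\right) \tag{38}$$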

* If we were really interested in a microscopic "vibrating string" (for example, a linear
macromolecule), it would be more realistic to consider, as in complement $J_V$, a chain of atoms and to study not
only their longitudinal displacements but also their transversal ones (transverse phonons).
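where $a_k^\dagger$ is the adjoint of $a_k$. We then have:

$$[a_k, a_{k'}^\dagger] = \delta_{kk'} \tag{39}$$

and:

$$\begin{aligned}
a_k\,|n_1, n_2, \ldots, n_k, \ldots\rangle &= \sqrt{n_k}\;|n_1, n_2, \ldots, n_k - 1, \ldots\rangle \\
a_k^\dagger\,|n_1, n_2, \ldots, n_k, \ldots\rangle &= \sqrt{n_k + 1}\;|n_1, n_2, \ldots, n_k + 1, \ldots\rangle
\end{aligned} \tag{40}$$

All the states (35) can be expressed in terms of the vacuum state $|0\rangle$:

$$|n_1, n_2, \ldots, n_k, \ldots\rangle = \frac{(a_1^\dagger)^{n_1}}{\sqrt{n_1!}}\,\frac{(a_2^\dagger)^{n_2}}{\sqrt{n_2!}} \cdots \frac{(a_k^\dagger)^{n_k}}{\sqrt{n_k!}} \cdots |0\rangle \tag{41}$$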
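For a single mode, the action of the annihilation and creation operators can be made concrete with matrices in the basis $|0\rangle, |1\rangle, \ldots$. The sketch below truncates this basis at an assumed finite dimension, so the commutator $[a, a^\dagger]$ equals the identity only away from the last basis state; this defect is an artifact of the truncation, not a property of the operators. Function names are hypothetical.

```python
import math

def annihilation_matrix(dim):
    """Matrix of a in the basis |0>, ..., |dim-1>, using a|n> = sqrt(n)|n-1>."""
    a = [[0.0] * dim for _ in range(dim)]
    for n in range(1, dim):
        a[n - 1][n] = math.sqrt(n)
    return a

def transpose(m):
    """Transpose of a matrix stored as a list of rows."""
    return [list(row) for row in zip(*m)]

def matmul(a, b):
    """Product of two matrices stored as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def commutator(a, b):
    """Matrix of [a, b] = ab - ba."""
    ab, ba = matmul(a, b), matmul(b, a)
    return [[ab[i][j] - ba[i][j] for j in range(len(ab))]
            for i in range(len(ab))]

dim = 6                          # assumed truncation of the Fock space
a = annihilation_matrix(dim)
a_dag = transpose(a)             # the matrix is real, so adjoint = transpose
number = matmul(a_dag, a)        # diagonal: occupation numbers 0 .. dim-1
comm = commutator(a, a_dag)      # identity, except in the last diagonal entry

print("diagonal of a_dag a  :", [round(number[n][n], 12) for n in range(dim)])
print("diagonal of [a, a_dag]:", [round(comm[n][n], 12) for n in range(dim)])
```

The product $a^\dagger a$ is diagonal with the occupation numbers as eigenvalues, which is why the energy of a mode above the vacuum is $n_k\hbar\omega_k$.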

γ. Quantum mechanical state of the system

The most general quantum mechanical state of the system is a linear
superposition of the states $|n_1, n_2, \ldots, n_k, \ldots\rangle$:

$$|\psi(t)\rangle = \sum_{n_1, n_2, \ldots, n_k, \ldots} c_{n_1, n_2, \ldots, n_k, \ldots}(t)\;|n_1, n_2, \ldots, n_k, \ldots\rangle \tag{42}$$

The equation of motion of $|\psi(t)\rangle$ is the Schrödinger equation:

$$i\hbar\,\frac{d}{dt}\,|\psi(t)\rangle = H\,|\psi(t)\rangle \tag{43}$$

Using (37) and (43), we easily obtain:

$$c_{n_1, n_2, \ldots, n_k, \ldots}(t) = c_{n_1, n_2, \ldots, n_k, \ldots}(0)\;e^{-i\sum_k n_k\,\omega_k\,t} \tag{44}$$


δ. Observables associated with the dynamical variables u(x, t)

When the system is quantized, $u(x, t)$ becomes an observable $U(x)$ which
does not depend on $t$* and which is obtained by replacing in (8) $q_k(t)$ by the
observable $\hat{Q}_k/\beta_k$:

$$U(x) = \sum_k \frac{1}{\beta_k}\,f_k(x)\,\hat{Q}_k = \sum_k \frac{1}{\beta_k\sqrt{2}}\,f_k(x)\left[a_k + a_k^\dagger\right] \tag{45}$$

Thus we see that a displacement observable $U(x)$ can be defined for each value of $x$,
and that it depends linearly on the creation and annihilation operators $a_k^\dagger$ and $a_k$.

It is interesting to compare the mean value of $U(x)$, $\langle\psi(t)|\,U(x)\,|\psi(t)\rangle$,
with the classical quantity $u(x, t)$. Since, according to (40), $a_k$ and $a_k^\dagger$ can link
only states whose energy difference is $\pm\hbar\omega_k$, we deduce from (45) that the only
Bohr frequencies which can appear in the evolution of $\langle U(x)\rangle(t)$ are the
frequencies $\omega_1/2\pi$, $\omega_2/2\pi$, ..., $\omega_k/2\pi$, ... associated, respectively, with the spatial


* Recall that, in quantum mechanics, the time dependence is usually contained in the state 
vector and not in the observables ( cf discussion of §D-l-d of chapter III). 
functions $f_1(x)$, $f_2(x)$, ..., $f_k(x)$, ... Thus we find for $\langle U(x)\rangle(t)$ a linear superposition
of the standing waves which can exist on the string. This analogy can, moreover,
be pursued further. Let us calculate the derivative $\frac{\partial^2}{\partial t^2}\langle U(x)\rangle(t)$; using the fact
that [cf. complement $G_V$, equation (17)]:

$$\frac{d}{dt}\,\langle a_k\rangle(t) = -i\,\omega_k\,\langle a_k\rangle(t) \tag{46}$$

and relations (7) and (12), we easily find that the mean value of $U(x)$ given
by (45) satisfies the differential equation:

$$\left[\frac{1}{v^2}\frac{\partial^2}{\partial t^2} - \frac{\partial^2}{\partial x^2}\right]\langle U(x)\rangle(t) = 0 \tag{47}$$

which is identical to (2).
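The steps leading to this wave equation can be written out briefly. This is a sketch of the reasoning, using only the expansion of $U(x)$ in terms of $a_k$ and $a_k^\dagger$ with coefficients $f_k(x)/\beta_k\sqrt{2}$, the relation $d\langle a_k\rangle/dt = -i\omega_k\langle a_k\rangle$ and its complex conjugate, and $\omega_k = k\pi v/L$.

```latex
\begin{aligned}
\frac{\partial^2}{\partial t^2}\,\langle U(x)\rangle(t)
  &= \sum_k \frac{f_k(x)}{\beta_k\sqrt{2}}\,
     \frac{d^2}{dt^2}\left[\langle a_k\rangle(t) + \langle a_k^\dagger\rangle(t)\right]
   = -\sum_k \omega_k^2\,\frac{f_k(x)}{\beta_k\sqrt{2}}
     \left[\langle a_k\rangle(t) + \langle a_k^\dagger\rangle(t)\right] \\
\frac{\partial^2}{\partial x^2}\,\langle U(x)\rangle(t)
  &= \sum_k \frac{1}{\beta_k\sqrt{2}}\,\frac{d^2 f_k}{dx^2}
     \left[\langle a_k\rangle(t) + \langle a_k^\dagger\rangle(t)\right]
   = -\sum_k \frac{k^2\pi^2}{L^2}\,\frac{f_k(x)}{\beta_k\sqrt{2}}
     \left[\langle a_k\rangle(t) + \langle a_k^\dagger\rangle(t)\right]
\end{aligned}
```

Since $k^2\pi^2/L^2 = \omega_k^2/v^2$, the second line is $1/v^2$ times the first, term by term, so the mean value obeys the same equation as the classical displacement.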

Finally, note that since Q k does not commute with H k , U(x) does not 
commute with H. The displacement and the total energy are therefore, in quantum 
mechanics, incompatible physical quantities. 


3. Vibrational modes of radiation: photons 

a. NOTATION. EQUATIONS OF MOTION 

The classical state of the electromagnetic field at a given time $t$ is defined when
we know, for this time, the value of the components of the electric field $\mathbf{E}$ and the
magnetic field $\mathbf{B}$ at each point $\mathbf{r}$ of space. As in §2 above, we therefore have
a continuous infinity of dynamical variables: the six components $E_x$, $E_y$, $E_z$ and
$B_x$, $B_y$, $B_z$ at each point $\mathbf{r}$.

In order to stress the importance of the idea of normal variables (or normal
modes) of a field, we shall introduce a simplification which consists of forgetting
the vector nature of the fields $\mathbf{E}$ and $\mathbf{B}$: we shall base our arguments on a scalar
field $\mathcal{S}(\mathbf{r}, t)$ which obeys (like each of the components of $\mathbf{E}$ and $\mathbf{B}$) the equation:

$$\left(\frac{1}{c^2}\frac{\partial^2}{\partial t^2} - \Delta\right)\mathcal{S}(\mathbf{r}, t) = 0 \tag{48}$$

where $c$ is the speed of light.

We shall assume the field to be confined to a parallelepiped cavity whose
inside walls are perfectly conducting and whose edges, parallel to $Ox$, $Oy$, $Oz$, have,
respectively, the lengths $L_1$, $L_2$, $L_3$. As boundary conditions, we require
$\mathcal{S}(\mathbf{r}, t)$ to be zero on the walls of the cavity (in the real problem, it is, for example,
the tangential components of the electric field $\mathbf{E}$ that must go to zero on these walls).
We can therefore write:

$$\mathcal{S}(x = 0, y, z, t) = \mathcal{S}(x = L_1, y, z, t) = \mathcal{S}(x, y = 0, z, t) = \ldots = \mathcal{S}(x, y, z = L_3, t) = 0 \tag{49}$$
b. INTRODUCTION OF THE NORMAL VARIABLES

Consider the set of functions of $x$, $y$, $z$:

$$f_{klm}(x, y, z) = \sqrt{\frac{8}{L_1 L_2 L_3}}\;\sin\left(\frac{k\pi x}{L_1}\right)\sin\left(\frac{l\pi y}{L_2}\right)\sin\left(\frac{m\pi z}{L_3}\right) \tag{50}$$

where $k$, $l$, $m$ are positive integers ($k, l, m = 1, 2, 3, \ldots$). The $f_{klm}(x, y, z)$ go
to zero on the walls of the cavity and therefore satisfy the same boundary
conditions as $\mathcal{S}(x, y, z, t)$:

$$f_{klm}(x = 0, y, z) = f_{klm}(x = L_1, y, z) = \ldots = f_{klm}(x, y, z = L_3) = 0 \tag{51}$$


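In addition, the following relations are simple to verify:

$$\int_0^{L_1} dx \int_0^{L_2} dy \int_0^{L_3} dz\; f_{klm}(x, y, z)\,f_{k'l'm'}(x, y, z) = \delta_{kk'}\,\delta_{ll'}\,\delta_{mm'} \tag{52}$$

and:

$$\left[\Delta + \pi^2\left(\frac{k^2}{L_1^2} + \frac{l^2}{L_2^2} + \frac{m^2}{L_3^2}\right)\right] f_{klm}(x, y, z) = 0 \tag{53}$$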


Any function which goes to zero on the walls of the cavity, $\mathcal{S}(\mathbf{r}, t)$ in
particular, can be expanded in one and only one way in terms of the $f_{klm}(x, y, z)$.
We therefore have:

$$\mathcal{S}(x, y, z, t) = \sum_{k,l,m} q_{klm}(t)\,f_{klm}(x, y, z) \tag{54}$$

Formula (54) can easily be inverted, with the help of (52):

$$q_{klm}(t) = \int_0^{L_1} dx \int_0^{L_2} dy \int_0^{L_3} dz\; f_{klm}(x, y, z)\,\mathcal{S}(x, y, z, t) \tag{55}$$


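Thus we see that the field at time $t$ is described by the set of variables
$q_{klm}(t)$ as well as by the set of variables $\mathcal{S}(x, y, z, t)$. Formulas (54) and (55) enable
us to go from one set to the other one.

Substituting (54) into (48) and using (53), we obtain, after a simple
calculation:

$$\left[\frac{d^2}{dt^2} + \omega_{klm}^2\right] q_{klm}(t) = 0 \tag{56}$$

with:

$$\omega_{klm}^2 = c^2\pi^2\left[\frac{k^2}{L_1^2} + \frac{l^2}{L_2^2} + \frac{m^2}{L_3^2}\right] \tag{57}$$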


The normal variables $q_{klm}(t)$ are therefore uncoupled. According to (56), $q_{klm}(t)$ varies
like $A\cos(\omega_{klm} t - \varphi)$. Each of the terms $q_{klm}(t)\,f_{klm}(x, y, z)$ of the sum (54)
therefore represents a standing wave (a vibrational normal mode of the field
in the cavity) characterized by its frequency $\omega_{klm}/2\pi$ and its spatial dependence
in the three directions $Ox$, $Oy$, $Oz$ (half-wavelengths $L_1/k$, $L_2/l$ and $L_3/m$
respectively).

Thus we have been able to generalize the results of §2-c without difficulty. 
Note, nevertheless, that when the vectorial nature of the electromagnetic field is 
taken into account, the structure of the modes is more complex. However, the 
general idea is the same, and one reaches similar conclusions. 
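As an illustration of the mode structure of the scalar model, the sketch below lists the lowest modes of a cavity, using $\omega_{klm} = c\pi\sqrt{k^2/L_1^2 + l^2/L_2^2 + m^2/L_3^2}$ with $k, l, m \geq 1$. The cavity dimensions, the cutoff on the indices and the function names are assumptions made for the example, and the rule $k, l, m \geq 1$ applies to the scalar field with vanishing boundary values considered here, not necessarily to the full vector field.

```python
import math
from itertools import product

C = 2.99792458e8  # speed of light in vacuum, m/s

def cavity_angular_frequency(k, l, m, L1, L2, L3):
    """Angular frequency of the scalar mode (k, l, m) in an L1 x L2 x L3 cavity."""
    return C * math.pi * math.sqrt((k / L1) ** 2 + (l / L2) ** 2 + (m / L3) ** 2)

def lowest_modes(L1, L2, L3, count=5, max_index=4):
    """Return the `count` lowest modes as ((k, l, m), omega) pairs.

    Only indices from 1 to max_index are scanned, so count should stay
    small compared with max_index**3.
    """
    modes = [((k, l, m), cavity_angular_frequency(k, l, m, L1, L2, L3))
             for k, l, m in product(range(1, max_index + 1), repeat=3)]
    modes.sort(key=lambda item: item[1])
    return modes[:count]

# Assumed example: a cubic cavity with 1 cm edges.
for indices, w in lowest_modes(0.01, 0.01, 0.01, count=4):
    print(indices, "frequency (Hz):", w / (2.0 * math.pi))
```

For a cube, the three modes obtained by permuting the indices (2, 1, 1) share the same frequency; making the edges unequal lifts this degeneracy.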


c. CLASSICAL HAMILTONIAN 

Basing our discussion on the very close analogy between the results of §2-c
and those of §3-b, we shall assume without proof that one can associate with
the field $\mathcal{S}(\mathbf{r}, t)$ a Lagrangian $\mathcal{L}$, from which can be deduced the equation of
motion (48), the conjugate momenta $p_{klm}(t)$ of the normal variables, and finally,
the expression for the Hamiltonian $\mathcal{H}$ of the system. The only point which is
important here is that this Hamiltonian is analogous to (30):

$$\mathcal{H} = \sum_{k,l,m} \frac{1}{2}\,\hbar\omega_{klm}\left[(\hat{q}_{klm})^2 + (\hat{p}_{klm})^2\right] \tag{58}$$

where $\hat{q}_{klm}$ and $\hat{p}_{klm}$ are dimensionless variables proportional to the $q_{klm}$ and $p_{klm}$:

$$\hat{q}_{klm} = \beta_{klm}\,q_{klm}, \qquad \hat{p}_{klm} = \frac{1}{\hbar\,\beta_{klm}}\,p_{klm} \tag{59}$$

$\beta_{klm}$ is a (dimensional) constant which is analogous to the one introduced in (29).


COMMENTS:

(i) The equation of motion of each normal variable $q_{klm}$ [established in (56)] is analogous to
that of a one-dimensional harmonic oscillator of angular frequency $\omega_{klm}$. Thus we see why
we obtain for $\mathcal{H}$ a sum of Hamiltonians of independent one-dimensional harmonic
oscillators. It is possible, moreover, to obtain (56) from (58). The Hamilton-Jacobi
equations (cf. appendix III) can, in fact, be written, taking (59) into account:

$$\begin{cases}
\dfrac{d\hat{q}_{klm}}{dt} = \dfrac{1}{\hbar}\,\dfrac{\partial\mathcal{H}}{\partial\hat{p}_{klm}} \\[2ex]
\dfrac{d\hat{p}_{klm}}{dt} = -\dfrac{1}{\hbar}\,\dfrac{\partial\mathcal{H}}{\partial\hat{q}_{klm}}
\end{cases} \tag{60}$$

that is, with the form (58) of $\mathcal{H}$:

$$\frac{d\hat{q}_{klm}}{dt} = \omega_{klm}\,\hat{p}_{klm} \tag{61-a}$$

$$\frac{d\hat{p}_{klm}}{dt} = -\omega_{klm}\,\hat{q}_{klm} \tag{61-b}$$

Eliminating $\hat{p}_{klm}$ between these two equations, we indeed find (56).
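The elimination takes one line. The sketch below writes $\hat{q}_{klm}$ and $\hat{p}_{klm}$ for the dimensionless variables and assumes, as for the string, that they are related to $q_{klm}$ and $p_{klm}$ by a time-independent constant $\beta_{klm}$.

```latex
\frac{d^2\hat{q}_{klm}}{dt^2}
  = \omega_{klm}\,\frac{d\hat{p}_{klm}}{dt}
  = -\,\omega_{klm}^2\,\hat{q}_{klm}
\qquad\Longrightarrow\qquad
\left[\frac{d^2}{dt^2} + \omega_{klm}^2\right] q_{klm}(t) = 0
```

The first equality is the time derivative of $d\hat{q}_{klm}/dt = \omega_{klm}\,\hat{p}_{klm}$, the second uses $d\hat{p}_{klm}/dt = -\omega_{klm}\,\hat{q}_{klm}$, and the final step divides by the constant $\beta_{klm}$.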

(ii) For the real electromagnetic field, composed of two fields $\mathbf{E}$ and $\mathbf{B}$, one can also obtain
expression (58) for $\mathcal{H}$ directly without using the Lagrangian. One simply takes the total
energy of the field as the sum of the electrical and magnetic energies contained in the
cavity:

$$\mathcal{H} = \frac{\varepsilon_0}{2}\int_0^{L_1} dx \int_0^{L_2} dy \int_0^{L_3} dz\,\left[\mathbf{E}^2 + c^2\,\mathbf{B}^2\right] \tag{62}$$

and uses for $\mathbf{E}$ and $\mathbf{B}$ expansions analogous to (54). Thus one finds that the terms
in $\hat{p}_{klm}^2$ and $\hat{q}_{klm}^2$ of (58) correspond respectively to the electrical and magnetic energies.


d. QUANTIZATION 

Now, starting with equation (58), we can carry out the same operations 
as in §2-e. 


a. Eigenstates and eigenvalues of H 

We associate with q klm and p klm two observables Q kim and P klm whose commu¬ 
tator is equal to i. Since observables relating to two different modes commute, 
we have, in general: 

\_Qklnf ^k’l’m'] = ^kk^ll ^mm’ ( 63 ) 

Let H klm be the Hamiltonian associated with the mode ( klm ): 

Hu. - [ (e«.) 2 + < p *-n <«> 


We know its eigenstates and eigenvalues: 


Hklm I n klm y — ( n klm + 2 J^^ktm \ n klm ) 


where n klm is a non-negative integer. 

Since the H klm commute, we can choose the eigenstates of H = £ H klm 

klm 

to be in the form of tensor products of the | n klm > : 

I rt lll' n 2\ I’ rt 121’ n \ 12. n klm ’••• y (66) 

The ground state, called the “vacuum”, corresponds to all the $n_{klm}$ equal to
zero:

$$
|\,0, 0, 0, 0, 0, \ldots\rangle = |\,0\rangle \tag{67}
$$

With respect to the vacuum state energy, the energy of state (66) is equal to:

$$
E_{n_{111}, \ldots, n_{klm}, \ldots} = \sum_{klm} n_{klm}\,\hbar\omega_{klm} \tag{68}
$$

A state such as (66) can be considered to represent a set of $n_{111}$ energy quanta $\hbar\omega_{111}$,
..., $n_{klm}$ energy quanta $\hbar\omega_{klm}$, ... These quanta are none other than the
photons. Thus we see that a certain type of photon is associated with each normal
mode of the cavity.
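As a small illustration of (68), the energy of a state of type (66) above the vacuum is just a weighted count of photons. The sketch below assumes $\hbar = 1$ and made-up mode frequencies; the function name `excitation_energy` and the dictionaries are hypothetical.

```python
def excitation_energy(occupations, omegas, hbar=1.0):
    """Energy of a state (66) relative to the vacuum, following (68)."""
    return sum(n * hbar * omegas[mode] for mode, n in occupations.items())

# Hypothetical angular frequencies for three modes (k, l, m), in arbitrary units.
omegas = {(1, 1, 1): 1.0, (2, 1, 1): 1.5, (1, 2, 1): 1.7}

# Two photons in mode (111), none in (211), three in (121).
state = {(1, 1, 1): 2, (2, 1, 1): 0, (1, 2, 1): 3}
vacuum = {mode: 0 for mode in omegas}

energy = excitation_energy(state, omegas)
print(energy, excitation_energy(vacuum, omegas))
```

With these numbers the state carries 2 × 1.0 + 3 × 1.7 units of energy above the vacuum, while the vacuum itself gives zero by construction.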
We can, as in (38), introduce annihilation and creation operators for a
photon of type (klm):

$$
a_{klm} = \frac{1}{\sqrt{2}}\,(Q_{klm} + i P_{klm}), \qquad
a^{\dagger}_{klm} = \frac{1}{\sqrt{2}}\,(Q_{klm} - i P_{klm}) \tag{69}
$$


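and establish formulas identical to (39), (40) and (41):

$$
[a_{klm}, a^{\dagger}_{k'l'm'}] = \delta_{kk'}\,\delta_{ll'}\,\delta_{mm'} \tag{70}
$$

$$
\begin{aligned}
a_{klm}\,|\,n_{111}, \ldots, n_{klm}, \ldots\rangle &= \sqrt{n_{klm}}\;|\,n_{111}, \ldots, n_{klm} - 1, \ldots\rangle \\
a^{\dagger}_{klm}\,|\,n_{111}, \ldots, n_{klm}, \ldots\rangle &= \sqrt{n_{klm} + 1}\;|\,n_{111}, \ldots, n_{klm} + 1, \ldots\rangle
\end{aligned}
\tag{71}
$$

$$
|\,n_{111}, \ldots, n_{klm}, \ldots\rangle =
\frac{(a^{\dagger}_{111})^{n_{111}}}{\sqrt{n_{111}!}} \cdots
\frac{(a^{\dagger}_{klm})^{n_{klm}}}{\sqrt{n_{klm}!}} \cdots |\,0\rangle \tag{72}
$$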
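The action of the creation and annihilation operators on number states can be checked with a few lines of code. The sketch below treats a single mode and stores a state as a dictionary {n: amplitude}; this representation and the function names are our own illustrative choices.

```python
import math

def apply_adag(state):
    """Creation operator: a†|n> = sqrt(n + 1)|n + 1>."""
    return {n + 1: math.sqrt(n + 1) * c for n, c in state.items()}

def apply_a(state):
    """Annihilation operator: a|n> = sqrt(n)|n - 1>, and a|0> = 0."""
    return {n - 1: math.sqrt(n) * c for n, c in state.items() if n > 0}

def number_state(n):
    """Build |n> as (a†)^n |0> / sqrt(n!), the single-mode form of (72)."""
    state = {0: 1.0}
    for _ in range(n):
        state = apply_adag(state)
    norm = math.sqrt(math.factorial(n))
    return {k: c / norm for k, c in state.items()}

ket4 = number_state(4)       # should be |4> with unit amplitude
lowered = apply_a(ket4)      # sqrt(4) |3>
raised = apply_adag(ket4)    # sqrt(5) |5>
print(ket4, lowered, raised)
```

Applying a† four times to the vacuum and dividing by √(4!) returns a normalized state, and the amplitudes produced by a and a† are √n and √(n + 1) respectively.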




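β. Quantum state of the field

The most general state of the field is a linear superposition of states (66):

$$
|\,\psi(t)\rangle = \sum_{n_{111}, \ldots, n_{klm}, \ldots}
c_{n_{111}, \ldots, n_{klm}, \ldots}(t)\;|\,n_{111}, \ldots, n_{klm}, \ldots\rangle \tag{73}
$$

The Schrödinger equation:

$$
i\hbar\,\frac{\mathrm{d}}{\mathrm{d}t}\,|\,\psi(t)\rangle = H\,|\,\psi(t)\rangle \tag{74}
$$

enables us to obtain the coefficients $c_{n_{111}, \ldots, n_{klm}, \ldots}(t)$ in the form:

$$
c_{n_{111}, \ldots, n_{klm}, \ldots}(t) = c_{n_{111}, \ldots, n_{klm}, \ldots}(0)\;
e^{-i \sum_{klm} n_{klm}\,\omega_{klm}\, t} \tag{75}
$$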


γ. Field operator

Upon quantization, $\mathcal{S}(\mathbf{r}, t)$ becomes an observable $S(\mathbf{r})$ which no longer
depends on t and is obtained by replacing in (54) $q_{klm}(t)$ by $Q_{klm}$:

$$
S(\mathbf{r}) = \sum_{klm} \frac{1}{\beta_{klm}}\, f_{klm}(\mathbf{r})\, Q_{klm} \tag{76}
$$

One can also, with the help of (69), express $S(\mathbf{r})$ in terms of the creation and
annihilation operators:

$$
S(\mathbf{r}) = \frac{1}{\sqrt{2}} \sum_{klm} \frac{1}{\beta_{klm}}\, f_{klm}(\mathbf{r})\,
\left[a_{klm} + a^{\dagger}_{klm}\right] \tag{77}
$$

The same arguments as in §2-e-δ enable us to show, using formulas (71)
and (75), that the only Bohr frequencies which can appear in the time evolution
of the mean value of the field:

$$
\langle S(\mathbf{r})\rangle(t) = \langle\psi(t)\,|\,S(\mathbf{r})\,|\,\psi(t)\rangle
$$

are the frequencies $\omega_{111}/2\pi$, $\omega_{211}/2\pi$, ..., $\omega_{klm}/2\pi$, ... associated, respectively, with
the spatial functions $f_{111}(\mathbf{r})$, $f_{211}(\mathbf{r})$, ..., $f_{klm}(\mathbf{r})$, ... Thus we find for $\langle S(\mathbf{r})\rangle(t)$ a
linear superposition of the classical standing waves which can exist in the cavity.
A calculation identical to the one in §2-e-δ would enable us to show that
$\langle S(\mathbf{r})\rangle$ satisfies equation (48).

Finally, we find that S(r) and H do not commute. It is therefore impossible, 
in quantum theory, to know simultaneously and with certainty both the number 
of photons and the value of the electromagnetic field at a point in space. 

comment : 

For the electromagnetic field, coherent states can be constructed which are analogous
to the ones introduced in complement G_V and which represent the best possible
compromise between the incompatible quantities, field and energy.

δ. Vacuum fluctuations

We saw in §D-1 of chapter V that, in the ground state of a harmonic oscillator, $\langle X\rangle$
is zero while $\langle X^2\rangle$ is not, and we discussed the physical meaning of this typically quantum
mechanical effect.

In the problem we are studying here, $S(\mathbf{r})$ presents many analogies with the X operator
of chapter V; we see from (77) that $S(\mathbf{r})$ is a linear combination of creation and annihilation
operators. Consider the mean value of $S(\mathbf{r})$ in the ground state $|\,0\rangle$ of the field, that is, the
“vacuum” state of photons. Since the diagonal elements of $a$ and $a^{\dagger}$ are zero according to (71),
we see that:

$$
\langle 0\,|\,S(\mathbf{r})\,|\,0\rangle = 0 \tag{78}
$$

On the other hand, the corresponding matrix element of $[S(\mathbf{r})]^2$ is not zero. According
to (71):

$$
a_{klm}\,|\,0\rangle = 0, \qquad
\langle 0\,|\,a^{\dagger}_{klm} = 0, \qquad
\langle 0\,|\,a_{k'l'm'}\, a^{\dagger}_{klm}\,|\,0\rangle = \delta_{kk'}\,\delta_{ll'}\,\delta_{mm'} \tag{79}
$$

Therefore, a simple calculation enables us to establish, using (77), that:

$$
\langle 0\,|\,[S(\mathbf{r})]^2\,|\,0\rangle = \frac{1}{2} \sum_{klm} \frac{1}{\beta_{klm}^2}\,[f_{klm}(\mathbf{r})]^2 \tag{80}
$$

From this we see that in a vacuum, that is, in the absence of photons, the electromagnetic
field $S(\mathbf{r})$ at a point of space has a zero mean value but a non-zero root-mean-square deviation.
This means, for example, that if we perform one measurement of $S(\mathbf{r})$, we can find a non-zero
result (varying, of course, from one measurement to another), even if there is no photon
present in space. This effect has no equivalent in classical theory, in which, when the energy is
zero, the field is rigorously zero. The preceding result is often expressed by saying that the
“vacuum state” of photons is subject to fluctuations of the field, characterized by (78) and (80)
and called vacuum fluctuations.
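The structure of (78) and (80) can be seen on a single mode. The sketch below works with the dimensionless operator Q = (a + a†)/√2 of (69) and evaluates its first two moments in the vacuum; a state is stored as a dictionary {n: amplitude} with real amplitudes, and all function names are illustrative assumptions.

```python
import math

def apply_a(state):
    """a|n> = sqrt(n)|n - 1>, with a|0> = 0."""
    return {n - 1: math.sqrt(n) * c for n, c in state.items() if n > 0}

def apply_adag(state):
    """a†|n> = sqrt(n + 1)|n + 1>."""
    return {n + 1: math.sqrt(n + 1) * c for n, c in state.items()}

def apply_Q(state):
    """Q = (a + a†)/sqrt(2), obtained by adding the two relations (69)."""
    out = dict(apply_a(state))
    for n, c in apply_adag(state).items():
        out[n] = out.get(n, 0.0) + c
    return {n: c / math.sqrt(2) for n, c in out.items()}

def inner(bra, ket):
    """Scalar product for real amplitudes."""
    return sum(c * ket.get(n, 0.0) for n, c in bra.items())

vacuum = {0: 1.0}
mean_Q = inner(vacuum, apply_Q(vacuum))             # single-mode analogue of (78)
mean_Q2 = inner(vacuum, apply_Q(apply_Q(vacuum)))   # single-mode analogue of (80)
print(mean_Q, mean_Q2)
```

The mean value vanishes while the mean square is 1/2, which is the factor 1/2 appearing in front of the sum over modes in (80).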
The existence of these fluctuations has interesting physical consequences for the interaction
of an atomic system with the electromagnetic field. Consider, for example, an atom
in a state $|\,E\rangle$ of energy E, interacting with a classically represented electromagnetic wave.
We shall see in complement A_XIII, using time-dependent perturbation theory (cf. chap. XIII),
that under the effect of such an excitation, the atom can move to a higher energy state (absorption)
or to a lower energy state (induced emission). But in this semi-classical treatment, if no field
is present in space, the atom must remain indefinitely in the state $|\,E\rangle$. In fact, we have just
established that, even in the absence of incident photons, the atom “sees” the “vacuum
fluctuations” related to the quantum mechanical nature of the electromagnetic field. Under the
effect of these fluctuations, it can emit a photon and fall back into a lower energy state (the
energy of the global system being conserved during this process). This is the phenomenon of
spontaneous emission, which can thus be considered to be, as it were, an “emission induced by
the vacuum fluctuations”. (No spontaneous absorption is possible, since this would cause the
atom to move to a higher energy state, and no electromagnetic energy can be extracted from the
field, which is in its ground state.)

It can also be shown that another effect of “vacuum fluctuations” is to impart to the
atomic electrons an erratic motion which slightly modifies the energies of the levels. The
observation of this effect in the hydrogen atom spectrum (the “Lamb shift”) constituted the
point of departure for the development of modern quantum electrodynamics.

comment : 

In the preceding discussions, we have always chosen the energy of the vacuum
state as the energy origin. In fact, harmonic oscillator theory gives us the absolute value
of the energy of the vacuum state:

$$
E_0 = \sum_{klm} \frac{1}{2}\,\hbar\omega_{klm} \tag{81}
$$

There is obviously a close relationship between $E_0$ and the electrical and magnetic
energy associated with “vacuum fluctuations”. One of the difficulties of quantum
electrodynamics, of which we have just given a brief sketch, is that the sum (81) is in
fact infinite, as is, moreover, (80)! Nevertheless, it is possible to surmount this difficulty:
using the procedure called “renormalization”, one manages to bypass infinite quantities
and calculate the physical effects which are actually observable, such as the “Lamb
shift”, with remarkable accuracy. It is obviously out of the question to consider these
vast problems here.


References and suggestions for further reading: 

Modes of a continuous string in classical mechanics: Berkeley 3 (1.1), §§2.1, 
2.2 and 2.3. 

Quantization of the electromagnetic field: Mandl (2.9); Schiff (1.18), chap. 14; 
Messiah (1.17), chap. XXI; Bjorken and Drell (2.10), chap. 11; Power (2.11); 
Heitler (2.13). 

The Lamb shift: Lamb and Retherford (3.11); Frisch (3.13); Kuhn (11.1), chap. III,
§ A 5 e; Series (11.7), chaps. VIII, IX and X.
Complement L_V

ONE-DIMENSIONAL HARMONIC OSCILLATOR 
IN THERMODYNAMIC EQUILIBRIUM AT A TEMPERATURE T 


1. Mean value of the energy 

a. Partition function 

b. Calculation of < H > 

2. Discussion 

a. Comparison with the classical oscillator 

b. Comparison with a two-level system 

3. Applications 

a. Blackbody radiation 

b. Bose-Einstein distribution law 

c. Specific heats of solids at constant volume 

4. Probability distribution of the observable X 

a. Definition of the probability density ρ(x)

b. Calculation of ρ(x)

c. Discussion 

d. Bloch's theorem 


This complement is devoted to the study of the physical properties of a one-dimensional
harmonic oscillator in thermodynamic equilibrium with a reservoir
at temperature T. We know (cf. complement E_III) that such an oscillator is not
in a pure state (it is impossible to describe its state by a ket $|\,\psi\rangle$). The partial
information which we possess about it and the results of statistical mechanics
enable us to characterize it by a statistical mixture of stationary states $|\,\varphi_n\rangle$
with weights proportional to $e^{-E_n/kT}$ (k: Boltzmann constant; $E_n$: energy of the
state $|\,\varphi_n\rangle$). We saw in complement E_III (§5-a) that the corresponding density
operator is then written:

$$
\rho = Z^{-1}\, e^{-H/kT} \tag{1}
$$

where H is the Hamiltonian operator, and:

$$
Z = \mathrm{Tr}\; e^{-H/kT} \tag{2}
$$

is a normalization factor which ensures that:

$$
\mathrm{Tr}\,\rho = 1 \tag{3}
$$

(Z is the partition function).

We shall calculate the mean value $\langle H\rangle$ of the oscillator’s energy, interpret
the result obtained, and show how it enters into numerous problems in physics
(blackbody radiation, specific heat of solids, ...). Finally, we shall establish and
discuss the expression for the probability density of the particle’s position
(the observable X).
1. Mean value of the energy


a. PARTITION FUNCTION 


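The energies $E_n$ of the states $|\,\varphi_n\rangle$ are, according to the results of §B of
chapter V, equal to $(n + 1/2)\hbar\omega$. Since the energy levels are not degenerate, we
have, according to (2):

$$
Z = \sum_{n=0}^{\infty} \langle\varphi_n\,|\,e^{-H/kT}\,|\,\varphi_n\rangle
= \sum_{n=0}^{\infty} e^{-(n + 1/2)\hbar\omega/kT}
= e^{-\hbar\omega/2kT}\left[1 + e^{-\hbar\omega/kT} + e^{-2\hbar\omega/kT} + \ldots\right] \tag{4}
$$

Inside the brackets, we recognize a geometrical progression of ratio $e^{-\hbar\omega/kT}$. Therefore:

$$
Z = \frac{e^{-\hbar\omega/2kT}}{1 - e^{-\hbar\omega/kT}} \tag{5}
$$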
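The summation of the geometrical progression can be checked numerically. In the sketch below, x stands for the ratio ħω/kT; the function names and the truncation `n_max` of the infinite sum are illustrative choices.

```python
import math

def partition_sum(x, n_max=2000):
    """Direct (truncated) sum of exp(-(n + 1/2) x) over n, with x = ħω/kT."""
    return sum(math.exp(-(n + 0.5) * x) for n in range(n_max))

def partition_closed(x):
    """Closed form obtained by summing the geometrical progression."""
    return math.exp(-x / 2) / (1 - math.exp(-x))

for x in (0.05, 0.5, 3.0):
    print(x, partition_sum(x), partition_closed(x))
```

For the values of x used here the neglected tail of the sum is far below machine precision, so the two columns agree to rounding error.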


b. CALCULATION OF < H > 


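According to formula (31) of complement E_III and expression (1) for ρ:

$$
\langle H\rangle = \mathrm{Tr}\,(H\rho) = Z^{-1}\,\mathrm{Tr}\,(H\, e^{-H/kT}) \tag{6}
$$

Writing the trace explicitly in the $\{\,|\,\varphi_n\rangle\,\}$ basis, we obtain:

$$
\langle H\rangle = Z^{-1} \sum_{n=0}^{\infty} (n + 1/2)\,\hbar\omega\; e^{-(n + 1/2)\hbar\omega/kT} \tag{7}
$$

To calculate this quantity, we differentiate both sides of (4) with respect to T:

$$
\frac{\mathrm{d}Z}{\mathrm{d}T} = \frac{1}{kT^2} \sum_{n=0}^{\infty} (n + 1/2)\,\hbar\omega\; e^{-(n + 1/2)\hbar\omega/kT} \tag{8}
$$

We see that:

$$
\langle H\rangle = kT^2\, \frac{1}{Z}\, \frac{\mathrm{d}Z}{\mathrm{d}T} \tag{9}
$$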

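Using (5), we then find, after a simple calculation:

$$
\langle H\rangle = \frac{\hbar\omega}{2} + \frac{\hbar\omega}{e^{\hbar\omega/kT} - 1} \tag{10}
$$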
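The weighted sum (7) can be compared numerically with the standard closed form ħω/2 + ħω/(e^{ħω/kT} − 1) for the mean energy of the oscillator. The sketch below measures energies in units of ħω and writes x = ħω/kT; function names and the truncation `n_max` are our own choices.

```python
import math

def mean_energy_sum(x, n_max=4000):
    """<H>/ħω from the Boltzmann-weighted sum (7), with x = ħω/kT."""
    weights = [math.exp(-(n + 0.5) * x) for n in range(n_max)]
    z = sum(weights)
    return sum((n + 0.5) * w for n, w in enumerate(weights)) / z

def mean_energy_closed(x):
    """<H>/ħω = 1/2 + 1/(e^x - 1); expm1 keeps precision for small x."""
    return 0.5 + 1.0 / math.expm1(x)

for x in (0.1, 1.0, 5.0):
    print(x, mean_energy_sum(x), mean_energy_closed(x))
```

The first term is the zero-point energy; the second is ħω multiplied by the mean excitation number of the oscillator.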
COMMENTS:

(i) Isotropic three-dimensional oscillator

Using the results and notation of complement E_V, we can write:

$$
\langle H\rangle = \langle H_x\rangle + \langle H_y\rangle + \langle H_z\rangle \tag{11}
$$

where $\langle H_x\rangle$ is given by:

$$
\langle H_x\rangle = Z^{-1}\,\mathrm{Tr}\,(H_x\, e^{-H/kT})
= \frac{\displaystyle\sum_{n_x=0}^{\infty}\sum_{n_y=0}^{\infty}\sum_{n_z=0}^{\infty}
(n_x + 1/2)\,\hbar\omega\; e^{-[(n_x + 1/2) + (n_y + 1/2) + (n_z + 1/2)]\hbar\omega/kT}}
{\displaystyle\sum_{n_x=0}^{\infty}\sum_{n_y=0}^{\infty}\sum_{n_z=0}^{\infty}
e^{-[(n_x + 1/2) + (n_y + 1/2) + (n_z + 1/2)]\hbar\omega/kT}} \tag{12}
$$

The sums over $n_y$ and $n_z$ can be separated out and are identical in the numerator and
denominator, so that:

$$
\langle H_x\rangle = \frac{\displaystyle\sum_{n_x=0}^{\infty} (n_x + 1/2)\,\hbar\omega\; e^{-(n_x + 1/2)\hbar\omega/kT}}
{\displaystyle\sum_{n_x=0}^{\infty} e^{-(n_x + 1/2)\hbar\omega/kT}} \tag{13}
$$

Aside from the replacement of n by $n_x$, the final expression is identical to the one calculated
in the preceding section; $\langle H_x\rangle$ is therefore equal to the value given in (10).
It is easy to show that the same is true for $\langle H_y\rangle$ and $\langle H_z\rangle$. Therefore, we have
established the following result: at thermodynamic equilibrium, the mean energy of an
isotropic three-dimensional oscillator is equal to three times that of a one-dimensional
oscillator of the same angular frequency.


(ii) Classical oscillator 

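The energy $\mathcal{H}(x, p)$ of a classical one-dimensional oscillator is equal to:

$$
\mathcal{H}(x, p) = \frac{p^2}{2m} + \frac{1}{2}\, m\omega^2 x^2 \tag{14}
$$

In expression (14), x and p can take on any values between $-\infty$ and $+\infty$.
According to the results of classical statistical mechanics, the mean energy
of this classical oscillator is given by:

$$
\langle\mathcal{H}\rangle =
\frac{\displaystyle\int_{-\infty}^{+\infty}\!\int_{-\infty}^{+\infty} \mathcal{H}(x, p)\; e^{-\mathcal{H}(x, p)/kT}\, \mathrm{d}x\, \mathrm{d}p}
{\displaystyle\int_{-\infty}^{+\infty}\!\int_{-\infty}^{+\infty} e^{-\mathcal{H}(x, p)/kT}\, \mathrm{d}x\, \mathrm{d}p} \tag{15}
$$

Substituting (14) into (15), we find, after a simple calculation:

$$
\langle\mathcal{H}\rangle = kT \tag{16}
$$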
2. Discussion

a. COMPARISON WITH THE CLASSICAL OSCILLATOR 

In figure 1, the solid line gives the mean energy $\langle H\rangle$ of the one-dimensional
quantum mechanical oscillator as a function of T. The dashed line corresponds
to the mean energy $\langle\mathcal{H}\rangle$ of the classical oscillator.



FIGURE 1 

As a function of the temperature, variation of the
mean energy of a quantum mechanical oscillator
(solid line) compared with that of a classical oscillator
(straight dashed line).


For T = 0, $\langle H\rangle = \hbar\omega/2$. This result corresponds to the fact that at absolute
zero, one is sure that the oscillator is in the ground state $|\,\varphi_0\rangle$, with energy $\hbar\omega/2$
($\hbar\omega/2$ is, for this reason, sometimes called the “zero point energy”). As for the
classical oscillator, it is motionless (p = 0) at its stable equilibrium position (x = 0),
and its energy is zero: $\langle\mathcal{H}\rangle = 0$.

As long as T remains small (more precisely, as long as $kT \ll \hbar\omega$), only
the population of the ground state is appreciable, and $\langle H\rangle$ remains practically
equal to $\hbar\omega/2$: in this region, the solid-line curve of figure 1 has a horizontal tangent.
We can see this directly from expression (10), which can be written, for small T:

$$
\langle H\rangle \simeq \frac{\hbar\omega}{2} + \hbar\omega\; e^{-\hbar\omega/kT} \tag{17}
$$

On the other hand, for large T, that is, for $kT \gg \hbar\omega$, the same formula yields:

$$
\langle H\rangle = \frac{\hbar\omega}{2} + kT\left[1 - \frac{\hbar\omega}{2kT} + \ldots\right] \tag{18}
$$

or:

$$
\langle H\rangle \simeq kT \tag{19}
$$

to within an infinitesimal of the order of $kT\,(\hbar\omega/kT)^2$. The asymptote of the curve
giving $\langle H\rangle$ as a function of T is therefore the straight line $\langle\mathcal{H}\rangle = kT$.

In conclusion, the quantum mechanical and classical oscillators have the
same mean energy, kT, at high temperatures ($kT \gg \hbar\omega$). Striking differences appear
at low temperatures ($kT \ll \hbar\omega$): it is no longer possible to ignore the quantization
of the oscillator’s energy once the energy kT which characterizes the reservoir
falls to the order of the distance $\hbar\omega$ separating two adjacent levels of the oscillator.
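The two regimes can be tabulated directly. The sketch below uses the reduced temperature t = kT/ħω and energies in units of ħω; the quantum expression is the standard result 1/2 + 1/(e^{1/t} − 1), the classical one is simply t, and the function names are illustrative. Values of t below about 0.002 are avoided because `math.expm1` would overflow.

```python
import math

def quantum_mean_energy(t):
    """<H>/ħω for the quantum oscillator, with t = kT/ħω."""
    return 0.5 + 1.0 / math.expm1(1.0 / t)

def classical_mean_energy(t):
    """Classical mean energy kT, in units of ħω."""
    return t

for t in (0.05, 0.2, 1.0, 5.0, 50.0):
    q = quantum_mean_energy(t)
    c = classical_mean_energy(t)
    print(f"kT/ħω = {t:6.2f}   quantum = {q:10.5f}   classical = {c:10.5f}")
```

At t = 0.05 the quantum value is indistinguishable from the zero-point energy 1/2, whereas at t = 50 it exceeds the classical value only by a term of order kT(ħω/kT)².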
b. COMPARISON WITH A TWO-LEVEL SYSTEM


It is interesting to compare the preceding results with those obtained for
a two-level system. Let $|\,\psi_1\rangle$ and $|\,\psi_2\rangle$ be the corresponding states, with energies $E_1$
and $E_2$ (with $E_1 < E_2$). For such a system, the general equation (6) yields:

$$
\langle H\rangle = \frac{E_1\, e^{-E_1/kT} + E_2\, e^{-E_2/kT}}{e^{-E_1/kT} + e^{-E_2/kT}} \tag{20}
$$



FIGURE 2 

Mean energy of a quantum mechanical system
with two energy levels $E_1$ and $E_2$, in thermodynamic
equilibrium at a temperature T.


The mean energy of a two-level system, given by (20), is shown in figure 2.
For small T ($kT \ll E_2 - E_1$), the terms in $e^{-E_1/kT}$ are preponderant in both the
numerator and the denominator of (20) (since $E_1 < E_2$) and we obtain:

$$
\langle H\rangle \underset{T \to 0}{\longrightarrow} E_1 \tag{21}
$$

It can be verified that the curve starts with a horizontal tangent. For
large T ($kT \gg E_2 - E_1$), the asymptote of the curve is the straight line parallel
to the T-axis of ordinate $(E_1 + E_2)/2$. The preceding results are easy to understand:
for T = 0, the system is in its ground state $|\,\psi_1\rangle$, of energy $E_1$; at high temperatures,
the populations of the two levels are practically equal, and $\langle H\rangle$ approaches half
the sum of the two energies $E_1$ and $E_2$.

Although the solid-line curves of figures 1 and 2 have the same shape at low
temperatures, we see that this is not at all true at high temperatures. For the
harmonic oscillator, $\langle H\rangle$ is not bounded and increases linearly with T, while,
for a two-level system, $\langle H\rangle$ cannot exceed a certain value. This difference is due to
the fact that the energy spectrum of the harmonic oscillator extends upward
indefinitely: when T increases, levels of higher and higher n are occupied, and this
causes $\langle H\rangle$ to increase. On the other hand, for a two-level system, once the populations
of the two levels are equalized, an additional increase in the temperature
does not change the mean energy.
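The saturation of the two-level mean energy is easy to see numerically. The sketch below evaluates (20) after factoring out the weight of the lower level, which avoids overflow at low temperature; the energies E₁ = 1 and E₂ = 3 (arbitrary units, k = 1) are hypothetical values chosen for illustration.

```python
import math

def two_level_mean_energy(e1, e2, kT):
    """Mean energy (20), written with the relative weight of the upper level."""
    w = math.exp(-(e2 - e1) / kT)   # population ratio upper/lower
    return (e1 + e2 * w) / (1.0 + w)

e1, e2 = 1.0, 3.0
for kT in (0.01, 0.5, 2.0, 20.0, 1.0e4):
    print(f"kT = {kT:10.2f}   <H> = {two_level_mean_energy(e1, e2, kT):.6f}")
```

The printed values rise from E₁ at low temperature and level off at (E₁ + E₂)/2, in contrast with the unbounded linear growth found for the harmonic oscillator.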


3. Applications 

a. BLACKBODY RADIATION 

We have already pointed out, in the introduction to chapter V (and in complement K_V,
where we justified this result more precisely), that the electromagnetic field in a cavity is
equivalent to a set of independent one-dimensional harmonic oscillators. Each of these
oscillators is associated with one of the standing waves which can exist inside the cavity
(normal modes) and has the same angular frequency as this wave. Let us show that this result,
combined with those obtained above for $\langle\mathcal{H}\rangle$ and $\langle H\rangle$, leads very simply to the
Rayleigh-Jeans law and the Planck law for blackbody radiation.

Let $\mathcal{V}$ be the volume of the cavity, whose walls are assumed to be perfectly reflecting.
The first modes of the cavity (those of lowest frequency) depend strongly on the form of the
cavity. On the other hand, for the high-frequency modes (those whose wavelength $\lambda = c/\nu$ is
much smaller than the dimensions of the cavity), a classical electromagnetic calculation shows
that, if $N(\nu)\,\mathrm{d}\nu$ denotes the number of modes whose frequency is between $\nu$ and $\nu + \mathrm{d}\nu$,
$N(\nu)$ is practically independent of the form of the cavity and equal to:

$$
N(\nu) = \frac{8\pi\nu^2}{c^3}\,\mathcal{V} \tag{22}
$$

Let $u(\nu)\,\mathrm{d}\nu$ be the electromagnetic energy per unit volume of the cavity contained in the
frequency band $(\nu, \nu + \mathrm{d}\nu)$ when the cavity is in thermodynamic equilibrium at a temperature T.
To obtain the energy $\mathcal{V}\,u(\nu)\,\mathrm{d}\nu$, one must multiply the number of modes whose frequency is
between $\nu$ and $\nu + \mathrm{d}\nu$ by the mean energy of the corresponding harmonic oscillators. We
calculated this energy above; it is equal to $\langle\mathcal{H}\rangle$ or $\langle H\rangle - \hbar\omega/2$*, depending on whether
the problem is treated classically or quantum mechanically. We then obtain, using (10), (16)
and (22):

$$
u_{cl}(\nu) = \frac{8\pi\nu^2}{c^3}\, kT \tag{23}
$$

in a classical treatment, and:

$$
u_{Q}(\nu) = \frac{8\pi h\nu^3}{c^3}\, \frac{1}{e^{h\nu/kT} - 1} \tag{24}
$$

in a quantum mechanical treatment.

We recognize (23) to be Rayleigh-Jeans’ law and (24) to be Planck’s law, which reduces to the
preceding one in the limit of low frequencies or high temperatures ($h\nu/kT \ll 1$). The differences
between these two laws reflect those which exist between the two curves of figure 1. At high
frequencies, difficulties arise in Rayleigh-Jeans’ law: the quantity $u_{cl}(\nu)$ given in (23) approaches
infinity when $\nu \to \infty$, which is physically absurd. In order to remedy this defect, Planck was
led to postulate that the energy of each oscillator varied discontinuously, by jumps proportional
to $\nu$ (energy quantization); thus he obtained formula (24), which accounts perfectly for the
experimental results.
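The comparison between the two laws can be made concrete with a short calculation. The sketch below evaluates the Rayleigh-Jeans and Planck spectral energy densities in SI units; the constants are the exact values fixed by the 2019 SI definitions (an assumption on our part, since the text quotes older measured values), and the temperature and frequencies are arbitrary illustrative choices.

```python
import math

H_PLANCK = 6.62607015e-34   # Planck constant, J s (2019 SI definition)
K_BOLTZ = 1.380649e-23      # Boltzmann constant, J/K (2019 SI definition)
C_LIGHT = 2.99792458e8      # speed of light, m/s

def u_rayleigh_jeans(nu, temperature):
    """Classical spectral energy density (8 pi nu^2 / c^3) kT."""
    return 8 * math.pi * nu**2 / C_LIGHT**3 * K_BOLTZ * temperature

def u_planck(nu, temperature):
    """Planck spectral energy density (8 pi h nu^3 / c^3) / (e^{h nu/kT} - 1)."""
    x = H_PLANCK * nu / (K_BOLTZ * temperature)
    return 8 * math.pi * H_PLANCK * nu**3 / C_LIGHT**3 / math.expm1(x)

T = 300.0  # kelvin
for nu in (1e9, 1e12, 1e13, 1e14, 1e15):
    ratio = u_planck(nu, T) / u_rayleigh_jeans(nu, T)
    print(f"nu = {nu:.0e} Hz   Planck / Rayleigh-Jeans = {ratio:.3e}")
```

The ratio of the two densities is x/(eˣ − 1) with x = hν/kT: it is close to 1 in the radio range and collapses exponentially at optical frequencies, which is how the quantum law avoids the high-frequency divergence.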


b. BOSE-EINSTEIN DISTRIBUTION LAW


Instead of calculating the mean value $\langle H\rangle$ of the energy, as we did in §1, let us calculate
that of the operator N. Since, according to formula (B-15) of chapter V:

$$
H = \left(N + \frac{1}{2}\right)\hbar\omega \tag{25}
$$


* We use $\langle H\rangle - \hbar\omega/2$ and not $\langle H\rangle$ for the following reason: $u(\nu)$ represents an electromagnetic
energy which can be extracted from the cavity. At absolute zero, all the oscillators are in their ground
states and no energy can be radiated outward because the system is in its lowest energy state; $u(\nu)$ must
therefore be zero at absolute zero, as experimental observations indeed show it to be. This requires us to
define the mean energy of the field in the cavity with respect to the value corresponding to T = 0.
we deduce from result (10) that:

$$
\langle N\rangle = \frac{1}{e^{h\nu/kT} - 1} \tag{26}
$$


The fact that the levels of a one-dimensional harmonic oscillator are equidistant enables
us to associate with the oscillator in the state $|\,\varphi_n\rangle$ a set of n identical particles (quanta) of the
same energy $h\nu$. In this interpretation, the operators $a^{\dagger}$ and $a$, which take $|\,\varphi_n\rangle$ into $|\,\varphi_{n+1}\rangle$
or $|\,\varphi_{n-1}\rangle$, create or destroy a particle. N is thus the operator associated with the number
of particles ($|\,\varphi_n\rangle$ is the eigenstate of N with the eigenvalue n).

In the special case of the electromagnetic field, the quanta associated with each harmonic
oscillator are none other than photons. To each mode of the cavity considered in the preceding
paragraph correspond photons of a certain type, characterized by the frequency, polarization,
and spatial distribution of the mode. Expression (26) gives the mean number of photons associated
with a mode of frequency $\nu$ at thermodynamic equilibrium. We recognize (26) to be the
Bose-Einstein distribution law, which can be derived in a more general way; here, we have
established it very simply by studying the harmonic oscillator and interpreting the states $|\,\varphi_n\rangle$.
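The mean number of quanta can also be obtained by brute force from the Boltzmann weights of the levels, which gives a direct check of the Bose-Einstein form 1/(e^{hν/kT} − 1). In the sketch below x = hν/kT; the zero-point factor cancels between numerator and denominator and is omitted, and the function names and truncation are illustrative.

```python
import math

def mean_occupation_sum(x, n_max=4000):
    """<N> as the Boltzmann-weighted average of n, with x = hν/kT."""
    weights = [math.exp(-n * x) for n in range(n_max)]
    return sum(n * w for n, w in enumerate(weights)) / sum(weights)

def bose_einstein(x):
    """Closed form 1/(e^x - 1) for zero chemical potential."""
    return 1.0 / math.expm1(x)

for x in (0.1, 1.0, 4.0):
    print(x, mean_occupation_sum(x), bose_einstein(x))
```

For x much smaller than 1 the mode contains many quanta on average (about 1/x), while for x much larger than 1 it is almost always empty.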


COMMENT : 


To be rigorous, we should write the Bose-Einstein distribution law for bosons of
energy $\varepsilon$:

$$
\langle N\rangle = \frac{1}{e^{(\varepsilon - \mu)/kT} - 1} \tag{27}
$$

where $\mu$ is the chemical potential. In the case of photons, $\mu = 0$. This is due to the fact
that the total number of photons in the global system radiation-reservoir is not fixed,
because of the possibility of absorption or emission of photons by the walls.


c. SPECIFIC HEATS OF SOLIDS AT CONSTANT VOLUME 


We shall confine ourselves here to the Einstein model (cf. complement A_V), in which a solid
is considered to be composed of $\mathcal{N}$ atoms vibrating independently about their equilibrium
positions with the same angular frequency $\omega_E$. The internal energy U of the solid at the temperature
T is therefore equal to the sum of the mean energies of the $\mathcal{N}$ isotropic three-dimensional
oscillators in thermodynamic equilibrium at this temperature. Using comment (i) of §1, we
see that:

$$
U = 3\,\mathcal{N}\,\langle H\rangle \tag{28}
$$

where $\langle H\rangle$ is the mean energy of a one-dimensional harmonic oscillator of angular frequency
$\omega_E$. We know that the constant volume specific heat $c_V$ is the derivative of the internal
energy U with respect to the temperature:

$$
c_V = \frac{\mathrm{d}U}{\mathrm{d}T} = 3\,\mathcal{N}\,\frac{\mathrm{d}}{\mathrm{d}T}\,\langle H\rangle \tag{29}
$$

which, taking (10) into account, yields:

$$
c_V = 3\,\mathcal{N}k \left(\frac{\hbar\omega_E}{kT}\right)^{2}
\frac{e^{\hbar\omega_E/kT}}{\left[e^{\hbar\omega_E/kT} - 1\right]^{2}} \tag{30}
$$
The variation of $c_V$ with T is shown in figure 3. According to (29), $c_V$ is proportional to the
derivative of the solid-line curve of figure 1. It is therefore very simple to describe the behavior
of the specific heat $c_V$ as a function of the temperature. In figure 1, we see that $\langle H\rangle$ has
a horizontal tangent at the origin and increases very slowly; $c_V$ is therefore zero for T = 0
and also increases very slowly. On the other hand, for large T ($kT \gg \hbar\omega_E$), $\langle H\rangle$ approaches kT;
we deduce that $c_V$ approaches a constant, $3\,\mathcal{N}k$, independent of $\omega_E$. The transition region
corresponds to $\hbar\omega_E/kT \simeq 1$.


FIGURE 3

(Constant volume) specific heat $c_V$ of a
solid in Einstein's model. The high-temperature
limit corresponds to the classical
Dulong-Petit law.

The asymptote of figure 3 corresponds to the Dulong-Petit law: if one takes a gram-atom
of any solid, $\mathcal{N}$ is equal to Avogadro's number and the limiting value of $c_V$ is equal
to 3R (where R is the ideal gas constant), that is, to about 6 cal degree⁻¹ mole⁻¹.

As we pointed out above, the quantum mechanical nature of crystalline vibrations
manifests itself at low temperatures when kT falls to the order of $\hbar\omega_E$ or less. Insofar as $c_V$
is concerned, this means that the specific heat approaches zero when T approaches zero. It is
as if the degrees of freedom corresponding to crystalline vibrations were “frozen” beneath
a certain temperature and no longer entered into the specific heat. This can be understood
physically: at absolute zero, each oscillator is in its ground state $|\,\varphi_0\rangle$; as long as $kT \ll \hbar\omega_E$,
it cannot absorb any thermal energy, since its first excited state has an energy far greater
than kT.
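The freezing of the vibrations at low temperature and the Dulong-Petit plateau can both be read off a short numerical evaluation of the Einstein specific heat. The sketch below uses the reduced temperature t = kT/ħω_E and expresses c_V in units of 3𝒩k; the function names are illustrative, and the formula is written with e^{−x} so that it stays finite for very small t.

```python
import math

def einstein_mean_energy(t):
    """<H> of one oscillator in units of ħω_E, with t = kT/ħω_E."""
    return 0.5 + 1.0 / math.expm1(1.0 / t)

def einstein_cv(t):
    """c_V / (3 N k) = x^2 e^x / (e^x - 1)^2 with x = 1/t, in overflow-safe form."""
    x = 1.0 / t
    return x * x * math.exp(-x) / math.expm1(-x) ** 2

for t in (0.05, 0.2, 0.5, 1.0, 5.0):
    print(f"kT/ħω_E = {t:4.2f}   c_V/3Nk = {einstein_cv(t):.6f}")
```

Since c_V is proportional to the temperature derivative of ⟨H⟩, the same values can be recovered by differentiating `einstein_mean_energy` numerically, which is a convenient consistency check on the closed form.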


comments: 

(i) Comparison with the specific heat of a two-level system 

We can apply an analogous argument to a sample composed of a set of two-level
systems (for example, a paramagnetic sample composed of $\mathcal{N}$ spin 1/2 particles):
its specific heat $c_V$ is given, to within a coefficient, by the derivative of the curve of
figure 2. For such a system, the variation of $c_V$ with T is shown in figure 4.



FIGURE 4 

Specific heat $c_V$ for a set of two-level systems.
$c_V$ approaches zero at high temperatures
because the energy spectrum has an upper
bound.
The behavior for $T \simeq 0$ is the same as in the case of figure 3. However, we see
that $c_V$ approaches zero when $kT \gg E_2 - E_1$, since the mean energy then becomes
independent of T and is equal to $(E_1 + E_2)/2$ (cf. fig. 2). For a two-level system,
$c_V$ therefore has a maximum (Schottky anomaly) whose physical interpretation is the
following: like the harmonic oscillator, the two-level system cannot absorb any thermal
energy at very low temperatures, as long as $E_2 - E_1 \gg kT$; $c_V$ is therefore zero at the
origin. Then, as T increases, $E_2$ becomes populated, and $c_V$ increases. When the temperature
is high enough for the two populations to be practically equal, the system
cannot absorb any more thermal energy, since the populations can no longer change:
$c_V$ therefore approaches zero when $T \to \infty$.

(ii) Einstein’s model enables us to understand simply why the specific heat $c_V$ approaches
zero when the temperature T approaches zero (a classically inexplicable result). However,
it is too schematic to describe the exact dependence of $c_V$ at low temperatures.

In a real crystal, the various oscillators are coupled. This gives rise to a set of
vibrational normal modes (phonons) whose frequencies go from zero to a certain cutoff
frequency (cf. complement J_V). (30) must then be summed over the different possible
frequencies $\nu$ (taking into account the fact that the number of modes whose frequencies
are included between $\nu$ and $\nu + \mathrm{d}\nu$ depends on $\nu$). Thus one finds an expression for the
specific heat which, at low temperatures, varies like $T^3$ (this is confirmed experimentally).
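Returning to comment (i), the maximum of the two-level specific heat can be located numerically. Differentiating the two-level mean energy with respect to T gives, per system and in units of k, c = x² eˣ/(eˣ + 1)² with x = (E₂ − E₁)/kT; this expression is our own short derivation, and the scan grid below is an arbitrary choice.

```python
import math

def schottky_cv(t):
    """Specific heat per two-level system in units of k, with t = kT/(E2 - E1)."""
    x = 1.0 / t
    return x * x * math.exp(-x) / (1.0 + math.exp(-x)) ** 2

# Scan reduced temperatures from 0.01 to 3.00 and keep the largest value.
grid = [0.01 + 0.001 * i for i in range(2991)]
t_peak = max(grid, key=schottky_cv)
print(f"maximum near kT/(E2 - E1) = {t_peak:.3f}, c/k = {schottky_cv(t_peak):.3f}")
```

The specific heat vanishes at both ends of the scan and passes through a single maximum when kT is somewhat less than half the level spacing, which is the Schottky anomaly described above.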


4. Probability distribution of the observable X 

a. DEFINITION OF THE PROBABILITY DENSITY ρ(x)

Let us return to the one-dimensional harmonic oscillator in thermodynamic
equilibrium. We seek the probability $\rho(x)\,\mathrm{d}x$ of finding, in a measurement of the
position X of the particle, a result included between x and x + dx. It is clear that $\rho(x)$
plays an important role in a large number of physical problems. For example, for
a solid described by Einstein’s model, the width of $\rho(x)$ gives an idea of the
amplitude of atomic vibrations; the study of the variation of this width with
respect to T makes it possible to understand the phenomenon of melting [which
occurs when the width of $\rho(x)$ is no longer negligible compared to the interatomic
distance].

When the oscillator is in the stationary state $|\,\varphi_n\rangle$, the corresponding
probability density $\rho_n(x)$ is:

$$
\rho_n(x) = |\varphi_n(x)|^2 = \langle x\,|\,\varphi_n\rangle\langle\varphi_n\,|\,x\rangle \tag{31}
$$

At thermodynamic equilibrium, the oscillator is described by a statistical mixture
of the states $|\,\varphi_n\rangle$ with the weights $Z^{-1} e^{-E_n/kT}$. The probability density $\rho(x)$ is
then:

$$
\rho(x) = Z^{-1} \sum_n \rho_n(x)\; e^{-E_n/kT} \tag{32}
$$

$\rho(x)$ is the weighted sum of the probability densities $\rho_n(x)$ associated with the
various states $|\,\varphi_n\rangle$. Some of the $\rho_n(x)$ are shown in figures 5 and 6 of chapter V.
We shall see later that the oscillations of the functions $\rho_n(x)$ which are visible in
these figures disappear in the summation over n: we shall show that $\rho(x)$ is simply
a Gaussian function.
The probability density $\rho(x)$ defined in (32) is related very simply to the density
operator $\rho$ of the harmonic oscillator in thermodynamic equilibrium. Using (31)
and (32), we obtain:

$$
\rho(x) = Z^{-1} \sum_n e^{-E_n/kT}\, \langle x\,|\,\varphi_n\rangle\langle\varphi_n\,|\,x\rangle \tag{33}
$$

On the right-hand side, we can bring in the operator $e^{-H/kT}$ which, taking into
account the closure relation for the states $|\,\varphi_n\rangle$, can be written:

$$
e^{-H/kT} = e^{-H/kT} \sum_n |\,\varphi_n\rangle\langle\varphi_n\,| = \sum_n e^{-E_n/kT}\, |\,\varphi_n\rangle\langle\varphi_n\,| \tag{34}
$$

We then see that:

$$
\rho(x) = Z^{-1}\, \langle x\,|\,e^{-H/kT}\,|\,x\rangle = \langle x\,|\,\rho\,|\,x\rangle \tag{35}
$$

where the density operator $\rho$ is given by formula (1). $\rho(x)$ can then be seen to be
the diagonal element of $\rho$ which corresponds to the ket $|\,x\rangle$.

b. CALCULATION OF ρ(x)


We know that:

$$
H = \hbar\omega\left(a^{\dagger}a + \frac{1}{2}\right) \tag{36}
$$

so that $\rho(x)$ can be written in the form:

$$
\rho(x) = Z^{-1}\, e^{-\lambda/2}\, F_{\lambda}(x) \tag{37}
$$

with:

$$
\lambda = \frac{\hbar\omega}{kT} \tag{38}
$$

and:

$$
F_{\lambda}(x) = \langle x\,|\,e^{-\lambda a^{\dagger}a}\,|\,x\rangle \tag{39}
$$

In order to know $\rho(x)$, all we need to do, therefore, is evaluate this diagonal matrix element.

To do this, let us calculate the variation in $F_{\lambda}(x)$ when x is changed to x + dx. Since
the ket $|\,x + \mathrm{d}x\rangle$ is given by [cf. complement E_II, relation (20)]:

$$
|\,x + \mathrm{d}x\rangle = \left(1 - i\,\frac{\mathrm{d}x}{\hbar}\,P\right)|\,x\rangle \tag{40}
$$

we obtain, substituting this relation and the adjoint relation into (39) (neglecting infinitesimals
of second order in dx):

$$
F_{\lambda}(x + \mathrm{d}x) = F_{\lambda}(x) + i\,\frac{\mathrm{d}x}{\hbar}\,\langle x\,|\,[P, e^{-\lambda a^{\dagger}a}]\,|\,x\rangle \tag{41}
$$

The matrix element appearing on the right-hand side of (41) involves the operator P,
which is proportional to $(a - a^{\dagger})$. Now, it is the X operator, proportional to $(a + a^{\dagger})$, which
acts in a simple way on the kets $|\,x\rangle$. We shall therefore transform $[P, e^{-\lambda a^{\dagger}a}]$ so as to make X
Therefore, we know $\rho(x)$ to within a constant factor, since, according to (37):

$$
\rho(x) = Z^{-1}\, e^{-\lambda/2}\, F_{\lambda}(0)\; e^{-x^2/\xi^2} \tag{54}
$$

Since we know that the integral of $\rho(x)$ over the whole x-axis must be equal to 1, we obtain
finally:

$$
\rho(x) = \frac{1}{\xi\sqrt{\pi}}\; e^{-x^2/\xi^2} \tag{55}
$$

The function $\rho(x)$ is thus a Gaussian, whose width is characterized by the length $\xi$ defined
in (52).


c. DISCUSSION 


Starting from the probability density (55), it is easy to calculate:

$$
\langle X\rangle = 0, \qquad \langle X^2\rangle = (\Delta X)^2 = \frac{\xi^2}{2} \tag{56}
$$

Figure 5 shows the variation of $(\Delta X)^2$ with respect to T. We see from (52) that $(\Delta X)^2$ is
equal to $\hbar/2m\omega$ when T = 0. This result is not surprising: at T = 0, the oscillator
is in its ground state, and $\rho(x)$ is equal to $|\varphi_0(x)|^2$; $\Delta X$ is found to be the root-mean-square
deviation of X in the ground state [cf. formula (D-5-a) of chapter V].
Then, $(\Delta X)^2$ increases, and, when $kT \gg \hbar\omega$:

$$
(\Delta X)^2 \underset{T \to \infty}{\longrightarrow} \frac{kT}{m\omega^2} \tag{57}
$$

In this case, furthermore, $\rho(x)$ becomes identical to the probability density of a
classical oscillator in thermodynamic equilibrium at the temperature T:

$$
\rho_{cl}(x) = \frac{e^{-V(x)/kT}}{\displaystyle\int_{-\infty}^{+\infty} e^{-V(x)/kT}\, \mathrm{d}x}
= \frac{1}{\sqrt{\dfrac{2\pi kT}{m\omega^2}}}\; e^{-\frac{m\omega^2 x^2}{2kT}} \tag{58}
$$

which leads to $(\Delta x_{cl})^2 = kT/m\omega^2$ (the straight dashed line in figure 5). Here again,
classical and quantum mechanical predictions meet for $kT \gg \hbar\omega$.



FIGURE 5 

Variation with respect to the temperature T
of $(\Delta X)^2$ for a harmonic oscillator in thermodynamic
equilibrium. When $T \to \infty$, $\Delta X$ is
identical to the classical value, shown by the
dashed line; at low temperatures, quantum
mechanical effects (Heisenberg uncertainty
relation) prevent $\Delta X$ from approaching zero.
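The two limits shown in figure 5 can be reproduced numerically. The sketch below relies on the standard chapter V result that ⟨X²⟩ in the state |φₙ⟩ equals (n + 1/2)ħ/mω, averages it over the Boltzmann weights, and compares the result with the closed form (ħ/2mω) coth(ħω/2kT), which we assume to be equivalent to the definition of ξ referred to as (52). Lengths squared are in units of ħ/mω and t = kT/ħω; the function names are illustrative.

```python
import math

def variance_sum(t, n_max=4000):
    """Thermal average of (n + 1/2), i.e. <X^2> in units of ħ/(mω), t = kT/ħω."""
    x = 1.0 / t
    weights = [math.exp(-n * x) for n in range(n_max)]
    return sum((n + 0.5) * w for n, w in enumerate(weights)) / sum(weights)

def variance_closed(t):
    """(1/2) coth(ħω / 2kT) in the same units."""
    return 0.5 / math.tanh(0.5 / t)

for t in (0.05, 0.5, 2.0, 10.0):
    print(f"kT/ħω = {t:5.2f}   <X^2> = {variance_closed(t):9.5f}   classical = {t:9.5f}")
```

At low temperature the variance stays at the ground-state value 1/2 (that is, ħ/2mω), while at high temperature it approaches the classical value t (that is, kT/mω²).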
Now, let us apply the preceding results to the problem of melting of a solid
body (for simplicity, we shall choose the one-dimensional Einstein model; see
complement A_V). Experiments show that the solid melts when $\Delta X$ is of the order
of an appreciable fraction of the interatomic distance d. Consequently, the melting
point temperature $T_f$ is given approximately by:



(59) 


where $\xi$ can be replaced by its expression (52), with $T = T_f$. Assuming that $T_f$ is large
enough that $kT_f \gg \hbar\omega_E$, we can use in (59) the asymptotic form (57)*, and we
obtain the law for $T_f$:


$$
\frac{kT_f}{m\omega_E^2} \propto d^2 \tag{60}
$$


If we set:

$$
\hbar\omega_E = k\,\Theta_E \tag{61}
$$

($\Theta_E$ is called the “Einstein temperature”) and if we note that d does not vary very
much from one substance to another (anyway, much less than $\omega_E$, that is, $\Theta_E$),
we find the approximate law:

$$
T_f \simeq \text{const} \times m\,\Theta_E^2 \tag{62}
$$


The melting point temperature of a crystal is therefore approximately proportional 
to the square of a vibrational frequency which is characteristic of the crystal. 


d. BLOCH'S THEOREM 


Consider the operator $e^{-iqX}$, where $q$ is a real variable. Its mean value:

$$\langle e^{-iqX} \rangle = \mathrm{Tr}\left[\rho\, e^{-iqX}\right] \tag{63}$$

[where $\rho$ is given by (1)] is a function of $q$, which we shall denote by $f(q)$:

$$f(q) = \langle e^{-iqX} \rangle \tag{64}$$

In probability theory, $f(q)$ is called the characteristic function of the random variable $x$.

It is easy to calculate $f(q)$ if we place ourselves in the $\{\,|x\rangle\,\}$ representation:

$$f(q) = \int dx\, \langle x|\,\rho\, e^{-iqX}\,|x\rangle = \int dx\, \langle x|\,\rho\,|x\rangle\, e^{-iqx} = \int dx\, \rho(x)\, e^{-iqx} \tag{65}$$


* This is not always possible. Recall that helium remains liquid at atmospheric pressure, even at
$T = 0$; $\xi$ is never negligible compared to $d$, whatever $T$ may be (cf. complement $A_V$).
To within a factor of $\sqrt{2\pi}$, $f(q)$ is therefore the Fourier transform of the function $\rho(x)$ calculated
above (§ 4-b). Since $\rho(x)$ is a Gaussian [formula (55)], $f(q)$ is also a Gaussian [cf. appendix I,
relation (50)]:

$$f(q) = e^{-q^2\xi^2/4} \tag{66}$$

which, according to formula (56), can be written:

$$\langle e^{-iqX} \rangle = e^{-\frac{1}{2}q^2(\Delta X)^2} \tag{67}$$
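
The Gaussian form of the characteristic function can be verified numerically. The sketch below assumes that the density of formula (55) is the normalized Gaussian $\rho(x) = (\xi\sqrt{\pi})^{-1} e^{-x^2/\xi^2}$, evaluates the integral $\int dx\,\rho(x)\,e^{-iqx}$ by the trapezoidal rule, and compares it with $e^{-q^2\xi^2/4}$. The function names and the value of $\xi$ are illustrative choices.

```python
import cmath
import math

def rho(x, xi):
    """Assumed form of the density: normalized Gaussian of width xi."""
    return math.exp(-x * x / (xi * xi)) / (xi * math.sqrt(math.pi))

def characteristic(q, xi, half_width=12.0, steps=4000):
    """Trapezoidal estimate of the integral of rho(x) exp(-i q x) over [-L, L]."""
    L = half_width * xi
    h = 2 * L / steps
    total = 0j
    for i in range(steps + 1):
        x = -L + i * h
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * rho(x, xi) * cmath.exp(-1j * q * x)
    return total * h

def gaussian_prediction(q, xi):
    """Closed form exp(-q^2 xi^2 / 4)."""
    return math.exp(-q * q * xi * xi / 4)

if __name__ == "__main__":
    xi = 1.3
    for q in (0.0, 0.5, 1.0, 2.0):
        numeric = characteristic(q, xi)
        print(q, abs(numeric - gaussian_prediction(q, xi)))
```

Since $(\Delta X)^2 = \xi^2/2$ for this density, the same comparison also checks the form $e^{-q^2(\Delta X)^2/2}$.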

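We could perform calculations analogous to those of §§ 4-a and b above for the observable
$P$ instead of $X$. We would then define the probability density $\bar\rho(p)$ by:

$$\bar\rho(p) = Z^{-1} \sum_n e^{-E_n/kT}\, |\bar\varphi_n(p)|^2 \tag{68}$$

Formula (24) of complement $D_V$ shows that:

$$\bar\rho(p) = \frac{1}{m\omega}\, \rho\!\left(x = \frac{p}{m\omega}\right) \tag{69}$$

Therefore:

$$\bar\rho(p) = \frac{1}{m\omega\xi\sqrt{\pi}}\; e^{-p^2/m^2\omega^2\xi^2} \tag{70}$$

Consequently, the study of $\langle e^{-iqP} \rangle$ would lead to the same result as in (67):

$$\langle e^{-iqP} \rangle = e^{-\frac{1}{2}q^2(\Delta P)^2} \tag{71}$$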

The generalization of formulas (67) and (71) is known as Bloch’s theorem : if G(X , P) 
is an arbitrary linear combination of the position X and the momentum P of a one-dimensional 
harmonic oscillator in thermodynamic equilibrium at the temperature T, then: 

$$\langle e^{-iqG} \rangle = e^{-\frac{q^2}{2}\langle G^2 \rangle} \tag{72}$$

This theorem is used in solid state physics, for example in the theory of emission without recoil
by the nuclei of a crystalline lattice (the Mössbauer effect).


References and suggestions for further reading: 

Specific heats: Kittel (8.2), chap. 6, p. 91 and 100; Kittel (13.2), chap. 6; 
Seitz (13.4), chap. III; Ziman (13.3), chap. 2.

Blackbody radiation: Eisberg and Resnick (1.3), chap. 1; Kittel (8.2), chap. 15; 
Reif (8.4) §§9-13 to 9-15; Bruhat (8.3), chap. XXII. 

Bloch’s theorem: Messiah (1.17), chap. XII, §11-12. 
Complement $M_V$
EXERCISES 


1. Consider a harmonic oscillator of mass $m$ and angular frequency $\omega$. At
time $t = 0$, the state of this oscillator is given by:

$$|\psi(0)\rangle = \sum_n c_n\, |\varphi_n\rangle$$

where the states $|\varphi_n\rangle$ are stationary states with energies $(n + 1/2)\hbar\omega$.

a. What is the probability $\mathscr{P}$ that a measurement of the oscillator's energy
performed at an arbitrary time $t > 0$ will yield a result greater than $2\hbar\omega$?
When $\mathscr{P} = 0$, what are the non-zero coefficients $c_n$?

b. From now on, assume that only $c_0$ and $c_1$ are different from zero. Write the
normalization condition for $|\psi(0)\rangle$ and the mean value $\langle H \rangle$ of the energy in
terms of $c_0$ and $c_1$. With the additional requirement $\langle H \rangle = \hbar\omega$, calculate $|c_0|^2$
and $|c_1|^2$.

c. As the normalized state vector $|\psi(0)\rangle$ is defined only to within a global
phase factor, we fix this factor by choosing $c_0$ real and positive. We set: $c_1 = |c_1|\, e^{i\theta_1}$.
We assume that $\langle H \rangle = \hbar\omega$ and that:

$$\langle X \rangle = \frac{1}{2}\sqrt{\frac{\hbar}{m\omega}}$$

Calculate $\theta_1$.

d. With $|\psi(0)\rangle$ so determined, write $|\psi(t)\rangle$ for $t > 0$ and calculate the
value of $\theta_1$ at $t$. Deduce the mean value $\langle X \rangle(t)$ of the position at $t$.


2. Anisotropic three-dimensional harmonic oscillator 


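In a three-dimensional problem, consider a particle of mass $m$ and of potential
energy:

$$V(X, Y, Z) = \frac{m\omega^2}{2}\left[\left(1 + \frac{2\lambda}{3}\right)(X^2 + Y^2) + \left(1 - \frac{4\lambda}{3}\right)Z^2\right]$$

where $\omega$ and $\lambda$ are constants which satisfy:

$$\omega \geq 0, \qquad 0 \leq \lambda < \frac{3}{4}$$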

a. What are the eigenstates of the Hamiltonian and the corresponding 
energies? 
b. Calculate and discuss, as functions of $\lambda$, the variation of the energy, the
parity and the degree of degeneracy of the ground state and the first two excited
states.


3. Harmonic oscillator: two particles 

Two particles of the same mass $m$, with positions $X_1$ and $X_2$ and momenta $P_1$
and $P_2$, are subject to the same potential:

$$V(X) = \frac{1}{2}m\omega^2X^2$$

The two particles do not interact.

a. Write the operator $H$, the Hamiltonian of the two-particle system. Show
that $H$ can be written:

$$H = H_1 + H_2$$

where $H_1$ and $H_2$ act respectively only in the state space of particle (1) and in that
of particle (2). Calculate the energies of the two-particle system, their degrees
of degeneracy, and the corresponding wave functions.

b. Does $H$ form a C.S.C.O.? Same question for the set $\{H_1, H_2\}$. We denote
by $|\Phi_{n_1,n_2}\rangle$ the eigenvectors common to $H_1$ and $H_2$. Write the orthonormalization
and closure relations for the states $|\Phi_{n_1,n_2}\rangle$.

c. Consider a system which, at t = 0, is in the state : 

$$|\psi(0)\rangle = \frac{1}{2}\left(|\Phi_{0,0}\rangle + |\Phi_{1,0}\rangle + |\Phi_{0,1}\rangle + |\Phi_{1,1}\rangle\right)$$

What results can be found, and with what probabilities, if at this time one measures: 

— the total energy of the system ? 

— the energy of particle (1)? 

— the position or velocity of this particle ? 

4. (This exercise is a continuation of the preceding one and uses the same nota¬ 
tion.) 

The two-particle system, at $t = 0$, is in the state $|\psi(0)\rangle$ given in exercise 3.

a. At $t = 0$, one measures the total energy $H$ and one finds the result $2\hbar\omega$.

α. Calculate the mean values of the position, the momentum, and the
energy of particle (1) at an arbitrary positive $t$. Same question for particle (2).

β. At $t > 0$, one measures the energy of particle (1). What results can be
found, and with what probabilities? Same question for a measurement of the
position of particle (1); trace the curve for the corresponding probability
density.

b. Instead of measuring the total energy $H$, at $t = 0$, one measures the
energy $H_2$ of particle (2); the result obtained is $\hbar\omega/2$. What happens to the
answers to questions α and β of a?

5. (This exercise is a continuation of exercise 3 and uses the same notation.) 

We denote by $|\Phi_{n_1,n_2}\rangle$ the eigenstates common to $H_1$ and $H_2$, of eigenvalues
$(n_1 + 1/2)\hbar\omega$ and $(n_2 + 1/2)\hbar\omega$. The “two particle exchange” operator $P_e$ is
defined by:

$$P_e\,|\Phi_{n_1,n_2}\rangle = |\Phi_{n_2,n_1}\rangle$$

a. Prove that $P_e^{-1} = P_e$ and that $P_e$ is unitary. What are the eigenvalues
of $P_e$? Let $B' = P_e B P_e^\dagger$ be the observable resulting from the transformation by $P_e$
of an arbitrary observable $B$. Show that the condition $B' = B$ ($B$ invariant under
exchange of the two particles) is equivalent to $[B, P_e] = 0$.

b. Show that:

$$P_e H_1 P_e^\dagger = H_2$$

$$P_e H_2 P_e^\dagger = H_1$$

Does $H$ commute with $P_e$? Calculate the action of $P_e$ on the observables $X_1$,
$P_1$, $X_2$, $P_2$.

c. Construct a basis of eigenvectors common to $H$ and $P_e$. Do these two
operators form a C.S.C.O.? What happens to the spectrum of $H$ and the degeneracy
of its eigenvalues if one retains only the eigenvectors $|\Phi\rangle$ of $H$ for
which $P_e\,|\Phi\rangle = -|\Phi\rangle$?


6. Charged harmonic oscillator 
in a variable electric field 

A one-dimensional harmonic oscillator is composed of a particle of mass $m$,
charge $q$ and potential energy $V(X) = \frac{1}{2}m\omega^2X^2$. We assume in this exercise that
the particle is placed in an electric field $\mathcal{E}(t)$ parallel to $Ox$ and time-dependent,
so that to $V(X)$ must be added the potential energy:

$$W(t) = -q\,\mathcal{E}(t)\,X$$

a. Write the Hamiltonian $H(t)$ of the particle in terms of the operators $a$
and $a^\dagger$. Calculate the commutators of $a$ and $a^\dagger$ with $H(t)$.

b. Let $\alpha(t)$ be the number defined by:

$$\alpha(t) = \langle \psi(t)|\,a\,|\psi(t)\rangle$$

where $|\psi(t)\rangle$ is the normalized state vector of the particle under study. Show from
the results of the preceding question that $\alpha(t)$ satisfies the differential equation:

$$\frac{d}{dt}\alpha(t) = -i\omega\,\alpha(t) + i\lambda(t)$$

where $\lambda(t)$ is defined by:

$$\lambda(t) = \frac{q}{\sqrt{2m\hbar\omega}}\;\mathcal{E}(t)$$

Integrate this differential equation. At time $t$, what are the mean values of the
position and momentum of the particle?

c. The ket $|\varphi(t)\rangle$ is defined by:

$$|\varphi(t)\rangle = [a - \alpha(t)]\,|\psi(t)\rangle$$

where $\alpha(t)$ has the value calculated in b. Using the results of questions a and b,
show that the evolution of $|\varphi(t)\rangle$ is given by:

$$i\hbar\frac{d}{dt}|\varphi(t)\rangle = [H(t) + \hbar\omega]\,|\varphi(t)\rangle$$

How does the norm of $|\varphi(t)\rangle$ vary with time?

d. Assuming that $|\psi(0)\rangle$ is an eigenvector of $a$ with the eigenvalue $\alpha(0)$,
show that $|\psi(t)\rangle$ is also an eigenvector of $a$, and calculate its eigenvalue.

Find at time $t$ the mean value of the unperturbed Hamiltonian

$$H_0 = H(t) - W(t)$$

as a function of $\alpha(0)$. Give the root-mean-square deviations $\Delta X$, $\Delta P$ and $\Delta H_0$; how
do they vary with time?

e. Assume that at $t = 0$, the oscillator is in the ground state $|\varphi_0\rangle$. The electric
field acts between times $0$ and $T$ and then falls to zero. When $t > T$, what is the
evolution of the mean values $\langle X \rangle(t)$ and $\langle P \rangle(t)$? Application: assume that
between $0$ and $T$, the field $\mathcal{E}(t)$ is given by $\mathcal{E}(t) = \mathcal{E}_0 \cos(\omega' t)$; discuss the phenomena
observed (resonance) in terms of $\Delta\omega = \omega' - \omega$. If, at $t > T$, the energy is measured,
what results can be found, and with what probabilities?


7. Consider a one-dimensional harmonic oscillator of Hamiltonian $H$ and
stationary states $|\varphi_n\rangle$:

$$H\,|\varphi_n\rangle = (n + 1/2)\hbar\omega\,|\varphi_n\rangle$$

The operator $U(k)$ is defined by:

$$U(k) = e^{ikX}$$

where $k$ is real.

a. Is $U(k)$ unitary? Show that, for all $n$, its matrix elements satisfy the
relation:

$$\sum_{n'} \left|\langle \varphi_n|\,U(k)\,|\varphi_{n'}\rangle\right|^2 = 1$$

b. Express $U(k)$ in terms of the operators $a$ and $a^\dagger$. Use Glauber’s formula
[formula (63) of complement $B_{II}$] to put $U(k)$ in the form of a product of exponential
operators.

c. Establish the relations:

$$e^{\lambda a}\,|\varphi_0\rangle = |\varphi_0\rangle$$

$$\langle \varphi_n|\,e^{\lambda a^\dagger}\,|\varphi_0\rangle = \frac{\lambda^n}{\sqrt{n!}}$$

where $\lambda$ is an arbitrary complex parameter.

d. Find the expression, in terms of $E_k = \hbar^2k^2/2m$ and $E_\omega = \hbar\omega$, for the
matrix element:

$$\langle \varphi_0|\,U(k)\,|\varphi_n\rangle$$

What happens when $k$ approaches zero? Could this result have been predicted
directly?


8. The evolution operator $U(t, 0)$ of a one-dimensional harmonic oscillator is
written:

$$U(t, 0) = e^{-iHt/\hbar}$$

with:

$$H = \frac{P^2}{2m} + \frac{1}{2}m\omega^2X^2$$

a. Consider the operators:

$$\tilde a(t) = U^\dagger(t, 0)\; a\; U(t, 0)$$

$$\tilde a^\dagger(t) = U^\dagger(t, 0)\; a^\dagger\; U(t, 0)$$

By calculating their action on the eigenkets $|\varphi_n\rangle$ of $H$, find the expression
for $\tilde a(t)$ and $\tilde a^\dagger(t)$ in terms of $a$ and $a^\dagger$.

b. Calculate the operators $\tilde X(t)$ and $\tilde P(t)$ obtained from $X$ and $P$ by the
unitary transformation:

$$\tilde X(t) = U^\dagger(t, 0)\; X\; U(t, 0)$$

$$\tilde P(t) = U^\dagger(t, 0)\; P\; U(t, 0)$$

How can the relations so obtained be interpreted?

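c. Show that $U^\dagger\!\left(\frac{\pi}{2\omega}, 0\right)|x\rangle$ is an eigenvector of $P$ and specify its eigenvalue.

Similarly, establish that $U^\dagger\!\left(\frac{\pi}{2\omega}, 0\right)|p\rangle$ is an eigenvector of $X$.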

d. At $t = 0$, the wave function of the oscillator is $\psi(x, 0)$. How can one
obtain from $\psi(x, 0)$ the wave function of the oscillator at all subsequent
times $t_q = q\pi/2\omega$ (where $q$ is a positive integer)?

e. Choose for $\psi(x, 0)$ the wave function $\varphi_n(x)$ associated with a stationary
state. From the preceding question derive the relation which must exist between $\varphi_n(x)$
and its Fourier transform $\bar\varphi_n(p)$.

f. Describe qualitatively the evolution of the wave function in the following
cases:

(i) $\psi(x, 0) = e^{ikx}$ where $k$, real, is given.

(ii) $\psi(x, 0) = e^{-\rho x}$ where $\rho$ is real and positive.

(iii)


o) 



1=0 



everywhere else 


(iv) $\psi(x, 0) = e^{-\rho^2x^2}$ where $\rho$ is real.
A. INTRODUCTION:

THE IMPORTANCE OF ANGULAR MOMENTUM 


The present chapter is the first in a series of four chapters (VI, VII, IX and X) 
devoted to the study of angular momenta in quantum mechanics. This is an 
extremely important problem, and the results we are going to establish are used 
in many domains of physics : the classification of atomic, molecular and nuclear 
spectra, the spin of elementary particles, magnetism, etc... 

We already know the important role played by angular momentum in 
classical mechanics; the total angular momentum of an isolated physical system 
is a constant of the motion. Furthermore, this is also true in certain cases in which 
the system is not isolated. For example, if a point particle P, of mass m, is moving 
in a central potential (one which depends only on the distance between P and 
a fixed point O), the force to which P is subjected is always directed towards O. 
Its moment with respect to O is consequently zero, and the angular momentum 
theorem implies that: 


$$\frac{d}{dt}\,\boldsymbol{\mathcal{L}} = 0 \tag{A-1}$$

where $\boldsymbol{\mathcal{L}}$ is the angular momentum of $P$ with respect to $O$. This fact has important
consequences: the motion of the particle $P$ is limited to a fixed plane (the plane
passing through $O$ and perpendicular to the angular momentum $\boldsymbol{\mathcal{L}}$); moreover,
this motion obeys the law of constant areal velocity (Kepler’s second law).

All these properties have their equivalents in quantum mechanics. With the
angular momentum $\boldsymbol{\mathcal{L}}$ of a classical system is associated an observable $\mathbf{L}$, actually
a set of three observables, $L_x$, $L_y$ and $L_z$, which correspond to the three components
of $\boldsymbol{\mathcal{L}}$ in a Cartesian frame. If the physical system under study is a point moving
in a central potential, we shall see in chapter VII that $L_x$, $L_y$ and $L_z$ are constants
of the motion in a quantum mechanical sense, that is, they commute with the
Hamiltonian $H$ describing the particle in the central potential $V(r)$. This important
property considerably simplifies the determination and classification of eigenstates
of $H$.

Also, we described the Stern-Gerlach experiment in chapter IV and this 
revealed the quantization of angular momentum : the component, along a fixed 
axis, of the intrinsic angular momentum of an atom can take on only certain 
discrete values. We shall see that all angular momenta are quantized in this way. 
This enables us to understand atomic magnetism, the Zeeman effect, etc... Further¬ 
more, to analyze all these phenomena, we must introduce typically quantum 
mechanical angular momenta, which have no classical equivalents (intrinsic angular 
momenta of elementary particles; chap. IX). 

From now on, we shall denote by orbital angular momentum any angular 
momentum which has a classical equivalent (and by L, the corresponding obser¬ 
vables), and by spin angular momentum any intrinsic angular momentum of an 
elementary particle (for which we reserve the letter S). In a complex system, such 
as an atom, a nucleus or a molecule, the orbital angular momenta $\mathbf{L}_i$ of the various
elementary particles which constitute the system (electrons, protons, neutrons,...)
combine with each other and with the spin angular momenta $\mathbf{S}_i$ of these same
particles to form the total angular momentum $\mathbf{J}$ of the system. The way in which
angular momenta are combined in quantum mechanics (coupling of angular 
momenta) will be studied in chapter X. Finally, let us add that J will also be used 
to denote an arbitrary angular momentum when it is not necessary to specify 
whether we are dealing with an orbital angular momentum, a spin, or a combination 
of several angular momenta. 

Before beginning the study of the physical problems just mentioned (energy 
levels of a particle in a central potential, spin, the Zeeman effect, addition of angular 
momenta,...), we shall establish, in this chapter, the general quantum mechanical 
properties associated with all angular momenta, whatever their nature. 

These properties follow from commutation relations satisfied by the three 
observables J x , J y and J z , the components of an arbitrary angular momentum J. 
The origin of these commutation relations is discussed in §B : for an orbital angular 
momentum, they are simply consequences of the quantization rules (§B-5 of
chapter III) and the canonical commutation relations [formulas (E-30) of chapter II];
for spin angular momenta, which have no classical equivalents, they actually serve as 
definitions of the corresponding observables*. In §C, we study the consequences 
of these commutation relations which are characteristic of angular momenta. 
In particular, we discuss space quantization, that is, the fact that any component 
of an angular momentum possesses a discrete spectrum. Finally, the general 
results so obtained are applied, in §D, to the orbital angular momentum of a particle. 


B. COMMUTATION RELATIONS CHARACTERISTIC 
OF ANGULAR MOMENTUM 


1. Orbital angular momentum 

To obtain the observables $L_x$, $L_y$, $L_z$ associated in quantum mechanics with
the three components of the angular momentum $\boldsymbol{\mathcal{L}}$ of a spinless particle, we simply
apply the quantization rules stated in §B-5 of chapter III. Consider, for example, the
component $\mathcal{L}_x$ of the classical angular momentum:

$$\mathcal{L}_x = y\,p_z - z\,p_y \tag{B-1}$$

We associate with the position variables $y$ and $z$ the observables $Y$ and $Z$, and with
the momentum variables $p_y$ and $p_z$ the observables $P_y$ and $P_z$. Although formula (B-1)
involves products of two classical variables, no precautions need be taken in replacing
them by the corresponding observables, since $Y$ and $P_z$ commute, as do $Z$ and $P_y$ [see
the canonical commutation relations (E-30) of chapter II]. We therefore do not need
to symmetrize expression (B-1) in order to obtain the operator $L_x$:

$$L_x = YP_z - ZP_y \tag{B-2}$$


* The fundamental origin of these commutation relations is purely geometrical. We shall discuss 
this point in detail in complement B VI , in which we demonstrate the intimate relation between the angular 
momentum of a system with respect to a point O and the geometrical rotations of this system about O. 
For the same reason ($Y$ and $P_z$ commute, as do $Z$ and $P_y$), the operator thus obtained
is Hermitian.

In the same way, we find the operators $L_y$ and $L_z$ corresponding to the
components $\mathcal{L}_y$ and $\mathcal{L}_z$ of the classical angular momentum. This allows us to write:

$$\mathbf{L} = \mathbf{R} \times \mathbf{P} \tag{B-3}$$

Since we know the canonical commutation relations of the position R and 
momentum P observables, we can easily calculate the commutators of the ope¬ 
rators L x , L y and L z . 

For example, let us evaluate $[L_x, L_y]$:

$$\begin{aligned}
[L_x, L_y] &= [YP_z - ZP_y,\; ZP_x - XP_z] \\
&= [YP_z, ZP_x] + [ZP_y, XP_z]
\end{aligned} \tag{B-4}$$

since $YP_z$ commutes with $XP_z$, and $ZP_y$ with $ZP_x$. We then have:

$$\begin{aligned}
[L_x, L_y] &= Y[P_z, Z]P_x + X[Z, P_z]P_y \\
&= -i\hbar\, YP_x + i\hbar\, XP_y \\
&= i\hbar\, L_z
\end{aligned} \tag{B-5}$$

Analogous calculations yield the other two commutators, and we obtain, finally:

$$\begin{aligned}
[L_x, L_y] &= i\hbar\, L_z \\
[L_y, L_z] &= i\hbar\, L_x \\
[L_z, L_x] &= i\hbar\, L_y
\end{aligned} \tag{B-6}$$

Thus we have established the commutation relations for the components of the 
angular momentum of a spinless particle. 

This result can immediately be generalized to a system of N spinless particles. 
The total angular momentum of such a system is, in quantum mechanics: 

$$\mathbf{L} = \sum_{i=1}^{N} \mathbf{L}_i \tag{B-7}$$

with:

$$\mathbf{L}_i = \mathbf{R}_i \times \mathbf{P}_i \tag{B-8}$$

Now, each of the individual angular momenta $\mathbf{L}_i$ satisfies the commutation relations
(B-6) and commutes with $\mathbf{L}_j$ when $j$ is not equal to $i$ (operators acting in state
spaces of different particles). Thus we see that relations (B-6) remain valid for the
total angular momentum $\mathbf{L}$.


2. Generalization : definition of an angular momentum 

The three operators associated with the components of an arbitrary classical 
angular momentum therefore satisfy the commutation relations (B-6). It can be 
shown, moreover (cf. complement B VI ), that the origin of these relations lies in 
the geometric properties of rotations in three-dimensional space. This is why we 
shall adopt a more general point of view and define an angular momentum $\mathbf{J}$ as
any set of three observables $J_x$, $J_y$, $J_z$ which satisfies:

$$\begin{aligned}
[J_x, J_y] &= i\hbar\, J_z \\
[J_y, J_z] &= i\hbar\, J_x \\
[J_z, J_x] &= i\hbar\, J_y
\end{aligned} \tag{B-9}$$

We then introduce the operator:

$$\mathbf{J}^2 = J_x^2 + J_y^2 + J_z^2 \tag{B-10}$$

the (scalar) square of the angular momentum $\mathbf{J}$. This operator is Hermitian,
since $J_x$, $J_y$ and $J_z$ are Hermitian. We shall assume that it is an observable. Let us
show that $\mathbf{J}^2$ commutes with the three components of $\mathbf{J}$:

$$[\mathbf{J}^2, \mathbf{J}] = 0 \tag{B-11}$$

We perform the calculation for $J_x$, for example:

$$\begin{aligned}
[\mathbf{J}^2, J_x] &= [J_x^2 + J_y^2 + J_z^2,\; J_x] \\
&= [J_y^2, J_x] + [J_z^2, J_x]
\end{aligned} \tag{B-12}$$

since $J_x$ obviously commutes with itself and, therefore, with its square. The other two
commutators can be obtained easily from (B-9):

$$\begin{aligned}
[J_y^2, J_x] &= J_y[J_y, J_x] + [J_y, J_x]J_y \\
&= -i\hbar\, J_yJ_z - i\hbar\, J_zJ_y
\end{aligned} \tag{B-13-a}$$

$$\begin{aligned}
[J_z^2, J_x] &= J_z[J_z, J_x] + [J_z, J_x]J_z \\
&= i\hbar\, J_zJ_y + i\hbar\, J_yJ_z
\end{aligned} \tag{B-13-b}$$

The sum of these two commutators, which enters into (B-12), is indeed zero.

Angular momentum theory in quantum mechanics is founded entirely on 
the commutation relations (B-9). Note that these relations imply that it is impossible 
to measure simultaneously the three components of an angular momentum; 
however, J 2 and any component of J are compatible. 
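
As a concrete check of relations (B-9) and (B-11), one can take an explicit set of three matrices and compute the commutators directly. The sketch below uses the usual $3 \times 3$ matrices for $j = 1$ in units where $\hbar = 1$; this particular representation, the basis ordering and the helper names are illustrative assumptions, not something established at this point of the text.

```python
# j = 1 matrices in units where hbar = 1 (basis ordered m = +1, 0, -1).
# The choice of this representation is an illustrative assumption.
S = 2 ** -0.5
JX = [[0, S, 0], [S, 0, S], [0, S, 0]]
JY = [[0, -1j * S, 0], [1j * S, 0, -1j * S], [0, 1j * S, 0]]
JZ = [[1, 0, 0], [0, 0, 0], [0, 0, -1]]
ZERO = [[0, 0, 0], [0, 0, 0], [0, 0, 0]]

def matmul(a, b):
    """Product of two square matrices stored as nested lists."""
    n = len(a)
    return [[sum(a[r][k] * b[k][c] for k in range(n)) for c in range(n)]
            for r in range(n)]

def add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def scale(c, a):
    return [[c * x for x in row] for row in a]

def commutator(a, b):
    """[a, b] = ab - ba."""
    return add(matmul(a, b), scale(-1, matmul(b, a)))

def close(a, b, tol=1e-12):
    return all(abs(x - y) < tol for ra, rb in zip(a, b) for x, y in zip(ra, rb))

# J^2 = Jx^2 + Jy^2 + Jz^2, as in (B-10).
J2 = add(add(matmul(JX, JX), matmul(JY, JY)), matmul(JZ, JZ))

if __name__ == "__main__":
    print("[Jx, Jy] = i Jz :", close(commutator(JX, JY), scale(1j, JZ)))
    print("[Jy, Jz] = i Jx :", close(commutator(JY, JZ), scale(1j, JX)))
    print("[Jz, Jx] = i Jy :", close(commutator(JZ, JX), scale(1j, JY)))
    print("[J^2, Jx] = 0   :", close(commutator(J2, JX), ZERO))
```

The components do not commute with each other, while each of them commutes with $\mathbf{J}^2$, which is the compatibility statement made above.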


3. Statement of the problem 

Let us return to the example of a spinless particle in a central potential,
mentioned in the introduction. We shall see in chapter VII that, in this case, the
three components of the angular momentum $\mathbf{L}$ of the particle commute with the
Hamiltonian $H$; thus, this is also true for the operator $\mathbf{L}^2$. We then have at our
disposal four constants of the motion: $\mathbf{L}^2$, $L_x$, $L_y$, $L_z$. But these four operators do
not all commute; to form a complete set of commuting observables with $H$, we
must pick only $\mathbf{L}^2$ and one of the three other operators, $L_z$ for example. For a
particle subject to a central potential, we can then look for eigenstates of the
Hamiltonian $H$ which are also eigenvectors of $\mathbf{L}^2$ and $L_z$, without restricting the
generality of the problem. However, it is impossible to obtain a basis of the state
space composed of eigenvectors common to the three components of $\mathbf{L}$, as these
three observables do not commute.

The situation is the same in the general case: since the components of an 
arbitrary angular momentum J do not commute, they are not simultaneously 
diagonalizable. We shall therefore seek the system of eigenvectors common to J 2 
and J z , observables corresponding to the square of the absolute value of the angular 
momentum and to its component along the Oz axis. 


C. GENERAL THEORY OF ANGULAR MOMENTUM 

In this section, we shall determine the spectrum of J 2 and J z for the general case 
and study their common eigenvectors. The reasoning will be analogous to the one 
used in chapter V for the harmonic oscillator. 


1. Definitions and notation 

a. THE $J_+$ AND $J_-$ OPERATORS

Instead of using the components $J_x$ and $J_y$ of the angular momentum $\mathbf{J}$, it is
more convenient to introduce the following linear combinations:

$$\begin{aligned}
J_+ &= J_x + iJ_y \\
J_- &= J_x - iJ_y
\end{aligned} \tag{C-1}$$

Like the operators $a$ and $a^\dagger$ of the harmonic oscillator, $J_+$ and $J_-$ are not Hermitian:
they are adjoints of each other.

In the rest of this section, we shall use only the operators $J_+$, $J_-$, $J_z$
and $\mathbf{J}^2$. It is straightforward, using (B-9) and (B-11), to show that these operators
satisfy the commutation relations:

$$[J_z, J_+] = \hbar J_+ \tag{C-2}$$

$$[J_z, J_-] = -\hbar J_- \tag{C-3}$$

$$[J_+, J_-] = 2\hbar J_z \tag{C-4}$$

$$[\mathbf{J}^2, J_+] = [\mathbf{J}^2, J_-] = [\mathbf{J}^2, J_z] = 0 \tag{C-5}$$

Let us calculate the products $J_+J_-$ and $J_-J_+$. We find:

$$\begin{aligned}
J_+J_- &= (J_x + iJ_y)(J_x - iJ_y) \\
&= J_x^2 + J_y^2 - i[J_x, J_y] \\
&= J_x^2 + J_y^2 + \hbar J_z
\end{aligned} \tag{C-6-a}$$

$$\begin{aligned}
J_-J_+ &= (J_x - iJ_y)(J_x + iJ_y) \\
&= J_x^2 + J_y^2 + i[J_x, J_y] \\
&= J_x^2 + J_y^2 - \hbar J_z
\end{aligned} \tag{C-6-b}$$

Using definition (B-10) of the operator $\mathbf{J}^2$, we can write these expressions in the
form:

$$J_+J_- = \mathbf{J}^2 - J_z^2 + \hbar J_z \tag{C-7-a}$$

$$J_-J_+ = \mathbf{J}^2 - J_z^2 - \hbar J_z \tag{C-7-b}$$

Adding relations (C-7) term by term, we obtain:

$$\mathbf{J}^2 = \frac{1}{2}\left(J_+J_- + J_-J_+\right) + J_z^2 \tag{C-8}$$

b. NOTATION FOR THE EIGENVALUES OF $\mathbf{J}^2$ AND $J_z$

According to (B-10), $\mathbf{J}^2$ is the sum of the squares of three Hermitian operators.
Consequently, for any ket $|\psi\rangle$, the matrix element $\langle \psi|\,\mathbf{J}^2\,|\psi\rangle$ is positive or zero:

$$\begin{aligned}
\langle \psi|\,\mathbf{J}^2\,|\psi\rangle &= \langle \psi|\,J_x^2\,|\psi\rangle + \langle \psi|\,J_y^2\,|\psi\rangle + \langle \psi|\,J_z^2\,|\psi\rangle \\
&= \left\|J_x|\psi\rangle\right\|^2 + \left\|J_y|\psi\rangle\right\|^2 + \left\|J_z|\psi\rangle\right\|^2 \geq 0
\end{aligned} \tag{C-9}$$

Note that this could have been expected, since $\mathbf{J}^2$ corresponds to the square of the
absolute value of the angular momentum $\mathbf{J}$. From this we see, in particular, that
all the eigenvalues of $\mathbf{J}^2$ are positive or zero, since if $|\psi\rangle$ is an eigenvector of $\mathbf{J}^2$,
$\langle \psi|\,\mathbf{J}^2\,|\psi\rangle$ is the product of the corresponding eigenvalue and the square of the
norm of $|\psi\rangle$, which is always positive.

We shall write the eigenvalues of $\mathbf{J}^2$ in the form $j(j + 1)\hbar^2$, with the convention:

$$j \geq 0 \tag{C-10}$$

This notation is intended to simplify the arguments which follow; it does not
influence the result. Since $\mathbf{J}$ has the dimensions of $\hbar$, an eigenvalue of $\mathbf{J}^2$ is necessarily
of the form $\lambda\hbar^2$, where $\lambda$ is a real dimensionless number. We have just seen that $\lambda$
must be positive or zero; it can then easily be shown that the second-degree
equation in $j$:

$$j(j + 1) = \lambda \tag{C-11}$$

always has one and only one positive or zero root. Therefore, if we use (C-10),
the specification of $\lambda$ determines $j$ uniquely; any eigenvalue of $\mathbf{J}^2$ can thus be written
$j(j + 1)\hbar^2$, with $j$ positive or zero.
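
The uniqueness of the positive or zero root can be made explicit: the two roots of $j^2 + j - \lambda = 0$ are $j = \frac{1}{2}\left(-1 \pm \sqrt{1 + 4\lambda}\right)$, and for $\lambda \geq 0$ only the upper sign gives $j \geq 0$ (the other root is $-(j + 1) < 0$). A minimal sketch, with a function name of our own choosing:

```python
import math

def j_from_lambda(lam):
    """Return the unique root j >= 0 of j (j + 1) = lam, for lam >= 0."""
    if lam < 0:
        raise ValueError("lambda must be positive or zero")
    return (-1.0 + math.sqrt(1.0 + 4.0 * lam)) / 2.0

if __name__ == "__main__":
    for lam in (0.0, 0.75, 2.0, 3.75, 6.0):
        j = j_from_lambda(lam)
        # The second root of the quadratic is -(j + 1), always negative.
        print(f"lambda = {lam:5.2f}   j = {j:.3f}   other root = {-(j + 1):.3f}")
```

Note that this only inverts the notation $\lambda = j(j + 1)$; at this stage nothing yet restricts $j$ to particular values.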

As for the eigenvalues of $J_z$, which have the same dimensions as $\hbar$, they
are traditionally written $m\hbar$, where $m$ is a dimensionless number.

c. EIGENVALUE EQUATIONS FOR $\mathbf{J}^2$ AND $J_z$

We shall label the eigenvectors common to $\mathbf{J}^2$ and $J_z$ by the indices $j$ and $m$
which characterize the associated eigenvalues. However, $\mathbf{J}^2$ and $J_z$ do not in general
constitute a C.S.C.O. (see, for example, §A of chapter VII), and it is necessary to
introduce a third index in order to distinguish between the different eigenvectors
corresponding to the same eigenvalues $j(j + 1)\hbar^2$ and $m\hbar$ of $\mathbf{J}^2$ and $J_z$ (this point
will be expanded in § 3-a below). We shall call this index $k$ (which does not necessarily
imply that it is always a discrete index).

We shall therefore try to solve the simultaneous eigenvalue equations:

$$\begin{aligned}
\mathbf{J}^2\,|k, j, m\rangle &= j(j + 1)\hbar^2\,|k, j, m\rangle \\
J_z\,|k, j, m\rangle &= m\hbar\,|k, j, m\rangle
\end{aligned} \tag{C-12}$$

2. Eigenvalues of J 2 and J z 

As in § B-2 of chapter V, we shall begin by proving three lemmas which will
then enable us to determine the spectrum of $\mathbf{J}^2$ and $J_z$.

a. LEMMAS

α. Lemma I (Properties of the eigenvalues of $\mathbf{J}^2$ and $J_z$)

If $j(j + 1)\hbar^2$ and $m\hbar$ are the eigenvalues of $\mathbf{J}^2$ and $J_z$ associated with the
same eigenvector $|k, j, m\rangle$, then $j$ and $m$ satisfy the inequality:

$$-j \leq m \leq j \tag{C-13}$$

To prove this, consider the vectors $J_+|k, j, m\rangle$ and $J_-|k, j, m\rangle$, and note
that the square of their norms is positive or zero:

$$\left\|J_+|k, j, m\rangle\right\|^2 = \langle k, j, m|\,J_-J_+\,|k, j, m\rangle \geq 0 \tag{C-14-a}$$

$$\left\|J_-|k, j, m\rangle\right\|^2 = \langle k, j, m|\,J_+J_-\,|k, j, m\rangle \geq 0 \tag{C-14-b}$$

To calculate the left-hand sides of these inequalities, we can use formulas (C-7).
We find (if we assume $|k, j, m\rangle$ to be normalized):

$$\begin{aligned}
\langle k, j, m|\,J_-J_+\,|k, j, m\rangle &= \langle k, j, m|\,(\mathbf{J}^2 - J_z^2 - \hbar J_z)\,|k, j, m\rangle \\
&= j(j + 1)\hbar^2 - m^2\hbar^2 - m\hbar^2
\end{aligned} \tag{C-15-a}$$

$$\begin{aligned}
\langle k, j, m|\,J_+J_-\,|k, j, m\rangle &= \langle k, j, m|\,(\mathbf{J}^2 - J_z^2 + \hbar J_z)\,|k, j, m\rangle \\
&= j(j + 1)\hbar^2 - m^2\hbar^2 + m\hbar^2
\end{aligned} \tag{C-15-b}$$

Substituting these expressions into inequalities (C-14), we obtain:

$$j(j + 1) - m(m + 1) = (j - m)(j + m + 1) \geq 0 \tag{C-16-a}$$

$$j(j + 1) - m(m - 1) = (j - m + 1)(j + m) \geq 0 \tag{C-16-b}$$

that is:

$$-(j + 1) \leq m \leq j \tag{C-17-a}$$

$$-j \leq m \leq j + 1 \tag{C-17-b}$$

These two conditions are satisfied simultaneously only if $m$ satisfies inequality (C-13).

β. Lemma II (Properties of the vector $J_-|k, j, m\rangle$)

Let $|k, j, m\rangle$ be an eigenvector of $\mathbf{J}^2$ and $J_z$ with the eigenvalues $j(j + 1)\hbar^2$
and $m\hbar$.

(i) If $m = -j$, $J_-|k, j, -j\rangle = 0$.

(ii) If $m > -j$, $J_-|k, j, m\rangle$ is a non-null eigenvector of $\mathbf{J}^2$ and $J_z$ with the
eigenvalues $j(j + 1)\hbar^2$ and $(m - 1)\hbar$.

(i) According to (C-15-b), the square of the norm of $J_-|k, j, m\rangle$ is equal
to $\hbar^2[j(j + 1) - m(m - 1)]$ and therefore goes to zero for $m = -j$. Since the
norm of a vector goes to zero if and only if this vector is a null vector, we conclude
that all vectors $J_-|k, j, -j\rangle$ are null:

$$m = -j \;\Longrightarrow\; J_-|k, j, -j\rangle = 0 \tag{C-18}$$

It is easy to establish the converse of (C-18):

$$J_-|k, j, m\rangle = 0 \;\Longrightarrow\; m = -j \tag{C-19}$$

Letting $J_+$ act on both sides of the equation appearing in (C-19), and using (C-7-a),
we obtain:

$$\begin{aligned}
J_+J_-|k, j, m\rangle &= \hbar^2[j(j + 1) - m^2 + m]\,|k, j, m\rangle \\
&= \hbar^2(j + m)(j - m + 1)\,|k, j, m\rangle = 0
\end{aligned} \tag{C-20}$$

Using (C-13), (C-20) has only one solution, $m = -j$.

(ii) Now assume $m$ to be greater than $-j$. According to (C-15-b), $J_-|k, j, m\rangle$
is then a non-null vector since the square of its norm is different from zero.

Let us show that it is an eigenvector of $\mathbf{J}^2$ and $J_z$. The operators $J_-$ and $\mathbf{J}^2$
commute; consequently:

$$[\mathbf{J}^2, J_-]\,|k, j, m\rangle = 0 \tag{C-21}$$

which can be written:

$$\begin{aligned}
\mathbf{J}^2 J_-|k, j, m\rangle &= J_-\mathbf{J}^2\,|k, j, m\rangle \\
&= j(j + 1)\hbar^2\, J_-|k, j, m\rangle
\end{aligned} \tag{C-22}$$

This relation expresses the fact that $J_-|k, j, m\rangle$ is an eigenvector of $\mathbf{J}^2$ with the
eigenvalue $j(j + 1)\hbar^2$.

Moreover, if we apply operator equation (C-3) to $|k, j, m\rangle$:

$$[J_z, J_-]\,|k, j, m\rangle = -\hbar J_-|k, j, m\rangle \tag{C-23}$$

that is:

$$\begin{aligned}
J_zJ_-|k, j, m\rangle &= J_-J_z|k, j, m\rangle - \hbar J_-|k, j, m\rangle \\
&= m\hbar\, J_-|k, j, m\rangle - \hbar J_-|k, j, m\rangle \\
&= (m - 1)\hbar\, J_-|k, j, m\rangle
\end{aligned} \tag{C-24}$$

$J_-|k, j, m\rangle$ is therefore an eigenvector of $J_z$ with the eigenvalue $(m - 1)\hbar$.


γ. Lemma III (Properties of the vector $J_+|k, j, m\rangle$)

Let $|k, j, m\rangle$ be an eigenvector of $\mathbf{J}^2$ and $J_z$ with the eigenvalues $j(j + 1)\hbar^2$
and $m\hbar$.

(i) If $m = j$, $J_+|k, j, j\rangle = 0$.

(ii) If $m < j$, $J_+|k, j, m\rangle$ is a non-null eigenvector of $\mathbf{J}^2$ and $J_z$ with the
eigenvalues $j(j + 1)\hbar^2$ and $(m + 1)\hbar$.

(i) The argument is similar to that of §C-2-a-β. According to (C-14-a), the
square of the norm of $J_+|k, j, m\rangle$ is zero if $m = j$. Therefore:

$$m = j \;\Longrightarrow\; J_+|k, j, j\rangle = 0 \tag{C-25}$$

The converse can be proved in the same way:

$$J_+|k, j, m\rangle = 0 \;\Longleftrightarrow\; m = j \tag{C-26}$$

(ii) If $m < j$, an argument analogous to that of § β-(ii) yields, using formulas
(C-5) and (C-2):

$$\mathbf{J}^2 J_+|k, j, m\rangle = j(j + 1)\hbar^2\, J_+|k, j, m\rangle \tag{C-27}$$

$$J_z J_+|k, j, m\rangle = (m + 1)\hbar\, J_+|k, j, m\rangle \tag{C-28}$$


b. DETERMINATION OF THE SPECTRUM OF $\mathbf{J}^2$ AND $J_z$

We shall now show that the three lemmas above enable us to determine the
possible values of $j$ and $m$.

Let $|k, j, m\rangle$ be a non-null eigenvector of $\mathbf{J}^2$ and $J_z$ with the eigenvalues
$j(j + 1)\hbar^2$ and $m\hbar$. According to lemma I, $-j \leq m \leq j$. It is therefore
certain that a positive or zero integer $p$ exists such that:

$$-j \leq m - p < -j + 1 \tag{C-29}$$

Now consider the series of vectors:

$$|k, j, m\rangle,\; J_-|k, j, m\rangle,\; \ldots,\; (J_-)^p|k, j, m\rangle \tag{C-30}$$

According to lemma II, each of the vectors $(J_-)^n|k, j, m\rangle$ of this series ($n = 0, 1, \ldots, p$)
is a non-null eigenvector of $\mathbf{J}^2$ and $J_z$ with the eigenvalues $j(j + 1)\hbar^2$ and $(m - n)\hbar$.

The proof is by iteration. By hypothesis, $|k, j, m\rangle$ is non-null and corresponds
to the eigenvalues $j(j + 1)\hbar^2$ and $m\hbar$. $(J_-)^n|k, j, m\rangle$ is obtained by the
action of $J_-$ on $(J_-)^{n-1}|k, j, m\rangle$, which is an eigenvector of $\mathbf{J}^2$ and $J_z$ with the
eigenvalues $j(j + 1)\hbar^2$ and $(m - n + 1)\hbar$, the latter eigenvalue being necessarily
greater than $-j\hbar$, since, according to (C-29):

$$m - n + 1 \geq m - p + 1 \geq -j + 1 \tag{C-31}$$

It follows, according to point (ii) of lemma II, that $(J_-)^n|k, j, m\rangle$ is a non-null
eigenvector of $\mathbf{J}^2$ and $J_z$, the corresponding eigenvalues being $j(j + 1)\hbar^2$ and $(m - n)\hbar$.

Now let $J_-$ act on $(J_-)^p|k, j, m\rangle$. Let us first assume that the eigenvalue
$(m - p)\hbar$ of $J_z$ associated with $(J_-)^p|k, j, m\rangle$ is greater than $-j\hbar$, that is,
that:

$$m - p > -j \tag{C-32}$$

By point (ii) of lemma II, $J_-(J_-)^p|k, j, m\rangle$ is then non-null and corresponds to the
eigenvalues $j(j + 1)\hbar^2$ and $(m - p - 1)\hbar$. This is in contradiction with lemma I
since, according to (C-29):

$$m - p - 1 < -j \tag{C-33}$$
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We must therefore have m — p equal to — j. In this case, (J-) p \k,j, m} 
corresponds to the eigenvalue — j of J z , and, according to point (/) of lemma II, 
J-(J-) P | k,j, m} is zero. The vector series (C-30) obtained by the repeated action 
of J_ on | k,j,m} is therefore limited and the contradiction with lemma I is 
removed. 

We have now shown that there exists a positive or zero integer p such that: 


m - p = - j (C-34) 

A completely analogous argument, based on lemma III, would show that there 
exists a positive or zero integer q such that: 

m + q = j (C-35) 

since the vector series: 

$|k, j, m\rangle,\ J_+ |k, j, m\rangle,\ \ldots,\ (J_+)^q |k, j, m\rangle$ (C-36)

must be limited if there is to be no contradiction with lemma I. 

Combining (C-34) and (C-35), we obtain: 

$p + q = 2j$ (C-37)


$j$ is therefore equal to a positive or zero integer divided by 2. It follows that $j$
is necessarily integral or half-integral*. Furthermore, if there exists a non-null
vector $|k, j, m\rangle$, all the vectors of series (C-30) and (C-36) are also non-null
and eigenvectors of $J^2$ with the eigenvalue $j(j+1)\hbar^2$, as well as of $J_z$ with the
eigenvalues:

$-j\hbar,\ (-j+1)\hbar,\ (-j+2)\hbar,\ \ldots,\ (j-2)\hbar,\ (j-1)\hbar,\ j\hbar$ (C-38)

We summarize the results obtained above as follows: 


Let J be an arbitrary angular momentum, obeying the commutation
relations (B-9). If $j(j+1)\hbar^2$ and $m\hbar$ denote the eigenvalues of $J^2$ and $J_z$,
then:

- the only values possible for $j$ are positive integers or half-integers
or zero, that is: 0, 1/2, 1, 3/2, 2, ... (these values are the only ones possible,
but they are not all necessarily realized for all angular momenta).

- for a fixed value of $j$, the only values possible for $m$ are the $(2j+1)$
numbers: $-j, -j+1, \ldots, j-1, j$; $m$ is therefore integral if $j$ is integral
and half-integral if $j$ is half-integral. All these values of $m$ are realized if one
of them is.
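
As an illustration of the summary above, the following minimal Python sketch lists the $(2j+1)$ allowed values of $m$ for a given $j$, using exact fractions. The function name `allowed_m_values` is purely illustrative and does not come from the text.

```python
from fractions import Fraction


def allowed_m_values(j):
    """Return the 2j + 1 allowed values of m for a given j, as exact fractions.

    Illustrative helper (name is an assumption, not from the text).
    """
    j = Fraction(j)
    # j must be 0, 1/2, 1, 3/2, ...: equivalently, 2j is a non-negative integer.
    if j < 0 or (2 * j).denominator != 1:
        raise ValueError("j must be a non-negative integer or half-integer")
    # m runs from -j to +j in unit steps.
    return [-j + n for n in range(int(2 * j) + 1)]


if __name__ == "__main__":
    for j in (0, Fraction(1, 2), 1, Fraction(3, 2)):
        print("j =", j, "-> m in", [str(m) for m in allowed_m_values(j)])
```

For a half-integral $j$ every $m$ in the list is half-integral, and for an integral $j$ every $m$ is integral, in agreement with the statement above.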


3. "Standard" $\{ |k, j, m\rangle \}$ representations

We shall now study the eigenvectors common to $J^2$ and $J_z$, which form a basis
of the state space since $J^2$ and $J_z$ are, by hypothesis, observables.


* A number is said to be "half-integral" if it is equal to an odd number divided by 2.

a. THE BASIS STATES

Consider an angular momentum J acting in a state space $\mathcal{E}$. We shall show
how to construct an orthonormal basis in $\mathcal{E}$ composed of eigenvectors common
to $J^2$ and $J_z$.

Take a pair of eigenvalues, $j(j+1)\hbar^2$ and $m\hbar$, that are actually found
in the case we are considering. The set of eigenvectors associated with this pair
of eigenvalues forms a vector subspace of $\mathcal{E}$ which we shall denote by $\mathcal{E}(j, m)$; the
dimension $g(j, m)$ of this subspace may well be greater than 1, since $J^2$ and $J_z$ do not
generally constitute a C.S.C.O. We choose in $\mathcal{E}(j, m)$ an arbitrary orthonormal
basis, $\{ |k, j, m\rangle;\ k = 1, 2, \ldots, g(j, m) \}$.

If $m$ is not equal to $j$, there must exist another subspace $\mathcal{E}(j, m+1)$ in $\mathcal{E}$
composed of eigenvectors of $J^2$ and $J_z$ associated with the eigenvalues $j(j+1)\hbar^2$
and $(m+1)\hbar$. Similarly, if $m$ is not equal to $-j$, there exists a subspace $\mathcal{E}(j, m-1)$.
In the case where $m$ is not equal to $j$ or $-j$, we shall construct orthonormal bases
in $\mathcal{E}(j, m+1)$ and in $\mathcal{E}(j, m-1)$, starting with the one chosen in $\mathcal{E}(j, m)$.

First, let us show that, if $k_1$ is not equal to $k_2$, $J_+ |k_1, j, m\rangle$ and $J_+ |k_2, j, m\rangle$
are orthogonal, as are $J_- |k_1, j, m\rangle$ and $J_- |k_2, j, m\rangle$. We can find the scalar
product of $J_\pm |k_1, j, m\rangle$ and $J_\pm |k_2, j, m\rangle$ by using formulas (C-7):

$\langle k_2, j, m | J_\mp J_\pm | k_1, j, m\rangle = \langle k_2, j, m | (J^2 - J_z^2 \mp \hbar J_z) | k_1, j, m\rangle$

$= [j(j+1) - m(m \pm 1)]\,\hbar^2\, \langle k_2, j, m | k_1, j, m\rangle$ (C-39)

These scalar products are therefore zero if $k_1 \ne k_2$ since the basis of $\mathcal{E}(j, m)$ is
orthonormal; if $k_1 = k_2$, the square of the norm of $J_\pm |k_1, j, m\rangle$ is equal to:

$[j(j+1) - m(m \pm 1)]\,\hbar^2$

Now let us consider the set of the $g(j, m)$ vectors defined by:

$|k, j, m+1\rangle = \dfrac{1}{\hbar\sqrt{j(j+1) - m(m+1)}}\; J_+ |k, j, m\rangle$ (C-40)

Because of what we have just shown, these vectors are orthonormal. We shall
show that they constitute a basis in $\mathcal{E}(j, m+1)$. Assume that there exists,
in $\mathcal{E}(j, m+1)$, a vector $|\alpha, j, m+1\rangle$ orthogonal to all the $|k, j, m+1\rangle$ obtained
from (C-40). The vector $J_- |\alpha, j, m+1\rangle$ would not be null since $(m+1)$ cannot
be equal to $-j$; it would belong to $\mathcal{E}(j, m)$ and would be orthogonal to all
vectors $J_- |k, j, m+1\rangle$. Now, according to (C-40), $J_- |k, j, m+1\rangle$ is proportional
to $J_- J_+ |k, j, m\rangle$, that is, to $|k, j, m\rangle$ [formula (C-7-b)]. Therefore,
$J_- |\alpha, j, m+1\rangle$ would be a non-null vector of $\mathcal{E}(j, m)$ which would be orthogonal
to all vectors of the $\{ |k, j, m\rangle \}$ basis. But this is impossible. Consequently, the set
of vectors (C-40) constitutes a basis in $\mathcal{E}(j, m+1)$.

It can be shown, using a completely analogous argument, that the
vectors $|k, j, m-1\rangle$ defined by:

$|k, j, m-1\rangle = \dfrac{1}{\hbar\sqrt{j(j+1) - m(m-1)}}\; J_- |k, j, m\rangle$ (C-41)

form an orthonormal basis in $\mathcal{E}(j, m-1)$.

We see, in particular, that the dimension of subspaces $\mathcal{E}(j, m+1)$ and
$\mathcal{E}(j, m-1)$ is equal to that of $\mathcal{E}(j, m)$. In other words, this dimension is
independent of $m$*:

$g(j, m+1) = g(j, m-1) = g(j, m) = g(j)$ (C-42)

We then proceed as follows. For each value of $j$ actually found in the
problem under consideration, we choose one of the subspaces associated with
this value of $j$, for example, $\mathcal{E}(j, j)$, which corresponds to $m = j$. In this subspace,
we choose an arbitrary orthonormal basis, $\{ |k, j, j\rangle;\ k = 1, 2, \ldots, g(j) \}$. Then,
using formula (C-41), we construct, by iteration, the basis to which each of the
other $2j$ subspaces $\mathcal{E}(j, m)$ will be related: the arrows of table (VI-1) indicate the
method used. By treating all the values of $j$ found in the problem in this way, we


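Table VI.1

Schematic representation of the construction of the $(2j+1)\,g(j)$ vectors of a "standard basis" associated
with a fixed value of $j$. Starting with each of the $g(j)$ vectors $|k, j, j\rangle$ of the first line, one uses the action of
$J_-$ to construct the $(2j+1)$ vectors of the corresponding column.

Each subspace $\mathcal{E}(j, m)$ is spanned by the $g(j)$ vectors situated in the same row. Each subspace $\mathcal{E}(k, j)$
is spanned by the $(2j+1)$ vectors of the corresponding column.

$$
\begin{array}{l|cccc}
 & k = 1 & k = 2 & \cdots & k = g(j) \\ \hline
\mathcal{E}(j, m = j) & |1, j, j\rangle & |2, j, j\rangle & \cdots & |g(j), j, j\rangle \\
 & \downarrow J_- & \downarrow J_- & & \downarrow J_- \\
\mathcal{E}(j, m = j-1) & |1, j, j-1\rangle & |2, j, j-1\rangle & \cdots & |g(j), j, j-1\rangle \\
 & \downarrow J_- & \downarrow J_- & & \downarrow J_- \\
\vdots & \vdots & \vdots & & \vdots \\
\mathcal{E}(j, m) & |1, j, m\rangle & |2, j, m\rangle & \cdots & |g(j), j, m\rangle \\
 & \downarrow J_- & \downarrow J_- & & \downarrow J_- \\
\vdots & \vdots & \vdots & & \vdots \\
\mathcal{E}(j, m = -j) & |1, j, -j\rangle & |2, j, -j\rangle & \cdots & |g(j), j, -j\rangle \\ \hline
 & \mathcal{E}(k = 1, j) & \mathcal{E}(k = 2, j) & \cdots & \mathcal{E}(k = g(j), j)
\end{array}
$$

The columns correspond to the $g(j)$ different values of $k$ [the $g(j)$ spaces $\mathcal{E}(k, j)$]; the rows correspond to the $(2j+1)$ spaces $\mathcal{E}(j, m)$.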

* If this dimension is infinite, the result must be interpreted as follows: there is a one-to-one
correspondence between the basis vectors of two subspaces corresponding to the same value of $j$.

arrive at what is called a standard basis of the state space $\mathcal{E}$. The orthonormalization
and closure relations for such a basis are:

$\langle k, j, m | k', j', m'\rangle = \delta_{kk'}\, \delta_{jj'}\, \delta_{mm'}$ (C-43-a)

$\sum_{j} \sum_{m=-j}^{+j} \sum_{k=1}^{g(j)} |k, j, m\rangle \langle k, j, m| = 1$ (C-43-b)

COMMENTS:

(i) The use of formula (C-41) implies a choice of phases: the basis vectors in
$\mathcal{E}(j, m-1)$ are chosen to be proportional, with a real and positive coefficient,
to the vectors obtained by application of $J_-$ to the basis of $\mathcal{E}(j, m)$.

(ii) Formulas (C-40) and (C-41) are compatible, since, if we apply $J_+$ to both
sides of (C-41) and take (C-7-a) into account, we find (C-40) [with $m$ replaced
by $(m-1)$]. This means that one is not obliged to start, as we did, with the
maximum value $m = j$ and (C-41) in order to construct bases of the
subspaces $\mathcal{E}(j, m)$ corresponding to a given value of $j$.
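
The compatibility stated in comment (ii) can be written out explicitly. The short derivation below is a sketch that assumes the standard identity $J_+ J_- = J^2 - J_z^2 + \hbar J_z$ for the product appearing when $J_+$ is applied to (C-41):

```latex
\begin{aligned}
J_+ |k, j, m-1\rangle
  &= \frac{1}{\hbar\sqrt{j(j+1) - m(m-1)}}\; J_+ J_- |k, j, m\rangle \\
  &= \frac{\left[\, j(j+1) - m^2 + m \,\right]\hbar^2}{\hbar\sqrt{j(j+1) - m(m-1)}}\; |k, j, m\rangle \\
  &= \hbar\sqrt{j(j+1) - (m-1)m}\; |k, j, m\rangle .
\end{aligned}
```

Dividing by the square root gives back the relation between $|k, j, m\rangle$ and $J_+ |k, j, m-1\rangle$ that (C-40) yields when $m$ is replaced by $(m-1)$.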


In most cases, in order to define a standard basis, one uses observables $A$, $B$, ... which
commute with the three components of J* and form a C.S.C.O. with $J^2$ and $J_z$ (we shall see
a concrete example of this in §A of chapter VII):

$[A, \mathbf{J}] = [B, \mathbf{J}] = \ldots = 0$ (C-44)

For the sake of simplicity, we shall assume that only one of these observables $A$ is
required to make a C.S.C.O. with $J^2$ and $J_z$. Under these conditions, each of the subspaces $\mathcal{E}(j, m)$
defined above is globally invariant under the action of $A$: if $|\psi_{j,m}\rangle$ is an arbitrary vector
of $\mathcal{E}(j, m)$, $A |\psi_{j,m}\rangle$ is still, according to (C-44), an eigenvector of $J^2$ and $J_z$:

$J^2 A |\psi_{j,m}\rangle = A J^2 |\psi_{j,m}\rangle = j(j+1)\hbar^2\, A |\psi_{j,m}\rangle$

$J_z A |\psi_{j,m}\rangle = A J_z |\psi_{j,m}\rangle = m\hbar\, A |\psi_{j,m}\rangle$ (C-45)

with the same eigenvalues as $|\psi_{j,m}\rangle$. Thus $A |\psi_{j,m}\rangle$ also belongs to $\mathcal{E}(j, m)$. If we then choose
a value of $j$, we can diagonalize $A$ inside the corresponding subspace $\mathcal{E}(j, j)$. We denote by $a_{k,j}$ the
various eigenvalues found in this way: the index $j$ indicates in which space $\mathcal{E}(j, j)$ they were found,
and the index $k$ (assumed to be discrete, for simplicity) distinguishes between them. A single
vector (written $|k, j, j\rangle$) of $\mathcal{E}(j, j)$ is associated with each eigenvalue $a_{k,j}$, since $A$, $J^2$ and $J_z$
form, by hypothesis, a C.S.C.O.:

$A |k, j, j\rangle = a_{k,j} |k, j, j\rangle$ (C-46)

The set $\{ |k, j, j\rangle;\ j \text{ fixed};\ k = 1, 2, \ldots, g(j) \}$ constitutes an orthonormal basis in $\mathcal{E}(j, j)$,
from which we construct, using the method described above, a basis in the other subspaces $\mathcal{E}(j, m)$
related to the value of $j$ chosen. By applying this procedure successively for all values of $j$,
we arrive at a "standard" basis, $\{ |k, j, m\rangle \}$, of the state space, all of whose vectors are
eigenstates, not only of $J^2$ and $J_z$, but also of $A$:

$A |k, j, m\rangle = a_{k,j} |k, j, m\rangle$ (C-47)


* An operator which commutes with the three components of the total angular momentum of a
physical system is said to be "scalar" (cf. complement B_VI).

This can be shown as follows. If hypothesis (C-44) is satisfied, $A$ commutes with $J_-$, which
means that $J_- |k, j, j\rangle$, that is, $|k, j, j-1\rangle$, is an eigenvector of $A$ with the same eigenvalue
as $|k, j, j\rangle$:

$A J_- |k, j, j\rangle = J_- A |k, j, j\rangle = a_{k,j}\, J_- |k, j, j\rangle$ (C-48)

By repeating this process, it is easy to prove relation (C-47).

COMMENTS:

(i) An observable which commutes with $J^2$ and $J_z$ does not necessarily commute with $J_x$
and $J_y$ ($J_z$ is itself an example). Consequently, it should not be necessary, in order to
form a C.S.C.O. with $J^2$ and $J_z$, to choose observables which commute with the three
components of J as in (C-44). However, if $A$ did not commute with $J_+$ and $J_-$ (that is,
with $J_x$ and $J_y$), $J_\pm |k, j, m\rangle$ would not necessarily be an eigenvector of $A$ with the same
eigenvalue as $|k, j, m\rangle$.

(ii) The spectrum of $A$ is the same in all the subspaces $\mathcal{E}(j, m)$ associated with the same
value of $j$. However, the eigenvalues $a_{k,j}$ generally depend on $j$ (this point will be
illustrated by concrete examples in §§ A and C of chapter VII).


b. THE SPACES $\mathcal{E}(k, j)$

In the preceding section, we introduced a "standard basis" of the state
space by starting with a basis chosen in the subspace $\mathcal{E}(j, m = j)$ and constructing
a basis of $\mathcal{E}(j, m = j-1)$, then one of $\mathcal{E}(j, m = j-2)$, ..., $\mathcal{E}(j, m)$, etc. The state
space can be considered to be the direct sum of all the orthogonal subspaces $\mathcal{E}(j, m)$,
where $m$ varies by integral jumps from $-j$ to $+j$ and $j$ takes on all the values
actually found in the problem. This means that any vector of $\mathcal{E}$ can be written
in one and only one way as a sum of vectors, each belonging to a particular
subspace $\mathcal{E}(j, m)$.

Nevertheless, the use of the subspaces $\mathcal{E}(j, m)$ presents certain disadvantages.
First of all, their dimension $g(j)$ depends on the physical system being considered
and is not necessarily known. In addition, the subspaces $\mathcal{E}(j, m)$ are not invariant
under the action of J since, by the very means of construction of the vectors $|k, j, m\rangle$,
$J_+$ and $J_-$ have non-zero matrix elements between vectors of $\mathcal{E}(j, m)$ and those
of $\mathcal{E}(j, m \pm 1)$.

We shall therefore introduce other subspaces of $\mathcal{E}$, the spaces $\mathcal{E}(k, j)$. Instead
of grouping together the kets $|k, j, m\rangle$ with fixed indices $j$ and $m$ [which span $\mathcal{E}(j, m)$],
we shall now group together those for which $k$ and $j$ have given values, and we shall
call $\mathcal{E}(k, j)$ the subspace which they span. This amounts to associating, in
table (VI-1), the $(2j+1)$ vectors of one column [instead of the $g(j)$ vectors of
one row].

$\mathcal{E}$ can then be seen to be the direct sum of the orthogonal subspaces $\mathcal{E}(k, j)$,
which have the simpler properties:

- the dimension of $\mathcal{E}(k, j)$ is $(2j+1)$, whatever the value of $k$ and whatever
the physical system under consideration.

- $\mathcal{E}(k, j)$ is globally invariant under the action of J: any component $J_u$
of J [or a function $F(\mathbf{J})$ of J], acting on a ket of $\mathcal{E}(k, j)$, yields another ket also
belonging to $\mathcal{E}(k, j)$*. This result is not difficult to establish, since $J_u$ [or $F(\mathbf{J})$] can
always be expressed in terms of $J_z$, $J_+$ and $J_-$. Now, $J_z$, acting on $|k, j, m\rangle$,
yields a ket proportional to $|k, j, m\rangle$; $J_+$, a ket proportional to $|k, j, m+1\rangle$;
and $J_-$, a ket proportional to $|k, j, m-1\rangle$. The existence of the property in
question therefore follows from the very means of construction of the "standard
basis" $\{ |k, j, m\rangle \}$.

c. MATRICES REPRESENTING THE ANGULAR MOMENTUM OPERATORS

Using the subspaces $\mathcal{E}(k, j)$ considerably simplifies the search for the matrix
which represents, in a "standard" basis, a component $J_u$ of J [or an arbitrary
function $F(\mathbf{J})$]. The matrix elements between two basis kets belonging to two
different subspaces $\mathcal{E}(k, j)$ are zero. The matrix therefore has the following form:

$$
\begin{array}{c|cccc}
 & \mathcal{E}(k, j) & \mathcal{E}(k', j) & \mathcal{E}(k', j') & \cdots \\ \hline
\mathcal{E}(k, j) & \text{matrix } (2j+1)\times(2j+1) & 0 & 0 & 0 \\
\mathcal{E}(k', j) & 0 & \text{matrix } (2j+1)\times(2j+1) & 0 & 0 \\
\mathcal{E}(k', j') & 0 & 0 & \text{matrix } (2j'+1)\times(2j'+1) & 0 \\
\vdots & 0 & 0 & 0 & \ddots
\end{array}
$$
(C-49)

All we must then do is calculate the finite-dimensional matrices which represent the
operator under consideration inside each of the subspaces $\mathcal{E}(k, j)$.

Another very important simplification arises from the fact that each of these
finite submatrices is independent of $k$ and of the physical system under study; it
depends only on $j$ and, of course, on the operator which we want to represent.
To see this, note that the definition of the $|k, j, m\rangle$ [cf. (C-12), (C-40) and (C-41)]
implies that:

$J_z |k, j, m\rangle = m\hbar\, |k, j, m\rangle$

$J_+ |k, j, m\rangle = \hbar\sqrt{j(j+1) - m(m+1)}\; |k, j, m+1\rangle$

$J_- |k, j, m\rangle = \hbar\sqrt{j(j+1) - m(m-1)}\; |k, j, m-1\rangle$ (C-50)

* It can also easily be shown that $\mathcal{E}(k, j)$ is "irreducible" with respect to J: there exists no
subspace of $\mathcal{E}(k, j)$ other than $\mathcal{E}(k, j)$ itself which is globally invariant under the action of the various
components of J.

that is:

$\langle k, j, m | J_z | k', j', m'\rangle = m\hbar\; \delta_{kk'}\, \delta_{jj'}\, \delta_{mm'}$

$\langle k, j, m | J_\pm | k', j', m'\rangle = \hbar\sqrt{j(j+1) - m'(m' \pm 1)}\; \delta_{kk'}\, \delta_{jj'}\, \delta_{m, m' \pm 1}$ (C-51)

These relations show that the matrix elements representing the components of J
depend only on $j$ and $m$ and not on $k$.

In order to know, in all cases, the matrix associated with an arbitrary component
$J_u$ in a standard basis, all we need to do, therefore, is calculate, once and for
all, the "universal" matrices $(J_u)^{(j)}$ which represent $J_u$ inside the subspaces $\mathcal{E}(k, j)$
for all possible values of $j$ ($j = 0, 1/2, 1, 3/2, \ldots$). When we study a particular
physical system and its angular momentum J, we shall determine the values of $j$
actually found in the problem, as well as the number of subspaces $\mathcal{E}(k, j)$ associated
with each of them [that is, its degree of degeneracy $(2j+1)\,g(j)$]. We know that
the matrix representing $J_u$ in this particular case has the "block-diagonal"
form (C-49), and we can therefore construct it from the "universal" matrices
which we have just defined: for each value of $j$, we shall have $g(j)$ "blocks"
identical to $(J_u)^{(j)}$.
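
The "universal" matrices can be generated mechanically from (C-51). The following is a minimal sketch in plain Python, in units where $\hbar = 1$ and with the basis ordered as $m = j, j-1, \ldots, -j$; the function names are illustrative assumptions, not notation from the text.

```python
from fractions import Fraction
from math import sqrt


def m_values(j):
    """Basis ordering m = j, j-1, ..., -j (illustrative helper, hbar = 1)."""
    j = Fraction(j)
    return [j - n for n in range(int(2 * j) + 1)]


def jz_matrix(j):
    """Diagonal matrix of J_z: <m|J_z|m'> = m * delta(m, m')."""
    ms = m_values(j)
    return [[float(m) if r == c else 0.0 for c in range(len(ms))]
            for r, m in enumerate(ms)]


def jplus_matrix(j):
    """<m|J_+|m'> = sqrt(j(j+1) - m'(m'+1)) if m = m' + 1, else 0, as in (C-51)."""
    j = Fraction(j)
    ms = m_values(j)
    return [[sqrt(j * (j + 1) - mp * (mp + 1)) if m == mp + 1 else 0.0
             for mp in ms] for m in ms]


def jminus_matrix(j):
    """J_- is the adjoint of J_+; the matrix is real, so a transpose suffices."""
    jp = jplus_matrix(j)
    n = len(jp)
    return [[jp[c][r] for c in range(n)] for r in range(n)]


def jx_matrix(j):
    """J_x = (J_+ + J_-) / 2."""
    jp, jm = jplus_matrix(j), jminus_matrix(j)
    n = len(jp)
    return [[(jp[r][c] + jm[r][c]) / 2 for c in range(n)] for r in range(n)]


def jy_matrix(j):
    """J_y = (J_+ - J_-) / (2i)."""
    jp, jm = jplus_matrix(j), jminus_matrix(j)
    n = len(jp)
    return [[(jp[r][c] - jm[r][c]) / 2j for c in range(n)] for r in range(n)]


if __name__ == "__main__":
    for row in jx_matrix(1):
        print(row)
```

Because the matrix elements depend only on $j$ and $m$, the same functions serve for every value of $k$ and for every physical system.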

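Let us give some examples of $(J_u)^{(j)}$ matrices:

(i) $j = 0$

The subspaces $\mathcal{E}(k, j = 0)$ are one-dimensional, since zero is the only possible
value for $m$. The $(J_u)^{(0)}$ matrices therefore reduce to numbers, which, according
to (C-51), are zero.

(ii) $j = 1/2$

The subspaces $\mathcal{E}(k, j = 1/2)$ are two-dimensional ($m = 1/2$ or $-1/2$). If we
choose the basis vectors in this order ($m = 1/2$, $m = -1/2$), we find, using (C-51):

$(J_z)^{(1/2)} = \dfrac{\hbar}{2} \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}$ (C-52)

and:

$(J_+)^{(1/2)} = \hbar \begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix} \qquad (J_-)^{(1/2)} = \hbar \begin{pmatrix} 0 & 0 \\ 1 & 0 \end{pmatrix}$ (C-53)

that is, using (C-1):

$(J_x)^{(1/2)} = \dfrac{\hbar}{2} \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} \qquad (J_y)^{(1/2)} = \dfrac{\hbar}{2} \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}$ (C-54)

The matrix representing $J^2$ is therefore:

$(J^2)^{(1/2)} = \dfrac{3}{4}\hbar^2 \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}$ (C-55)
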

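We thus find the matrices which were introduced without justification in chapter IV,
§A-2.

(iii) $j = 1$

We now have (order of the basis vectors: $m = 1$, $m = 0$, $m = -1$):

$(J_z)^{(1)} = \hbar \begin{pmatrix} 1 & 0 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & -1 \end{pmatrix}$ (C-56)

$(J_+)^{(1)} = \hbar \begin{pmatrix} 0 & \sqrt{2} & 0 \\ 0 & 0 & \sqrt{2} \\ 0 & 0 & 0 \end{pmatrix} \qquad (J_-)^{(1)} = \hbar \begin{pmatrix} 0 & 0 & 0 \\ \sqrt{2} & 0 & 0 \\ 0 & \sqrt{2} & 0 \end{pmatrix}$ (C-57)

and therefore:

$(J_x)^{(1)} = \dfrac{\hbar}{\sqrt{2}} \begin{pmatrix} 0 & 1 & 0 \\ 1 & 0 & 1 \\ 0 & 1 & 0 \end{pmatrix} \qquad (J_y)^{(1)} = \dfrac{\hbar}{\sqrt{2}} \begin{pmatrix} 0 & -i & 0 \\ i & 0 & -i \\ 0 & i & 0 \end{pmatrix}$ (C-58)

and:

$(J^2)^{(1)} = 2\hbar^2 \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}$ (C-59)

COMMENT:

It can be verified that matrices (C-56) and (C-58) satisfy the commutation
relations (B-9).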
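
The verification mentioned in this comment can be sketched numerically. The plain-Python snippet below hard-codes the $j = 1$ matrices of (C-56) and (C-58) in units where $\hbar = 1$ and checks the three commutators, assuming (B-9) has the usual form $[J_x, J_y] = i\hbar J_z$ and its cyclic permutations. The helper names are illustrative.

```python
from math import sqrt


def matmul(a, b):
    """Product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def commutator(a, b):
    """[a, b] = ab - ba."""
    ab, ba = matmul(a, b), matmul(b, a)
    n = len(a)
    return [[ab[i][j] - ba[i][j] for j in range(n)] for i in range(n)]


def scale(c, a):
    """Multiply every entry of a by the scalar c."""
    return [[c * x for x in row] for row in a]


def close(a, b, tol=1e-12):
    """Entry-wise comparison within a tolerance."""
    n = len(a)
    return all(abs(a[i][j] - b[i][j]) < tol for i in range(n) for j in range(n))


s = 1 / sqrt(2)
# j = 1 matrices in the basis ordered m = 1, 0, -1, with hbar = 1.
JX = [[0, s, 0], [s, 0, s], [0, s, 0]]
JY = [[0, -1j * s, 0], [1j * s, 0, -1j * s], [0, 1j * s, 0]]
JZ = [[1, 0, 0], [0, 0, 0], [0, 0, -1]]

checks = {
    "[Jx, Jy] = i Jz": close(commutator(JX, JY), scale(1j, JZ)),
    "[Jy, Jz] = i Jx": close(commutator(JY, JZ), scale(1j, JX)),
    "[Jz, Jx] = i Jy": close(commutator(JZ, JX), scale(1j, JY)),
}

if __name__ == "__main__":
    for name, ok in checks.items():
        print(name, "holds" if ok else "fails")
```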

(iv) $j$ arbitrary

We use relations (C-51), which, according to (C-1), can also be written:

$\langle k, j, m | J_x | k', j', m'\rangle = \dfrac{\hbar}{2}\, \delta_{kk'}\, \delta_{jj'} \left[ \sqrt{j(j+1) - m'(m'+1)}\; \delta_{m, m'+1} + \sqrt{j(j+1) - m'(m'-1)}\; \delta_{m, m'-1} \right]$ (C-60)

and:

$\langle k, j, m | J_y | k', j', m'\rangle = \dfrac{\hbar}{2i}\, \delta_{kk'}\, \delta_{jj'} \left[ \sqrt{j(j+1) - m'(m'+1)}\; \delta_{m, m'+1} - \sqrt{j(j+1) - m'(m'-1)}\; \delta_{m, m'-1} \right]$ (C-61)

The matrix $(J_z)^{(j)}$ is therefore diagonal and its elements are the $(2j+1)$ values
of $m\hbar$. The only non-zero matrix elements of $(J_x)^{(j)}$ and $(J_y)^{(j)}$ are those directly
above and directly below the diagonal: $(J_x)^{(j)}$ is symmetrical and real, and $(J_y)^{(j)}$ is
antisymmetrical and pure imaginary.

Since the kets $|k, j, m\rangle$ are, by construction, eigenvectors of $J^2$, we have:

$\langle k, j, m | J^2 | k', j', m'\rangle = j(j+1)\hbar^2\; \delta_{kk'}\, \delta_{jj'}\, \delta_{mm'}$ (C-62)

The matrix $(J^2)^{(j)}$ is therefore proportional to the $(2j+1) \times (2j+1)$ unit matrix:
its diagonal elements are all equal to $j(j+1)\hbar^2$.

COMMENT:

The $Oz$ axis which we have chosen as the "quantization axis" is entirely
arbitrary. All directions in space are physically equivalent, and we should
expect the eigenvalues of $J_x$ or $J_y$ to be the same as those of $J_z$ (their
eigenvectors, however, are different, since $J_x$ and $J_y$ do not commute with $J_z$).
It can indeed be verified that the eigenvalues of the $(J_x)^{(1/2)}$ and
$(J_y)^{(1/2)}$ matrices [formulas (C-54)] are $\pm\hbar/2$ and that those of the $(J_x)^{(1)}$
and $(J_y)^{(1)}$ matrices [formulas (C-58)] are $+\hbar$, $0$, $-\hbar$. This result
is general: inside a given subspace $\mathcal{E}(k, j)$, the eigenvalues of $J_x$ and $J_y$
(like those of the component $J_u = \mathbf{J} \cdot \mathbf{u}$ of J along an arbitrary unit vector u)
are $j\hbar, (j-1)\hbar, \ldots, (-j+1)\hbar, -j\hbar$. The corresponding eigenvectors
(eigenvectors common to $J^2$ and $J_x$, $J^2$ and $J_y$, or $J^2$ and $J_u$) are linear combinations
of the $|k, j, m\rangle$ with $k$ and $j$ fixed.
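
As a worked check of the eigenvalues quoted in this comment, the characteristic polynomials of the $J_x$ matrices of (C-54) and (C-58) can be written out directly (a sketch; the $J_y$ matrices give the same polynomials):

```latex
\begin{aligned}
\det\!\left[(J_x)^{(1/2)} - \lambda\right]
  &= \begin{vmatrix} -\lambda & \hbar/2 \\ \hbar/2 & -\lambda \end{vmatrix}
   = \lambda^2 - \frac{\hbar^2}{4} = 0
  &&\Longrightarrow\quad \lambda = \pm\frac{\hbar}{2}, \\[4pt]
\det\!\left[(J_x)^{(1)} - \lambda\right]
  &= \begin{vmatrix}
       -\lambda & \hbar/\sqrt{2} & 0 \\
       \hbar/\sqrt{2} & -\lambda & \hbar/\sqrt{2} \\
       0 & \hbar/\sqrt{2} & -\lambda
     \end{vmatrix}
   = -\lambda\left(\lambda^2 - \hbar^2\right) = 0
  &&\Longrightarrow\quad \lambda = +\hbar,\ 0,\ -\hbar .
\end{aligned}
```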

To conclude this section devoted to "standard" representations, we summarize:

An orthonormal basis $\{ |k, j, m\rangle \}$ of the state space, composed of eigenvectors
common to $J^2$ and $J_z$:

$J^2 |k, j, m\rangle = j(j+1)\hbar^2\, |k, j, m\rangle$

$J_z |k, j, m\rangle = m\hbar\, |k, j, m\rangle$

is called a "standard basis" if the action of the operators $J_+$ and $J_-$ on the
basis vectors is given by:

$J_+ |k, j, m\rangle = \hbar\sqrt{j(j+1) - m(m+1)}\; |k, j, m+1\rangle$

$J_- |k, j, m\rangle = \hbar\sqrt{j(j+1) - m(m-1)}\; |k, j, m-1\rangle$


D. APPLICATION TO ORBITAL ANGULAR MOMENTUM

In §C, we studied the general properties of angular momenta, derived uniquely
from the commutation relations (B-9). We shall now return to the orbital angular
momentum L of a spinless particle [formula (B-3)] and see how the general
theory just developed applies to this particular case. Using the $\{ |\mathbf{r}\rangle \}$ representation,
we shall show that the eigenvalues of the operator $L^2$ are the numbers
$l(l+1)\hbar^2$ corresponding to all positive integral or zero $l$: of the possible values
for $j$ found in §C-2-b, the only ones allowed in this case are the integral values,
all of which are present. Then we shall indicate the eigenfunctions common to $L^2$
and $L_z$ and their principal properties. Finally, we shall study these eigenstates
from a physical point of view.

1. Eigenvalues and eigenfunctions of $L^2$ and $L_z$

a. EIGENVALUE EQUATION IN THE $\{ |\mathbf{r}\rangle \}$ REPRESENTATION

In the $\{ |\mathbf{r}\rangle \}$ representation, the observables R and P correspond respectively
to multiplication by r and to the differential operator $\dfrac{\hbar}{i}\nabla$. The three
components of the angular momentum L can then be written:

$L_x = \dfrac{\hbar}{i}\left( y \dfrac{\partial}{\partial z} - z \dfrac{\partial}{\partial y} \right)$ (D-1-a)

$L_y = \dfrac{\hbar}{i}\left( z \dfrac{\partial}{\partial x} - x \dfrac{\partial}{\partial z} \right)$ (D-1-b)

$L_z = \dfrac{\hbar}{i}\left( x \dfrac{\partial}{\partial y} - y \dfrac{\partial}{\partial x} \right)$ (D-1-c)

It is more convenient to work in spherical (or polar) coordinates, since, as
we shall see, the various angular momentum operators act only on the angular
variables $\theta$ and $\varphi$, and not on the variable $r$. Instead of characterizing the vector r
by its Cartesian components $x$, $y$, $z$, we label the corresponding point $M$ in space
($\mathbf{OM} = \mathbf{r}$) by its spherical coordinates $r$, $\theta$, $\varphi$ (fig. 1):

$x = r \sin\theta \cos\varphi$
$y = r \sin\theta \sin\varphi$
$z = r \cos\theta$ (D-2)

with:

$r \ge 0, \qquad 0 \le \theta \le \pi, \qquad 0 \le \varphi < 2\pi$

The volume element $d^3r = dx\, dy\, dz$ is written in spherical coordinates:

$d^3r = r^2 \sin\theta\; dr\, d\theta\, d\varphi = r^2\, dr\, d\Omega$ (D-3)

where:

$d\Omega = \sin\theta\; d\theta\, d\varphi$ (D-4)

is the solid angle element about the direction of polar angles $\theta$ and $\varphi$.
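
As a quick sanity check on (D-4), integrating $d\Omega$ over all directions should give the full solid angle $4\pi$. The sketch below does this numerically with a midpoint rule; the function name and grid size are arbitrary illustrative choices.

```python
from math import sin, pi


def total_solid_angle(n_theta=2000):
    """Midpoint-rule estimate of the integral of sin(theta) dtheta dphi.

    The phi integral is trivial (the integrand does not depend on phi),
    so it contributes a factor 2*pi.
    """
    d_theta = pi / n_theta
    theta_integral = sum(sin((i + 0.5) * d_theta) for i in range(n_theta)) * d_theta
    return 2 * pi * theta_integral


if __name__ == "__main__":
    print(total_solid_angle(), "vs", 4 * pi)
```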

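Applying the classical technique of changing variables, we obtain, from
formulas (D-1) and (D-2), the following expressions (the calculations are rather
time-consuming but pose no great problems):

$L_x = i\hbar \left( \sin\varphi\, \dfrac{\partial}{\partial\theta} + \dfrac{\cos\varphi}{\tan\theta}\, \dfrac{\partial}{\partial\varphi} \right)$ (D-5-a)

$L_y = i\hbar \left( -\cos\varphi\, \dfrac{\partial}{\partial\theta} + \dfrac{\sin\varphi}{\tan\theta}\, \dfrac{\partial}{\partial\varphi} \right)$ (D-5-b)

$L_z = \dfrac{\hbar}{i}\, \dfrac{\partial}{\partial\varphi}$ (D-5-c)

which yield:

$L^2 = -\hbar^2 \left( \dfrac{\partial^2}{\partial\theta^2} + \dfrac{1}{\tan\theta}\, \dfrac{\partial}{\partial\theta} + \dfrac{1}{\sin^2\theta}\, \dfrac{\partial^2}{\partial\varphi^2} \right)$ (D-6-a)

$L_+ = \hbar\, e^{i\varphi} \left( \dfrac{\partial}{\partial\theta} + i \cot\theta\, \dfrac{\partial}{\partial\varphi} \right)$ (D-6-b)

$L_- = \hbar\, e^{-i\varphi} \left( -\dfrac{\partial}{\partial\theta} + i \cot\theta\, \dfrac{\partial}{\partial\varphi} \right)$ (D-6-c)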

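In the $\{ |\mathbf{r}\rangle \}$ representation, the eigenfunctions associated with the eigenvalues
$l(l+1)\hbar^2$ of $L^2$ and $m\hbar$ of $L_z$ are the solutions of the partial differential
equations:

$-\left( \dfrac{\partial^2}{\partial\theta^2} + \dfrac{1}{\tan\theta}\, \dfrac{\partial}{\partial\theta} + \dfrac{1}{\sin^2\theta}\, \dfrac{\partial^2}{\partial\varphi^2} \right) \psi(r, \theta, \varphi) = l(l+1)\, \psi(r, \theta, \varphi)$ (D-7-a)

$-i\, \dfrac{\partial}{\partial\varphi}\, \psi(r, \theta, \varphi) = m\, \psi(r, \theta, \varphi)$ (D-7-b)

Since the general results of §C are applicable to the orbital angular momentum,
we already know that $l$ is integral or half-integral and that, for fixed $l$, $m$ can
take on only the values $-l, -l+1, \ldots, l-1, l$.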

In equations (D-7), $r$ does not appear in any differential operator, so we can
consider it to be a parameter and take into account only the $\theta$- and $\varphi$-dependence
of $\psi$. Thus, we denote by $Y_l^m(\theta, \varphi)$ a common eigenfunction of $L^2$ and $L_z$ which
corresponds to the eigenvalues $l(l+1)\hbar^2$ and $m\hbar$:

$L^2\, Y_l^m(\theta, \varphi) = l(l+1)\hbar^2\, Y_l^m(\theta, \varphi)$ (D-8-a)

$L_z\, Y_l^m(\theta, \varphi) = m\hbar\, Y_l^m(\theta, \varphi)$ (D-8-b)

To be completely rigorous, we would have to introduce an additional index
in order to distinguish between the various solutions of (D-8) which correspond
to the same pair of values of $l$ and $m$. In fact, as we shall see further on, these
equations have only one solution (to within a constant factor) for each pair of
allowed values of $l$ and $m$; this is why the indices $l$ and $m$ are sufficient.

COMMENTS:

(i) Equations (D-8) give the $\theta$- and $\varphi$-dependence of the eigenfunctions of $L^2$
and $L_z$. Once the solution $Y_l^m(\theta, \varphi)$ of these equations has been found,
these eigenfunctions will be obtained in the form:

$\psi_{l,m}(r, \theta, \varphi) = f(r)\, Y_l^m(\theta, \varphi)$ (D-9)

where $f(r)$ is a function of $r$* which appears as an integration constant
for the partial differential equations (D-7). The fact that $f(r)$ is arbitrary
shows that $L^2$ and $L_z$ do not form a C.S.C.O. in the space $\mathcal{E}_{\mathbf{r}}$ of functions
of r (or of $r$, $\theta$, $\varphi$).

(ii) In order to normalize $\psi_{l,m}(r, \theta, \varphi)$, it is convenient to normalize $Y_l^m(\theta, \varphi)$
and $f(r)$ separately (as we shall do here). We then have, taking (D-4) into
account:

$\displaystyle\int_0^{2\pi} d\varphi \int_0^{\pi} \sin\theta\, |Y_l^m(\theta, \varphi)|^2\, d\theta = 1$ (D-10)

$\displaystyle\int_0^{\infty} r^2\, |f(r)|^2\, dr = 1$ (D-11)

b. VALUES OF $l$ AND $m$

α. $l$ and $m$ must be integral

Using expression (D-5-c) for $L_z$, we can write (D-8-b) in the form:

$\dfrac{\hbar}{i}\, \dfrac{\partial}{\partial\varphi}\, Y_l^m(\theta, \varphi) = m\hbar\, Y_l^m(\theta, \varphi)$ (D-12)

which shows that $Y_l^m(\theta, \varphi)$ is equal to:

$Y_l^m(\theta, \varphi) = F_l^m(\theta)\, e^{im\varphi}$ (D-13)

We can cover all space by letting $\varphi$ vary between 0 and $2\pi$. Since a wave
function must be continuous at all points in space**, we must have, in particular:

$Y_l^m(\theta, \varphi = 0) = Y_l^m(\theta, \varphi = 2\pi)$ (D-14)

which implies that:

$e^{2im\pi} = 1$ (D-15)

* $f(r)$ must be such that $\psi_{l,m}(r, \theta, \varphi)$ is square-integrable.

** If $Y_l^m(\theta, \varphi)$ were not continuous for $\varphi = 0$, it would not be differentiable and could not be
an eigenfunction of the differential operators (D-5-c) and (D-6-a). For example, $\dfrac{\partial}{\partial\varphi} Y_l^m(\theta, \varphi)$ would
produce a function $\delta(\varphi)$, which is incompatible with (D-12).

According to the results of §C, $m$ is integral or half-integral. Relation (D-15)
shows that, in the case of an orbital angular momentum, $m$ must be an integer
($e^{2im\pi}$ would be equal to $-1$ if $m$ were half-integral). But we know that $m$ and $l$
are either both integral or both half-integral: it follows that $l$ must also be an
integer.
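
The single-valuedness condition (D-15) is easy to check numerically. The sketch below evaluates $e^{2im\pi}$ for a few integral and half-integral values of $m$; the function name is an illustrative assumption.

```python
import cmath
from fractions import Fraction


def phase_after_full_turn(m):
    """Return exp(2*i*pi*m), the factor picked up by exp(i*m*phi) when phi -> phi + 2*pi."""
    return cmath.exp(2j * cmath.pi * float(m))


if __name__ == "__main__":
    for m in (0, 1, -2, Fraction(1, 2), Fraction(-3, 2)):
        z = phase_after_full_turn(m)
        print("m =", m, "-> factor is approximately", round(z.real, 12))
```

Up to floating-point rounding, the factor is $+1$ for integral $m$ and $-1$ for half-integral $m$, so only integral $m$ gives a function that is continuous at $\varphi = 0$.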


β. All integral values (positive or zero) of $l$ can be found

Choose an integral value of $l$ (positive or zero). We know from the general
theory of §C that $Y_l^l(\theta, \varphi)$ must satisfy:

$L_+\, Y_l^l(\theta, \varphi) = 0$ (D-16)

which yields, taking (D-6-b) and (D-13) into account:

$\left( \dfrac{d}{d\theta} - l \cot\theta \right) F_l^l(\theta) = 0$ (D-17)

This first order equation can be integrated immediately if we notice that:

$\cot\theta\; d\theta = \dfrac{d(\sin\theta)}{\sin\theta}$ (D-18)

Its general solution is:

$F_l^l(\theta) = c_l\, (\sin\theta)^l$ (D-19)

where $c_l$ is a normalization constant*.

Consequently, for each positive or zero integral value of $l$, there exists a
function $Y_l^l(\theta, \varphi)$ which is unique (to within a constant factor):

$Y_l^l(\theta, \varphi) = c_l\, (\sin\theta)^l\, e^{il\varphi}$ (D-20)

Through the repeated action of $L_-$, we construct $Y_l^{l-1}, \ldots, Y_l^m, \ldots, Y_l^{-l}$. Thus
we see that there corresponds, to the pair of eigenvalues $l(l+1)\hbar^2$ and $m\hbar$ (where
$l$ is an arbitrary positive integer or zero and $m$ is another integer such that
$-l \le m \le l$), one and only one eigenfunction: $Y_l^m(\theta, \varphi)$, which can be unambiguously
calculated from (D-20). The eigenfunctions $Y_l^m(\theta, \varphi)$ are called spherical harmonics.
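
As a worked illustration of this construction for $l = 1$, one can apply $L_-$ in the form (D-6-c) to $Y_1^1 = c_1 \sin\theta\, e^{i\varphi}$ and compare with the general action of the lowering operator given in (C-50). This is a sketch; the constant $c_1$ is left unspecified since it is fixed only by normalization and a phase convention.

```latex
\begin{aligned}
L_- Y_1^1
  &= \hbar\, e^{-i\varphi}
     \left( -\frac{\partial}{\partial\theta} + i\cot\theta\, \frac{\partial}{\partial\varphi} \right)
     c_1 \sin\theta\, e^{i\varphi} \\
  &= \hbar\, c_1 \left( -\cos\theta + i\cot\theta \cdot i \sin\theta \right)
   = -2\hbar\, c_1 \cos\theta , \\[4pt]
L_- Y_1^1
  &= \hbar\sqrt{1(1+1) - 1(1-1)}\; Y_1^0 = \hbar\sqrt{2}\; Y_1^0
  \quad\Longrightarrow\quad
  Y_1^0(\theta, \varphi) = -\sqrt{2}\, c_1 \cos\theta .
\end{aligned}
```

The result no longer depends on $\varphi$, as expected for $m = 0$ from (D-13).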


c. FUNDAMENTAL PROPERTIES OF THE SPHERICAL HARMONICS

The spherical harmonics $Y_l^m(\theta, \varphi)$ will be studied in greater detail in
complement A_VI. Here we shall confine ourselves to summarizing this study by
stating without proof its principal results.

α. Recurrence relations

According to the general results of §C, we have:

$L_\pm\, Y_l^m(\theta, \varphi) = \hbar\sqrt{l(l+1) - m(m \pm 1)}\; Y_l^{m \pm 1}(\theta, \varphi)$ (D-21)

* Inversely, one can easily show that the function obtained in this way is actually an eigenfunction
of $L^2$ and $L_z$ with eigenvalues $l(l+1)\hbar^2$ and $l\hbar$. According to (D-5-c) and (D-13), it is immediately seen
that $L_z Y_l^l(\theta, \varphi) = l\hbar\, Y_l^l(\theta, \varphi)$. Then, using this equation and (D-16), as well as (C-7-b), one can show
that $Y_l^l(\theta, \varphi)$ is also an eigenfunction of $L^2$ with the expected eigenvalue.

Using expressions (D-6-b) and (D-6-c) for the operators $L_+$ and $L_-$ and the
fact that $Y_l^m(\theta, \varphi)$ is the product of a function of $\theta$ alone and $e^{im\varphi}$, we obtain:

$e^{i\varphi} \left( \dfrac{\partial}{\partial\theta} - m \cot\theta \right) Y_l^m(\theta, \varphi) = \sqrt{l(l+1) - m(m+1)}\; Y_l^{m+1}(\theta, \varphi)$ (D-22-a)

$e^{-i\varphi} \left( -\dfrac{\partial}{\partial\theta} - m \cot\theta \right) Y_l^m(\theta, \varphi) = \sqrt{l(l+1) - m(m-1)}\; Y_l^{m-1}(\theta, \varphi)$ (D-22-b)

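β. Orthonormalization and closure relations

Equations (D-7) determine the spherical harmonics only to within a constant
factor. We now choose this factor so as to orthonormalize the $Y_l^m(\theta, \varphi)$ (as
functions of the angular variables $\theta$ and $\varphi$):

$\displaystyle\int_0^{2\pi} d\varphi \int_0^{\pi} \sin\theta\, d\theta\; Y_{l'}^{m'*}(\theta, \varphi)\, Y_l^m(\theta, \varphi) = \delta_{l'l}\, \delta_{m'm}$ (D-23)

Furthermore, any function of $\theta$ and $\varphi$, $f(\theta, \varphi)$, can be expanded in terms
of the spherical harmonics:

$f(\theta, \varphi) = \displaystyle\sum_{l=0}^{\infty} \sum_{m=-l}^{+l} c_{l,m}\, Y_l^m(\theta, \varphi)$ (D-24)

with:

$c_{l,m} = \displaystyle\int_0^{2\pi} d\varphi \int_0^{\pi} \sin\theta\, d\theta\; Y_l^{m*}(\theta, \varphi)\, f(\theta, \varphi)$ (D-25)

The spherical harmonics therefore constitute an orthonormal basis in the space $\mathcal{E}_\Omega$
of functions of $\theta$ and $\varphi$. This fact is expressed by the closure relation:

$\displaystyle\sum_{l=0}^{\infty} \sum_{m=-l}^{+l} Y_l^m(\theta, \varphi)\, Y_l^{m*}(\theta', \varphi') = \delta(\cos\theta - \cos\theta')\, \delta(\varphi - \varphi')$

$= \dfrac{1}{\sin\theta}\, \delta(\theta - \theta')\, \delta(\varphi - \varphi')$ (D-26)

[it is $\delta(\cos\theta - \cos\theta')$, and not $\delta(\theta - \theta')$, which enters into the right-hand side
of the closure relation because the integrations over the variable $\theta$ are performed
using the differential element $\sin\theta\, d\theta = -d(\cos\theta)$].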
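
The orthonormalization relation (D-23) can be illustrated numerically for $l = 0$ and $l = 1$. The sketch below uses the standard explicit expressions for these four spherical harmonics; these are quoted here as an assumption about the normalization and phase convention, since they are not derived in this section. The integration is a simple midpoint rule, so the results hold only to within the stated tolerance.

```python
import cmath
from math import sin, cos, pi, sqrt


def Y(l, m, theta, phi):
    """Explicit spherical harmonics for l = 0 and l = 1 (assumed standard convention)."""
    if (l, m) == (0, 0):
        return complex(1 / sqrt(4 * pi))
    if (l, m) == (1, 0):
        return complex(sqrt(3 / (4 * pi)) * cos(theta))
    if (l, m) == (1, 1):
        return -sqrt(3 / (8 * pi)) * sin(theta) * cmath.exp(1j * phi)
    if (l, m) == (1, -1):
        return sqrt(3 / (8 * pi)) * sin(theta) * cmath.exp(-1j * phi)
    raise ValueError("only l = 0 and l = 1 are tabulated in this sketch")


def overlap(l1, m1, l2, m2, n=120):
    """Midpoint-rule estimate of the integral of conj(Y_{l1}^{m1}) * Y_{l2}^{m2} over dOmega."""
    d_theta, d_phi = pi / n, 2 * pi / n
    total = 0j
    for i in range(n):
        theta = (i + 0.5) * d_theta
        weight = sin(theta)  # from dOmega = sin(theta) dtheta dphi
        for k in range(n):
            phi = (k + 0.5) * d_phi
            total += Y(l1, m1, theta, phi).conjugate() * Y(l2, m2, theta, phi) * weight
    return total * d_theta * d_phi


if __name__ == "__main__":
    labels = [(0, 0), (1, 1), (1, 0), (1, -1)]
    for a in labels:
        print([round(abs(overlap(*a, *b)), 3) for b in labels])
```

The printed table of absolute values should be close to the identity matrix.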


γ. Parity and complex conjugation

First of all, recall that the change from r to $-\mathbf{r}$ (reflection through the
coordinate origin) is expressed in spherical coordinates by (fig. 2):

$r \Longrightarrow r$
$\theta \Longrightarrow \pi - \theta$
$\varphi \Longrightarrow \pi + \varphi$ (D-27)

FIGURE 2

Transformation in spherical coordinates of an
arbitrary point by reflection through the origin;
$r$ is not changed, $\theta$ becomes $\pi - \theta$, and $\varphi$
becomes $\pi + \varphi$.

It is simple (see complement A_VI) to show that:

$Y_l^m(\pi - \theta, \pi + \varphi) = (-1)^l\, Y_l^m(\theta, \varphi)$ (D-28)

The spherical harmonics are therefore functions with a definite parity, which is
independent of $m$; they are even if $l$ is even and odd if $l$ is odd.

Also, it can easily be seen that:

$[Y_l^m(\theta, \varphi)]^* = (-1)^m\, Y_l^{-m}(\theta, \varphi)$ (D-29)

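
For the particular case $m = l$, the parity property (D-28) can be checked directly from the explicit form (D-20). The following is a sketch of that one case only; the general proof is the one referred to in complement A_VI.

```latex
\begin{aligned}
Y_l^l(\pi - \theta, \pi + \varphi)
  &= c_l \left[ \sin(\pi - \theta) \right]^l e^{il(\pi + \varphi)} \\
  &= c_l\, (\sin\theta)^l\, e^{il\pi}\, e^{il\varphi}
   = (-1)^l\, Y_l^l(\theta, \varphi) ,
\end{aligned}
```

since $\sin(\pi - \theta) = \sin\theta$ and $e^{il\pi} = (-1)^l$ for integral $l$.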

d. "STANDARD" BASES OF THE WAVE FUNCTION SPACE OF A SPINLESS PARTICLE

As we have already noted [comment (i) of §D-1-a], $L^2$ and $L_z$ do not
constitute a C.S.C.O. in the wave function space of a spinless particle. We shall
now indicate, relying on the reasoning and results of §C-3, the form of the
"standard" bases of this space.

Let $\mathcal{E}(l, m = l)$ be the subspace of eigenfunctions common to $L^2$ and $L_z$,
of eigenvalues $l(l+1)\hbar^2$ and $l\hbar$, where $l$ is a fixed positive integer or zero.
The first step in the construction of a "standard" basis (cf. §C-3) consists
of choosing an arbitrary orthonormal basis in each of the $\mathcal{E}(l, m = l)$. We
shall denote by $\psi_{k,l,l}(\mathbf{r})$ the functions which constitute the basis chosen in
$\mathcal{E}(l, m = l)$, the index $k$ (assumed to be discrete for simplicity) serving to distinguish
between the various functions of this basis. By repeated application of the
operator $L_-$ on the $\psi_{k,l,l}(\mathbf{r})$, we then construct the functions $\psi_{k,l,m}(\mathbf{r})$ which complete
the "standard" basis for $m \ne l$; they satisfy equations (C-12) and (C-50), which
become here:

$L^2\, \psi_{k,l,m}(\mathbf{r}) = l(l+1)\hbar^2\, \psi_{k,l,m}(\mathbf{r})$

$L_z\, \psi_{k,l,m}(\mathbf{r}) = m\hbar\, \psi_{k,l,m}(\mathbf{r})$ (D-30)

and:

$L_\pm\, \psi_{k,l,m}(\mathbf{r}) = \hbar\sqrt{l(l+1) - m(m \pm 1)}\; \psi_{k,l,m \pm 1}(\mathbf{r})$ (D-31)

But we saw in §D-1-a that all eigenfunctions common to $L^2$ and $L_z$ which
correspond to given eigenvalues $l(l+1)\hbar^2$ and $m\hbar$ have the same angular
dependence, that of $Y_l^m(\theta, \varphi)$; only their radial dependence differs. From equations
(D-30), we therefore deduce that $\psi_{k,l,m}(\mathbf{r})$ has the form:

$\psi_{k,l,m}(\mathbf{r}) = R_{k,l,m}(r)\, Y_l^m(\theta, \varphi)$ (D-32)

Let us now show that, if the $\psi_{k,l,m}(\mathbf{r})$ constitute a "standard" basis, the radial
functions $R_{k,l,m}(r)$ are independent of $m$. Since the differential operators $L_\pm$ do not
act on the $r$-dependence, we have, according to (D-21):

$L_\pm\, \psi_{k,l,m}(\mathbf{r}) = R_{k,l,m}(r)\, L_\pm Y_l^m(\theta, \varphi)$

$= \hbar\sqrt{l(l+1) - m(m \pm 1)}\; R_{k,l,m}(r)\, Y_l^{m \pm 1}(\theta, \varphi)$ (D-33)

Comparison with (D-31) shows that the radial functions must satisfy, for all $r$:

$R_{k,l,m \pm 1}(r) = R_{k,l,m}(r)$ (D-34)

and are consequently independent of $m$. The functions $\psi_{k,l,m}(\mathbf{r})$ of a "standard" basis
of the wave function space of a (spinless) particle are therefore necessarily of the
form:

$\psi_{k,l,m}(\mathbf{r}) = R_{k,l}(r)\, Y_l^m(\theta, \varphi)$ (D-35)

The orthonormalization relation for such a basis is:

$\displaystyle\int d^3r\; \psi_{k,l,m}^*(\mathbf{r})\, \psi_{k',l',m'}(\mathbf{r}) = \int_0^{\infty} r^2\, dr\; R_{k,l}^*(r)\, R_{k',l'}(r) \int_0^{2\pi} d\varphi \int_0^{\pi} \sin\theta\, d\theta\; Y_l^{m*}(\theta, \varphi)\, Y_{l'}^{m'}(\theta, \varphi) = \delta_{kk'}\, \delta_{ll'}\, \delta_{mm'}$ (D-36)

Since the spherical harmonics are orthonormal [formula (D-23)], we obtain,
finally:

$\displaystyle\int_0^{\infty} r^2\, dr\; R_{k,l}^*(r)\, R_{k',l}(r) = \delta_{kk'}$ (D-37)

The radial functions $R_{k,l}(r)$ are therefore normalized with respect to the variable $r$;
moreover, two radial functions corresponding to the same value of $l$ but to
different indices $k$ are orthogonal.


COMMENTS: 

(i) Formula (D-37) is simply a consequence of the fact that the functions 
$\psi_{k,l,l}(\mathbf{r}) = R_{k,l}(r)\,Y_l^l(\theta, \varphi)$ chosen as a basis in the subspace $\mathscr{E}(l, m = l)$ 
are orthonormal. It is therefore essential that the index $l$ be the same for 
the two functions $R_{k,l}$ appearing on the left-hand side. For $l \neq l'$, $\psi_{k,l,m}(\mathbf{r})$ 
and $\psi_{k',l',m'}(\mathbf{r})$ are orthogonal anyway because of their angular dependence 
(they are eigenfunctions of the Hermitian operator $\mathbf{L}^2$ with different 
eigenvalues). The integral: 

$$\int_0^\infty r^2\,dr\;R^*_{k,l}(r)\,R_{k',l'}(r) \tag{D-38}$$

may therefore take on any value a priori if $l$ and $l'$ are different. 

(ii) In general, the radial functions $R_{k,l}(r)$ depend on $l$, for the following reason. 
A function of the form $f(r)\,g(\theta, \varphi)$ can be continuous at the coordinate origin 
($r = 0$, $\theta$ and $\varphi$ arbitrary) only if $g(\theta, \varphi)$ reduces to a constant or if $f(r)$ 
goes to zero at $r = 0$ [since if $g(\theta, \varphi)$ depends on $\theta$ and $\varphi$, the limit of 
$f(r)\,g(\theta, \varphi)$ when $r \to 0$ depends on the direction along which one approaches 
the origin if $f(0)$ is not zero]. Consequently, if we want the basis 
functions $\psi_{k,l,m}(\mathbf{r})$ to be continuous, only the radial functions corresponding 
to $l = 0$ can be non-zero at $r = 0$ [$Y_0^0(\theta, \varphi)$ is indeed a constant]. Similarly, 
if we require the $\psi_{k,l,m}(\mathbf{r})$ to be differentiable (once or several times) at 
the origin, we obtain conditions for the $R_{k,l}(r)$ which depend on the value 
of $l$. 


2. Physical considerations 

a. STUDY OF A $| k, l, m \rangle$ STATE 

Consider a (spinless) particle in an eigenstate $| k, l, m \rangle$ of $\mathbf{L}^2$ and $L_z$ 
[whose associated wave function is $\psi_{k,l,m}(\mathbf{r})$]; that is, a state in which the square 
of its angular momentum and the projection of this angular momentum along the $Oz$ 
axis have well-defined values [$l(l+1)\hbar^2$ and $m\hbar$ respectively]. 

Suppose that we want to measure the component along the $Ox$ or $Oy$ axis 
of the angular momentum of this particle. Since $L_x$ and $L_y$ do not commute 
with $L_z$, $| k, l, m \rangle$ is an eigenstate neither of $L_x$ nor of $L_y$; we cannot, 
therefore, predict with certainty the result of such a measurement. Let us 
calculate the mean values and root-mean-square deviations of $L_x$ and $L_y$ in the 
state $| k, l, m \rangle$. 

These calculations can be performed very simply if we express $L_x$ and $L_y$ 
in terms of $L_+$ and $L_-$ by inverting formulas (C-1): 

$$L_x = \frac{1}{2}(L_+ + L_-)$$
$$L_y = \frac{1}{2i}(L_+ - L_-) \tag{D-39}$$

Thus we see that $L_x | k, l, m \rangle$ and $L_y | k, l, m \rangle$ are linear combinations of $| k, l, m+1 \rangle$ 
and $| k, l, m-1 \rangle$; this leads to: 

$$\langle k, l, m | L_x | k, l, m \rangle = \langle k, l, m | L_y | k, l, m \rangle = 0 \tag{D-40}$$

Furthermore: 

$$\langle k, l, m | L_x^2 | k, l, m \rangle = \frac{1}{4}\,\langle k, l, m | (L_+^2 + L_-^2 + L_+L_- + L_-L_+) | k, l, m \rangle$$
$$\langle k, l, m | L_y^2 | k, l, m \rangle = -\frac{1}{4}\,\langle k, l, m | (L_+^2 + L_-^2 - L_+L_- - L_-L_+) | k, l, m \rangle \tag{D-41}$$

The terms in $L_+^2$ and $L_-^2$ do not contribute to the result, since $L_\pm^2 | k, l, m \rangle$ 
is proportional to $| k, l, m \pm 2 \rangle$. In addition, formula (C-8) yields: 

$$L_+L_- + L_-L_+ = 2(\mathbf{L}^2 - L_z^2) \tag{D-42}$$

We therefore obtain: 

$$\langle k, l, m | L_x^2 | k, l, m \rangle = \langle k, l, m | L_y^2 | k, l, m \rangle = \frac{1}{2}\,\langle k, l, m | (\mathbf{L}^2 - L_z^2) | k, l, m \rangle = \frac{\hbar^2}{2}\,[l(l+1) - m^2] \tag{D-43}$$

Thus, in the state $| k, l, m \rangle$: 

$$\langle L_x \rangle = \langle L_y \rangle = 0 \tag{D-44-a}$$
$$\Delta L_x = \Delta L_y = \hbar\sqrt{\tfrac{1}{2}\,[l(l+1) - m^2]} \tag{D-44-b}$$


These results suggest the following picture. Consider a classical angular 
momentum, whose modulus is equal to $\hbar\sqrt{l(l+1)}$ and whose projection along 
$Oz$ is $m\hbar$ (fig. 3): 

$$|OL| = \hbar\sqrt{l(l+1)}$$
$$OH = m\hbar \tag{D-45}$$

FIGURE 3 

A classical model which can be applied to the orbital angular momentum of a 
particle in a state $| l, m \rangle$. We assume that the distance $|OL|$ and the angle $\Theta$ 
are known, but that $\Phi$ is a random variable whose probability density is 
constant inside the interval $[0, 2\pi]$. The classical mean values of the components 
of $OL$, as well as those of the squares of these components, are then equal to the 
corresponding quantum mechanical mean values. 

We denote by $\Theta$ and $\Phi$ the polar angles which characterize its direction. 
Since the triangle $OLJ$ has a right angle at $J$, and $OH = JL$, we have: 

$$OJ = \sqrt{OL^2 - OH^2} = \hbar\sqrt{l(l+1) - m^2} \tag{D-46}$$

Consequently, the components of such a classical angular momentum would be: 

$$OI = \hbar\sqrt{l(l+1) - m^2}\,\cos\Phi$$
$$OK = \hbar\sqrt{l(l+1) - m^2}\,\sin\Phi \tag{D-47}$$
$$OH = \hbar\sqrt{l(l+1)}\,\cos\Theta = m\hbar$$

Now let us assume that $|OL|$ and $\Theta$ are known and that $\Phi$ is a random 
variable which can take on any value in the interval $[0, 2\pi]$, all these values 
being equally probable (an evenly distributed random variable). We then have, 
averaging over $\Phi$: 

$$\langle OI \rangle \propto \int_0^{2\pi} \cos\Phi\,d\Phi = 0 \tag{D-48-a}$$
$$\langle OK \rangle \propto \int_0^{2\pi} \sin\Phi\,d\Phi = 0 \tag{D-48-b}$$

which corresponds to (D-44-a). In addition: 

$$\langle OI^2 \rangle = \frac{1}{2\pi}\int_0^{2\pi} \hbar^2\,[l(l+1) - m^2]\,\cos^2\Phi\,d\Phi = \frac{\hbar^2}{2}\,[l(l+1) - m^2] \tag{D-49}$$

and, similarly: 

$$\langle OK^2 \rangle = \frac{\hbar^2}{2}\,[l(l+1) - m^2] \tag{D-50}$$

These mean values are identical to the ones we found in (D-44). Consequently, 
the angular momentum of a particle in the state $| k, l, m \rangle$ behaves, insofar as the 
mean values of its components and their squares are concerned, like a classical 
angular momentum of magnitude $\hbar\sqrt{l(l+1)}$ having a projection $m\hbar$ along 
$Oz$, but for which $\Phi$ is a random variable evenly distributed between $0$ and $2\pi$. 

Of course, this picture must be used carefully: we have shown throughout 
this chapter how much the quantum mechanical properties of angular momenta 
differ from their classical properties. In particular, we must stress the fact that 
an individual measurement of $L_x$ or $L_y$ on a particle in the state $| k, l, m \rangle$ cannot 
yield an arbitrary value between $-\hbar\sqrt{l(l+1) - m^2}$ and $+\hbar\sqrt{l(l+1) - m^2}$, 
as the preceding model might lead us to believe. The only possible results are the 
eigenvalues of $L_x$ or $L_y$ (we saw at the end of §C that these are the same as those 
of $L_z$), that is, since $l$ is fixed here, one of the $(2l+1)$ values $l\hbar$, $(l-1)\hbar$, ..., 
$(-l+1)\hbar$, $-l\hbar$. 
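
The agreement between the quantum mean value (D-43) and the classical average (D-49) can be checked numerically. The following is only an illustrative sketch, not part of the original text: the function names are hypothetical and $\hbar$ is set to 1. The quantum value is obtained as the squared norm of $L_x | k, l, m \rangle$, using $L_x = (L_+ + L_-)/2$ and the coefficients of (D-31); the classical value is a midpoint-rule average over $\Phi$. 

```python
import math

def quantum_lx2(l, m):
    """<k,l,m| L_x^2 |k,l,m> in units of hbar^2, from L_x = (L_+ + L_-)/2 and (D-31)."""
    up = l * (l + 1) - m * (m + 1)    # squared coefficient of |k,l,m+1> in L_+|k,l,m>
    down = l * (l + 1) - m * (m - 1)  # squared coefficient of |k,l,m-1> in L_-|k,l,m>
    # L_x|k,l,m> is a sum of two orthogonal kets, so <L_x^2> is its squared norm.
    return (up + down) / 4

def classical_oi2(l, m, n=1000):
    """Average of OI^2 for Phi uniform on [0, 2*pi] (midpoint rule), with hbar = 1."""
    radius2 = l * (l + 1) - m * m     # OJ^2, formula (D-46)
    dphi = 2 * math.pi / n
    integral = sum(math.cos((j + 0.5) * dphi) ** 2 for j in range(n)) * dphi
    return radius2 * integral / (2 * math.pi)

if __name__ == "__main__":
    for l in range(4):
        for m in range(-l, l + 1):
            print(l, m, quantum_lx2(l, m), classical_oi2(l, m))
```

The two columns printed for each pair $(l, m)$ should coincide with $[l(l+1) - m^2]/2$ up to rounding; the sketch says nothing about individual measurement results, which remain restricted to the eigenvalues of $L_x$. 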
b. CALCULATION OF THE PHYSICAL PREDICTIONS 
CONCERNING MEASUREMENTS OF $\mathbf{L}^2$ AND $L_z$ 

Consider a particle whose state is described by the (normalized) wave 
function: 

$$\langle \mathbf{r} | \psi \rangle = \psi(\mathbf{r}) = \psi(r, \theta, \varphi) \tag{D-51}$$

We know that a measurement of $\mathbf{L}^2$ can yield only the results $0$, $2\hbar^2$, $6\hbar^2$, ..., 
$l(l+1)\hbar^2$, ..., and a measurement of $L_z$, only the results $0$, $\pm\hbar$, $\pm 2\hbar$, ..., 
$m\hbar$, ... How can we calculate the probabilities of these different results from the 
wave function $\psi(r, \theta, \varphi)$? 


α. General formulas 

Let us denote by $\mathscr{P}_{\mathbf{L}^2, L_z}(l, m)$ the probability of finding, in a simultaneous 
measurement of $\mathbf{L}^2$ and $L_z$, the results $l(l+1)\hbar^2$ and $m\hbar$. This probability can 
be obtained by expanding $\psi(\mathbf{r})$ on a basis composed of eigenfunctions of $\mathbf{L}^2$ and 
$L_z$; we shall choose a “standard” basis of the type introduced in §D-1-d: 

$$\psi_{k,l,m}(\mathbf{r}) = R_{k,l}(r)\,Y_l^m(\theta, \varphi) \tag{D-52}$$

$\psi(\mathbf{r})$ can then be written: 

$$\psi(\mathbf{r}) = \sum_k \sum_l \sum_m c_{k,l,m}\,R_{k,l}(r)\,Y_l^m(\theta, \varphi) \tag{D-53}$$

where the coefficients $c_{k,l,m}$ can be calculated by using the usual formula: 

$$c_{k,l,m} = \int d^3r\;\psi^*_{k,l,m}(\mathbf{r})\,\psi(\mathbf{r}) = \int_0^\infty r^2\,dr\;R^*_{k,l}(r) \int_0^{2\pi} d\varphi \int_0^\pi \sin\theta\,d\theta\;Y_l^{m*}(\theta, \varphi)\,\psi(r, \theta, \varphi) \tag{D-54}$$

According to the postulates of chapter III, the probability $\mathscr{P}_{\mathbf{L}^2, L_z}(l, m)$ is 
given, under these conditions, by: 

$$\mathscr{P}_{\mathbf{L}^2, L_z}(l, m) = \sum_k |c_{k,l,m}|^2 \tag{D-55}$$

If we measure only $\mathbf{L}^2$, the probability of finding the result $l(l+1)\hbar^2$ is equal to: 

$$\mathscr{P}_{\mathbf{L}^2}(l) = \sum_{m=-l}^{+l} \mathscr{P}_{\mathbf{L}^2, L_z}(l, m) = \sum_k \sum_{m=-l}^{+l} |c_{k,l,m}|^2 \tag{D-56}$$

Similarly, if it is only $L_z$ that we wish to measure, the probability of obtaining $m\hbar$ 
is: 

$$\mathscr{P}_{L_z}(m) = \sum_{l \geq |m|} \mathscr{P}_{\mathbf{L}^2, L_z}(l, m) = \sum_k \sum_{l \geq |m|} |c_{k,l,m}|^2 \tag{D-57}$$

(the restriction $l \geq |m|$ is automatically satisfied, since there are no coefficients 
$c_{k,l,m}$ for which $|m|$ would be greater than $l$). 
Actually, since $\mathbf{L}^2$ and $L_z$ act only on $\theta$ and $\varphi$, we see that it is the $\theta$- and 
$\varphi$-dependence of the wave function $\psi(\mathbf{r})$ that count in the preceding probability 
calculations. To be more precise, consider $\psi(r, \theta, \varphi)$ as a function of $\theta$ and $\varphi$ 
depending on the parameter $r$. Like any other function of $\theta$ and $\varphi$, $\psi$ can then 
be expanded in terms of the spherical harmonics: 

$$\psi(r, \theta, \varphi) = \sum_l \sum_m a_{l,m}(r)\,Y_l^m(\theta, \varphi) \tag{D-58}$$

The coefficients $a_{l,m}$ of this expansion depend on the “parameter” $r$ and are given 
by: 

$$a_{l,m}(r) = \int_0^{2\pi} d\varphi \int_0^\pi \sin\theta\,d\theta\;Y_l^{m*}(\theta, \varphi)\,\psi(r, \theta, \varphi) \tag{D-59}$$

If we compare expressions (D-58) and (D-53), we see that the $c_{k,l,m}$ are the 
coefficients of the expansion of $a_{l,m}(r)$ on the functions $R_{k,l}(r)$: 

$$a_{l,m}(r) = \sum_k c_{k,l,m}\,R_{k,l}(r) \tag{D-60}$$

with, taking (D-54) and (D-59) into account: 

$$c_{k,l,m} = \int_0^\infty r^2\,dr\;R^*_{k,l}(r)\,a_{l,m}(r) \tag{D-61}$$

Using (D-37) and (D-60), we also obtain: 

$$\int_0^\infty r^2\,dr\;|a_{l,m}(r)|^2 = \sum_k |c_{k,l,m}|^2 \tag{D-62}$$

The probability $\mathscr{P}_{\mathbf{L}^2, L_z}(l, m)$ [formula (D-55)] can therefore also be written in 
the form: 

$$\mathscr{P}_{\mathbf{L}^2, L_z}(l, m) = \int_0^\infty r^2\,dr\;|a_{l,m}(r)|^2 \tag{D-63}$$

From this, we can deduce, as in (D-56) and (D-57): 

$$\mathscr{P}_{\mathbf{L}^2}(l) = \sum_{m=-l}^{+l} \int_0^\infty r^2\,dr\;|a_{l,m}(r)|^2 \tag{D-64}$$

and: 

$$\mathscr{P}_{L_z}(m) = \sum_{l \geq |m|} \int_0^\infty r^2\,dr\;|a_{l,m}(r)|^2 \tag{D-65}$$

[here again, $a_{l,m}(r)$ exists only for $l \geq |m|$]. Consequently, to obtain the physical 
predictions concerning measurements of $\mathbf{L}^2$ and $L_z$, we may consider the wave 
function as depending only on $\theta$ and $\varphi$. We then expand it in terms of the 
spherical harmonics as in (D-58) and apply formulas (D-63), (D-64) and (D-65). 

Similarly, since $L_z$ acts only on $\varphi$, it is the $\varphi$-dependence of the wave 
function $\psi(\mathbf{r})$ which counts in the calculation of $\mathscr{P}_{L_z}(m)$. To see this, we shall 
use the fact that the spherical harmonics are products of a function of $\theta$ alone 
and a function of $\varphi$ alone. We shall write them in the form: 

$$Y_l^m(\theta, \varphi) = Z_l^m(\theta)\,\frac{e^{im\varphi}}{\sqrt{2\pi}} \tag{D-66}$$

so that each of the functions of the product is normalized, since we have: 

$$\int_0^{2\pi} d\varphi\;\frac{e^{-im\varphi}}{\sqrt{2\pi}}\,\frac{e^{im'\varphi}}{\sqrt{2\pi}} = \delta_{mm'} \tag{D-67}$$

Substituting this formula into the orthonormalization relation (D-23) for the 
spherical harmonics, we find: 

$$\int_0^\pi \sin\theta\,d\theta\;Z_l^{m*}(\theta)\,Z_{l'}^m(\theta) = \delta_{ll'} \tag{D-68}$$

[for reasons analogous to those indicated in comment (i) of §D-1-d, the same 
value of $m$ is involved in both functions $Z_l^m$ of the left-hand side]. 

If we consider $\psi(r, \theta, \varphi)$ to be a function of $\varphi$ defined in the interval $[0, 2\pi]$ 
and depending on the “parameters” $r$ and $\theta$, we can expand it in a Fourier 
series: 

$$\psi(r, \theta, \varphi) = \sum_m b_m(r, \theta)\,\frac{e^{im\varphi}}{\sqrt{2\pi}} \tag{D-69}$$

where the coefficients $b_m(r, \theta)$ can be calculated from the formula: 

$$b_m(r, \theta) = \frac{1}{\sqrt{2\pi}} \int_0^{2\pi} d\varphi\;e^{-im\varphi}\,\psi(r, \theta, \varphi) \tag{D-70}$$

If we compare formulas (D-69) and (D-70) with (D-58) and (D-59), we see that 
the $a_{l,m}(r)$ for fixed $m$ are the coefficients of the expansion of $b_m(r, \theta)$ on the 
functions $Z_l^m$ corresponding to the same value of $m$: 

$$b_m(r, \theta) = \sum_l a_{l,m}(r)\,Z_l^m(\theta) \tag{D-71}$$

with: 

$$a_{l,m}(r) = \int_0^\pi \sin\theta\,d\theta\;Z_l^{m*}(\theta)\,b_m(r, \theta) \tag{D-72}$$

With (D-68) taken into account, expansion (D-71) requires that: 

$$\int_0^\pi \sin\theta\,d\theta\;|b_m(r, \theta)|^2 = \sum_l |a_{l,m}(r)|^2 \tag{D-73}$$

Substituting this formula into (D-65), we obtain $\mathscr{P}_{L_z}(m)$ in the form: 

$$\mathscr{P}_{L_z}(m) = \int_0^\infty r^2\,dr \int_0^\pi \sin\theta\,d\theta\;|b_m(r, \theta)|^2 \tag{D-74}$$
Therefore, as far as measurements of $L_z$ alone are concerned, all we need to do 
is consider the wave function as depending solely on $\varphi$ and expand it in a Fourier 
series as in (D-69) in order to calculate the probabilities of the various possible 
results. 

We might be tempted to think that an argument analogous to the preceding 
ones would give $\mathscr{P}_{\mathbf{L}^2}(l)$ in terms of the expansion of $\psi(r, \theta, \varphi)$ with respect to the 
variable $\theta$ alone. In fact, this is not the case: predictions concerning a measurement 
of $\mathbf{L}^2$ alone involve both the $\theta$- and the $\varphi$-dependence of the wave function; this 
is related to the fact that $\mathbf{L}^2$ acts on both $\theta$ and $\varphi$. We must therefore use 
formula (D-64). 


β. Special cases and examples 

Suppose that the wave function $\psi(\mathbf{r})$ which represents the state of the 
particle appears in the form of a product of a function of $r$ alone and a function 
of $\theta$ and $\varphi$: 

$$\psi(r, \theta, \varphi) = f(r)\,g(\theta, \varphi) \tag{D-75}$$

We can always assume $f(r)$ and $g(\theta, \varphi)$ to be separately normalized: 

$$\int_0^\infty r^2\,dr\;|f(r)|^2 = 1 \tag{D-76-a}$$
$$\int_0^{2\pi} d\varphi \int_0^\pi \sin\theta\,d\theta\;|g(\theta, \varphi)|^2 = 1 \tag{D-76-b}$$

To obtain the expansion (D-58) of such a wave function, all we must do is 
expand $g(\theta, \varphi)$ in terms of the spherical harmonics: 

$$g(\theta, \varphi) = \sum_l \sum_m d_{l,m}\,Y_l^m(\theta, \varphi) \tag{D-77}$$

with: 

$$d_{l,m} = \int_0^{2\pi} d\varphi \int_0^\pi \sin\theta\,d\theta\;Y_l^{m*}(\theta, \varphi)\,g(\theta, \varphi) \tag{D-78}$$

In this case, therefore, the coefficients $a_{l,m}(r)$ of formula (D-58) are all proportional 
to $f(r)$: 

$$a_{l,m}(r) = d_{l,m}\,f(r) \tag{D-79}$$

With (D-76-a) taken into account, expression (D-63) for the probability $\mathscr{P}_{\mathbf{L}^2, L_z}(l, m)$ 
here becomes simply: 

$$\mathscr{P}_{\mathbf{L}^2, L_z}(l, m) = |d_{l,m}|^2 \tag{D-80}$$

This probability is totally independent of the radial part $f(r)$ of the wave function. 

Similarly, let us consider the case in which the wave function $\psi(r, \theta, \varphi)$ 
is the product of three functions of a single variable: 

$$\psi(r, \theta, \varphi) = f(r)\,h(\theta)\,k(\varphi) \tag{D-81}$$

which we shall assume to be separately normalized: 

$$\int_0^\infty r^2\,dr\;|f(r)|^2 = \int_0^\pi \sin\theta\,d\theta\;|h(\theta)|^2 = \int_0^{2\pi} d\varphi\;|k(\varphi)|^2 = 1 \tag{D-82}$$

Of course, (D-81) is a special case of (D-75), and the results we have just 
established apply here. But, in addition, if we are interested only in a measurement 
of $L_z$, all we must do is expand $k(\varphi)$ in the form: 

$$k(\varphi) = \sum_m e_m\,\frac{e^{im\varphi}}{\sqrt{2\pi}} \tag{D-83}$$

where: 

$$e_m = \frac{1}{\sqrt{2\pi}} \int_0^{2\pi} d\varphi\;e^{-im\varphi}\,k(\varphi) \tag{D-84}$$

in order to obtain the equivalent of formula (D-69), with: 

$$b_m(r, \theta) = e_m\,f(r)\,h(\theta) \tag{D-85}$$

According to (D-82), $\mathscr{P}_{L_z}(m)$ is then given by (D-74) as: 

$$\mathscr{P}_{L_z}(m) = |e_m|^2 \tag{D-86}$$

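The preceding considerations can be illustrated by some very simple examples. 
First, let us assume that the wave function $\psi(\mathbf{r})$ is in fact independent of $\theta$ and $\varphi$, 
so that: 

$$h(\theta) = \frac{1}{\sqrt{2}}$$
$$k(\varphi) = \frac{1}{\sqrt{2\pi}} \tag{D-87}$$

We then have: 

$$g(\theta, \varphi) = \frac{1}{\sqrt{4\pi}} = Y_0^0(\theta, \varphi) \tag{D-88}$$

Thus a measurement of $\mathbf{L}^2$ or of $L_z$ must yield zero. 

Now, let us modify only the $\theta$-dependence, choosing: 

$$h(\theta) = \sqrt{\frac{3}{2}}\,\cos\theta$$
$$k(\varphi) = \frac{1}{\sqrt{2\pi}} \tag{D-89}$$

In this case: 

$$g(\theta, \varphi) = \sqrt{\frac{3}{4\pi}}\,\cos\theta = Y_1^0(\theta, \varphi) \tag{D-90}$$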
We are again sure of the results of measuring $\mathbf{L}^2$ or $L_z$. For $\mathbf{L}^2$, we can only obtain $2\hbar^2$; 
for $L_z$, $0$. It can be verified that this modification of the $\theta$-dependence has not 
changed the physical predictions concerning the measurement of $L_z$. 

On the other hand, if we modify the $\varphi$-dependence by setting, for example: 

$$k(\varphi) = \frac{e^{i\varphi}}{\sqrt{2\pi}} \tag{D-91}$$

$g(\theta, \varphi)$ is no longer equal to a single spherical harmonic. According to (D-86), 
all the probabilities $\mathscr{P}_{L_z}(m)$ are zero except for: 

$$\mathscr{P}_{L_z}(m = 1) = 1 \tag{D-92}$$

But the predictions concerning a measurement of $\mathbf{L}^2$ are also changed with 
respect to the case (D-87). In order to calculate these predictions, we must 
expand the function: 

$$g(\theta, \varphi) = \frac{1}{\sqrt{4\pi}}\,e^{i\varphi} \tag{D-93}$$

on the spherical harmonics. It can be verified that all the $Y_l^m(\theta, \varphi)$, with odd $l$ 
and $m = 1$, actually appear in the expansion of the function (D-93). Consequently, 
we are no longer sure of the result of a measurement of $\mathbf{L}^2$ (the probabilities of 
the various possible results can be calculated from the expression for the spherical 
harmonics). We therefore conclude from this example that, as pointed out at 
the end of §α, the $\varphi$-dependence of the wave function also enters into the 
calculation of predictions concerning measurements of $\mathbf{L}^2$. 
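
The last statement can be made concrete with a small numerical sketch, which is an illustration added here and not part of the original text. It evaluates the coefficients $d_{l,1}$ of (D-78) for the angular function $e^{i\varphi}/\sqrt{4\pi}$ (that is, $h(\theta) = 1/\sqrt{2}$ as in (D-87) and $k(\varphi) = e^{i\varphi}/\sqrt{2\pi}$), using the standard explicit expressions of $Y_1^1$ and $Y_2^1$; the helper names and the midpoint-rule integration are assumptions of the example. 

```python
import cmath
import math

def y11(theta, phi):
    """Y_1^1 = -sqrt(3/(8*pi)) * sin(theta) * exp(i*phi)."""
    return -math.sqrt(3 / (8 * math.pi)) * math.sin(theta) * cmath.exp(1j * phi)

def y21(theta, phi):
    """Y_2^1 = -sqrt(15/(8*pi)) * sin(theta) * cos(theta) * exp(i*phi)."""
    return (-math.sqrt(15 / (8 * math.pi)) * math.sin(theta) * math.cos(theta)
            * cmath.exp(1j * phi))

def g(theta, phi):
    """Angular part assumed for the example: exp(i*phi) / sqrt(4*pi)."""
    return cmath.exp(1j * phi) / math.sqrt(4 * math.pi)

def coefficient(y, n=200):
    """Midpoint-rule estimate of d_{l,m}: integral of conj(Y) * g * sin(theta) over the sphere."""
    dtheta = math.pi / n
    dphi = 2 * math.pi / n
    total = 0j
    for i in range(n):
        theta = (i + 0.5) * dtheta
        for j in range(n):
            phi = (j + 0.5) * dphi
            total += y(theta, phi).conjugate() * g(theta, phi) * math.sin(theta)
    return total * dtheta * dphi

if __name__ == "__main__":
    print("P(l=1, m=1) =", abs(coefficient(y11)) ** 2)
    print("P(l=2, m=1) =", abs(coefficient(y21)) ** 2)
```

Under these assumptions the probability of finding $2\hbar^2$ comes out as $3\pi^2/32 \approx 0.925$, while the $l = 2$ coefficient vanishes by symmetry; the remaining probability is spread over the odd values $l = 3, 5, \ldots$, consistent with the statement that the result of a measurement of $\mathbf{L}^2$ is no longer certain. 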


References and suggestions for further reading: 

Dirac (1.13), §§35 and 36; Messiah (1.17), chap. XIII; Rose (2.19); Edmonds (2.21). 
COMPLEMENTS OF CHAPTER VI 

$A_{VI}$: SPHERICAL HARMONICS 

$A_{VI}$: detailed study of the spherical harmonics $Y_l^m(\theta, \varphi)$; establishes certain 
properties used in chapter VI, as well as in certain subsequent complements. 

$B_{VI}$: ANGULAR MOMENTUM AND ROTATIONS 

$B_{VI}$: brings out the close relation which exists between the angular momentum $\mathbf{J}$ 
of a quantum mechanical system and the spatial rotations that can be performed 
on it. Shows that the commutation relations between the components of $\mathbf{J}$ 
express purely geometrical properties of these rotations; introduces the concept 
of a scalar or vector observable, which will reappear in other complements 
(especially $D_X$). Important theoretically; however, sometimes difficult; can be 
reserved for later study. 

$C_{VI}$: ROTATION OF DIATOMIC MOLECULES 

$C_{VI}$: a simple and direct application of quantum mechanical properties of 
angular momentum: pure rotational spectra of heteropolar diatomic molecules 
and Raman rotational spectra. Elementary level. Because of the importance 
of the phenomena studied in physics and chemistry, can be recommended for a 
first reading. 

$D_{VI}$: ANGULAR MOMENTUM OF STATIONARY STATES OF A TWO-DIMENSIONAL 
HARMONIC OSCILLATOR 

$D_{VI}$: can be considered as a worked example. Studies the stationary states of 
the two-dimensional harmonic oscillator; in order to class these states by 
angular momentum, introduces the concept of “circular quanta”. Not 
theoretically difficult. Some results will be used in complement $E_{VI}$. 

$E_{VI}$: A CHARGED PARTICLE IN A MAGNETIC FIELD; LANDAU LEVELS 

$E_{VI}$: general study of the quantum mechanical properties of a charged particle 
in a magnetic field, followed by a study of the important special case in which 
the magnetic field is uniform (Landau levels). Not theoretically difficult. 
Recommended for a first reading, which can, however, be confined to §§ 1-a and b, 
2-a and b, 3-a. 

$F_{VI}$: EXERCISES 
Complement $A_{VI}$ 
SPHERICAL HARMONICS 


1. Calculation of spherical harmonics 

a. Determination of $Y_l^l(\theta, \varphi)$ 

b. General expression for $Y_l^m(\theta, \varphi)$ 

c. Explicit expressions for $l = 0$, $1$ and $2$ 

2. Properties of spherical harmonics 

a. Recurrence relations 

b. Orthonormalization and closure relations 

c. Parity 

d. Complex conjugation 

e. Relation between the spherical harmonics and the Legendre polynomials and associated Legendre 
functions 


This complement is devoted to the study of the form and principal properties 
of spherical harmonics. It includes the proofs of certain results which were 
stated without proof in § D-l-c of chapter VI. 


1. Calculation of spherical harmonics 


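In order to calculate the various spherical harmonics $Y_l^m(\theta, \varphi)$, we shall 
use the method indicated in chapter VI (§D-1-c): starting with the expression 
for $Y_l^l(\theta, \varphi)$, we shall use the operator $L_-$ to obtain by iteration the spherical 
harmonics corresponding to the same value of $l$ and the $(2l+1)$ values of $m$ 
associated with it. Recall that the operators $L_+$ and $L_-$ act only on the angular 
dependence of a wave function and can be written: 

$$L_+ = \hbar\,e^{i\varphi}\left[\frac{\partial}{\partial\theta} + i\cot\theta\,\frac{\partial}{\partial\varphi}\right]$$
$$L_- = \hbar\,e^{-i\varphi}\left[-\frac{\partial}{\partial\theta} + i\cot\theta\,\frac{\partial}{\partial\varphi}\right] \tag{1}$$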

a. DETERMINATION OF $Y_l^l(\theta, \varphi)$ 

We have seen (§D-1-b of chapter VI) that $Y_l^l(\theta, \varphi)$ can be calculated from 
the equation: 

$$L_+\,Y_l^l(\theta, \varphi) = 0 \tag{2}$$

and from the fact that: 

$$Y_l^l(\theta, \varphi) = F_l^l(\theta)\,e^{il\varphi} \tag{3}$$

Thus we obtained: 

$$Y_l^l(\theta, \varphi) = c_l\,(\sin\theta)^l\,e^{il\varphi} \tag{4}$$

where $c_l$ is an arbitrary constant. 
First, let us determine the absolute value of $c_l$ by requiring $Y_l^l(\theta, \varphi)$ to be 
normalized with respect to the angular variables $\theta$ and $\varphi$: 

$$\int_0^{2\pi} d\varphi \int_0^\pi \sin\theta\,d\theta\;|Y_l^l(\theta, \varphi)|^2 = |c_l|^2 \int_0^{2\pi} d\varphi \int_0^\pi \sin\theta\,d\theta\;(\sin\theta)^{2l} = 1 \tag{5}$$

We obtain: 

$$|c_l|^2 = \frac{1}{2\pi I_l} \tag{6}$$

where $I_l$ is given by: 

$$I_l = \int_0^\pi \sin\theta\,d\theta\;(\sin\theta)^{2l} = \int_{-1}^{+1} du\;(1 - u^2)^l \tag{7}$$

(setting $u = \cos\theta$). $I_l$ can easily be calculated by recurrence since: 

$$I_l = \int_{-1}^{+1} du\;(1 - u^2)(1 - u^2)^{l-1} = I_{l-1} - \int_{-1}^{+1} du\;u^2\,(1 - u^2)^{l-1} \tag{8}$$

An integration by parts of the last integral yields: 

$$I_l = I_{l-1} - \frac{1}{2l}\,I_l \tag{9}$$

We therefore have: 

$$I_l = \frac{2l}{2l+1}\,I_{l-1} \tag{10}$$

with: 

$$I_0 = \int_{-1}^{+1} du = 2 \tag{11}$$

From this, we can immediately derive the value of $I_l$: 

$$I_l = \frac{(2l)!!}{(2l+1)!!}\,I_0 = \frac{2^{2l+1}\,(l!)^2}{(2l+1)!} \tag{12}$$

$Y_l^l(\theta, \varphi)$ is then normalized if: 

$$|c_l| = \frac{1}{2^l\,l!}\sqrt{\frac{(2l+1)!}{4\pi}} \tag{13}$$

In order to define $c_l$ completely, we must choose its phase. It is customary 
to choose: 

$$c_l = \frac{(-1)^l}{2^l\,l!}\sqrt{\frac{(2l+1)!}{4\pi}} \tag{14}$$

We shall see later that, with this convention, $Y_l^0(\theta, \varphi)$ (which is independent of $\varphi$) 
has a real positive value for $\theta = 0$. 
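
The recurrence (10) and (11), the closed form (12) and the normalization condition (6) are easy to cross-check numerically. The following short sketch is an added illustration, with hypothetical function names, and uses only the formulas quoted above. 

```python
import math

def integral_by_recurrence(l):
    """I_l from I_0 = 2 [formula (11)] and I_l = 2l/(2l+1) * I_{l-1} [formula (10)]."""
    value = 2.0
    for j in range(1, l + 1):
        value *= 2 * j / (2 * j + 1)
    return value

def integral_closed_form(l):
    """I_l = 2^(2l+1) (l!)^2 / (2l+1)!, formula (12)."""
    return 2 ** (2 * l + 1) * math.factorial(l) ** 2 / math.factorial(2 * l + 1)

def c_l(l):
    """Normalization constant with the phase convention of formula (14)."""
    return ((-1) ** l / (2 ** l * math.factorial(l))
            * math.sqrt(math.factorial(2 * l + 1) / (4 * math.pi)))

if __name__ == "__main__":
    for l in range(5):
        # The last column is 2*pi*I_l*|c_l|^2, which (6) requires to equal 1.
        print(l, integral_by_recurrence(l), integral_closed_form(l),
              2 * math.pi * integral_closed_form(l) * c_l(l) ** 2)
```

For $l = 1$, for instance, this gives $I_1 = 4/3$ and $c_1 = -\sqrt{3/8\pi}$. 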
b. GENERAL EXPRESSION FOR $Y_l^m(\theta, \varphi)$ 

We shall obtain the other spherical harmonics $Y_l^m(\theta, \varphi)$ by successive 
application of the operator $L_-$ to the $Y_l^l(\theta, \varphi)$ which we have just determined. 
First, we shall prove a convenient formula which will enable us to simplify the 
calculations. 

α. The action of $(L_\pm)^p$ on a function of the form $e^{in\varphi}F(\theta)$ 

The action of the operators $L_+$ and $L_-$ on a function of the form $e^{in\varphi}F(\theta)$ 
(where $n$ is any integer) is given by: 

$$L_\pm\left[e^{in\varphi}F(\theta)\right] = \mp\hbar\,e^{i(n\pm1)\varphi}\,(\sin\theta)^{1\pm n}\,\frac{d}{d(\cos\theta)}\left[(\sin\theta)^{\mp n}F(\theta)\right] \tag{15}$$

More generally: 

$$(L_\pm)^p\left[e^{in\varphi}F(\theta)\right] = (\mp\hbar)^p\,e^{i(n\pm p)\varphi}\,(\sin\theta)^{p\pm n}\,\frac{d^p}{d(\cos\theta)^p}\left[(\sin\theta)^{\mp n}F(\theta)\right] \tag{16}$$

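First, let us prove formula (15). We know that: 

$$\frac{d}{d(\cos\theta)} = \frac{d\theta}{d(\cos\theta)}\,\frac{d}{d\theta} = -\frac{1}{\sin\theta}\,\frac{d}{d\theta} \tag{17}$$

and therefore: 

$$(\sin\theta)^{1\pm n}\,\frac{d}{d(\cos\theta)}\left[(\sin\theta)^{\mp n}F(\theta)\right] = -(\sin\theta)^{\pm n}\left[\mp n\,(\sin\theta)^{\mp n-1}\cos\theta\,F(\theta) + (\sin\theta)^{\mp n}\,\frac{dF}{d\theta}\right] = -\left[\mp n\cot\theta\,F(\theta) + \frac{dF}{d\theta}\right] \tag{18}$$

Consequently: 

$$\mp\,e^{i(n\pm1)\varphi}\,(\sin\theta)^{1\pm n}\,\frac{d}{d(\cos\theta)}\left[(\sin\theta)^{\mp n}F(\theta)\right] = \left[-n\cot\theta \pm \frac{d}{d\theta}\right] e^{i(n\pm1)\varphi}\,F(\theta) = e^{\pm i\varphi}\left[\pm\frac{\partial}{\partial\theta} + i\cot\theta\,\frac{\partial}{\partial\varphi}\right] e^{in\varphi}F(\theta) \tag{19}$$

We recognize expression (1) for the operators $L_+$ and $L_-$; relation (19) is therefore identical 
to (15). 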

Now, to establish formula (16), we can reason by recurrence, since, for $p = 1$, (16) reduces 
to (15), which we have just proved. Let us therefore assume that relation (16) is true for $(p - 1)$: 

$$(L_\pm)^{p-1}\left[e^{in\varphi}F(\theta)\right] = (\mp\hbar)^{p-1}\,e^{i(n\pm p\mp1)\varphi}\,(\sin\theta)^{p-1\pm n}\,\frac{d^{p-1}}{d(\cos\theta)^{p-1}}\left[(\sin\theta)^{\mp n}F(\theta)\right] \tag{20}$$

and let us show that it is then also valid for $p$. To do so, we apply $L_\pm$ to both sides of (20); 
for the right-hand side, we can use formula (15), making the substitutions: 

$$n \;\Longrightarrow\; n \pm p \mp 1$$
$$F(\theta) \;\Longrightarrow\; (\sin\theta)^{p-1\pm n}\,\frac{d^{p-1}}{d(\cos\theta)^{p-1}}\left[(\sin\theta)^{\mp n}F(\theta)\right] \tag{21}$$

We then obtain: 

$$(L_\pm)^p\left[e^{in\varphi}F(\theta)\right] = (\mp\hbar)^p\,e^{i(n\pm p)\varphi}\,(\sin\theta)^{\pm n+p}\,\frac{d}{d(\cos\theta)}\left\{(\sin\theta)^{\mp n-p+1}\,(\sin\theta)^{p-1\pm n}\,\frac{d^{p-1}}{d(\cos\theta)^{p-1}}\left[(\sin\theta)^{\mp n}F(\theta)\right]\right\}$$
$$= (\mp\hbar)^p\,e^{i(n\pm p)\varphi}\,(\sin\theta)^{p\pm n}\,\frac{d^p}{d(\cos\theta)^p}\left[(\sin\theta)^{\mp n}F(\theta)\right] \tag{22}$$

Formula (16) is therefore proven by recurrence. 

β. Calculation of $Y_l^m(\theta, \varphi)$ from $Y_l^l(\theta, \varphi)$ 

As we have already indicated (chapter VI, §D-1-c-α), the spherical harmonics 
$Y_l^m(\theta, \varphi)$ must satisfy: 

$$L_\pm\,Y_l^m(\theta, \varphi) = \hbar\sqrt{l(l+1) - m(m\pm1)}\;Y_l^{m\pm1}(\theta, \varphi) = \hbar\sqrt{(l\mp m)(l\pm m+1)}\;Y_l^{m\pm1}(\theta, \varphi) \tag{23}$$

These relations automatically insure that $Y_l^{m\pm1}$ is normalized if $Y_l^m$ is. Also, they 
fix the relative phases of spherical harmonics corresponding to the same value 
of $l$ and different values of $m$. 

In particular, we can calculate $Y_l^m(\theta, \varphi)$ from $Y_l^l(\theta, \varphi)$ by using the operator $L_-$ 
given by (1) and formula (23). Thus, we shall obtain directly a normalized 
function $Y_l^m(\theta, \varphi)$ whose phase will be determined by the convention used 
for $Y_l^l(\theta, \varphi)$ [formula (14)]. To go from $Y_l^l(\theta, \varphi)$ to $Y_l^m(\theta, \varphi)$, we must apply 
$(l - m)$ times the operator $L_-$; according to (23), we thus obtain: 

$$(L_-)^{l-m}\,Y_l^l(\theta, \varphi) = \hbar^{l-m}\sqrt{(2l)(1) \times (2l-1)(2) \times \cdots \times (l+m+1)(l-m)}\;Y_l^m(\theta, \varphi) \tag{24}$$

that is: 

$$Y_l^m(\theta, \varphi) = \sqrt{\frac{(l+m)!}{(2l)!\,(l-m)!}}\left(\frac{L_-}{\hbar}\right)^{l-m} Y_l^l(\theta, \varphi) \tag{25}$$

Finally, using expression (4) for $Y_l^l(\theta, \varphi)$ [where the coefficient $c_l$ is given 
by (14)] and formula (16) (with $n = l$ and $p = l - m$), we can write (25) explicitly 
in the form: 

$$Y_l^m(\theta, \varphi) = \frac{(-1)^l}{2^l\,l!}\sqrt{\frac{2l+1}{4\pi}\,\frac{(l+m)!}{(l-m)!}}\;e^{im\varphi}\,(\sin\theta)^{-m}\,\frac{d^{l-m}}{d(\cos\theta)^{l-m}}\,(\sin\theta)^{2l} \tag{26}$$

γ. Calculation of $Y_l^m(\theta, \varphi)$ from $Y_l^{-l}(\theta, \varphi)$ 

In order to obtain expression (26), we started with the result of § 1-a. It is, 
of course, just as easy to calculate $Y_l^{-l}(\theta, \varphi)$ first and then use the operator $L_+$. 
The expression thus obtained for $Y_l^m$ is different from (26), although the two are 
completely equivalent. 

Let us therefore calculate $Y_l^{-l}(\theta, \varphi)$ from (26)*. Since: 

$$(\sin\theta)^{2l} = (1 - \cos^2\theta)^l \tag{27}$$

is a polynomial of degree $2l$ in $\cos\theta$, only its highest-order term contributes 
to $Y_l^{-l}(\theta, \varphi)$: 

$$\frac{d^{2l}}{d(\cos\theta)^{2l}}\,(\sin\theta)^{2l} = (-1)^l\,(2l)! \tag{28}$$

We therefore immediately find that: 

$$Y_l^{-l}(\theta, \varphi) = \frac{1}{2^l\,l!}\sqrt{\frac{(2l+1)!}{4\pi}}\;e^{-il\varphi}\,(\sin\theta)^l \tag{29}$$

$Y_l^m(\theta, \varphi)$ can then be obtained by applying $(l + m)$ times the operator $L_+$ 
to $Y_l^{-l}(\theta, \varphi)$. Using (23) and (16), we finally arrive at: 

$$Y_l^m(\theta, \varphi) = \frac{(-1)^{l+m}}{2^l\,l!}\sqrt{\frac{2l+1}{4\pi}\,\frac{(l-m)!}{(l+m)!}}\;e^{im\varphi}\,(\sin\theta)^m\,\frac{d^{l+m}}{d(\cos\theta)^{l+m}}\,(\sin\theta)^{2l} \tag{30}$$

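c. EXPLICIT EXPRESSIONS FOR $l = 0$, $1$ AND $2$ 

General formulas (26) and (30) yield the spherical harmonics for the first 
values of $l$: 

$$Y_0^0 = \frac{1}{\sqrt{4\pi}} \tag{31}$$

$$Y_1^{\pm1}(\theta, \varphi) = \mp\sqrt{\frac{3}{8\pi}}\,\sin\theta\,e^{\pm i\varphi}$$
$$Y_1^0(\theta, \varphi) = \sqrt{\frac{3}{4\pi}}\,\cos\theta \tag{32}$$

$$Y_2^{\pm2}(\theta, \varphi) = \sqrt{\frac{15}{32\pi}}\,\sin^2\theta\,e^{\pm 2i\varphi}$$
$$Y_2^{\pm1}(\theta, \varphi) = \mp\sqrt{\frac{15}{8\pi}}\,\sin\theta\cos\theta\,e^{\pm i\varphi}$$
$$Y_2^0(\theta, \varphi) = \sqrt{\frac{5}{16\pi}}\,(3\cos^2\theta - 1) \tag{33}$$


* We could obviously calculate $Y_l^{-l}$ from the equation: 

$$L_-\,Y_l^{-l}(\theta, \varphi) = 0$$

However, its phase would then remain arbitrary. By using (26), we shall determine $Y_l^{-l}(\theta, \varphi)$ completely, 
and its phase will be a consequence of the convention chosen in § 1-a. 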
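
General formula (26) lends itself to a direct numerical evaluation, which can be compared with the explicit expressions (32) and (33). The sketch below is an added illustration: the function name is hypothetical, the derivative with respect to $\cos\theta$ is taken on the polynomial coefficients of $(1 - u^2)^l$, and the formula is only evaluated away from $\theta = 0$ and $\theta = \pi$, where the factor $(\sin\theta)^{-m}$ would be singular for $m > 0$. 

```python
import cmath
import math

def sph_harm_26(l, m, theta, phi):
    """Evaluate Y_l^m(theta, phi) from formula (26), for 0 < theta < pi."""
    # Coefficients of (1 - u^2)^l in ascending powers of u = cos(theta).
    coeffs = [0.0] * (2 * l + 1)
    for k in range(l + 1):
        coeffs[2 * k] = math.comb(l, k) * (-1) ** k
    # Differentiate (l - m) times with respect to u.
    for _ in range(l - m):
        coeffs = [j * coeffs[j] for j in range(1, len(coeffs))]
    u = math.cos(theta)
    derivative = sum(c * u ** j for j, c in enumerate(coeffs))
    prefactor = ((-1) ** l / (2 ** l * math.factorial(l))) * math.sqrt(
        (2 * l + 1) / (4 * math.pi) * math.factorial(l + m) / math.factorial(l - m)
    )
    return prefactor * cmath.exp(1j * m * phi) * math.sin(theta) ** (-m) * derivative

if __name__ == "__main__":
    theta, phi = 0.7, 1.3
    print(sph_harm_26(1, 0, theta, phi), math.sqrt(3 / (4 * math.pi)) * math.cos(theta))
    print(sph_harm_26(2, 0, theta, phi),
          math.sqrt(5 / (16 * math.pi)) * (3 * math.cos(theta) ** 2 - 1))
```

The same function can be used to check relation $Y_l^{-m} = (-1)^m\,[Y_l^m]^*$ numerically for small $l$, since (26) holds for negative $m$ as well. 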
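2. Properties of spherical harmonics 


a. RECURRENCE RELATIONS 

By their very construction, the spherical harmonics satisfy relations (23); 
that is, using (1): 

$$e^{\pm i\varphi}\left[\pm\frac{\partial}{\partial\theta} - m\cot\theta\right] Y_l^m(\theta, \varphi) = \sqrt{l(l+1) - m(m\pm1)}\;Y_l^{m\pm1}(\theta, \varphi) \tag{34}$$

Also note the following formula, which is often useful: 

$$\cos\theta\;Y_l^m(\theta, \varphi) = \sqrt{\frac{(l+m+1)(l-m+1)}{(2l+1)(2l+3)}}\;Y_{l+1}^m(\theta, \varphi) + \sqrt{\frac{(l+m)(l-m)}{(2l+1)(2l-1)}}\;Y_{l-1}^m(\theta, \varphi) \tag{35}$$

Here is an outline of its proof. According to (25): 

$$\cos\theta\;Y_l^m = \sqrt{\frac{(l+m)!}{(2l)!\,(l-m)!}}\;\cos\theta\left(\frac{L_-}{\hbar}\right)^{l-m} Y_l^l \tag{36}$$

Now, using expression (1) for $L_-$, it is easy to verify that: 

$$[L_-, \cos\theta] = \hbar\,e^{-i\varphi}\sin\theta \tag{37}$$
$$[L_-, e^{-i\varphi}\sin\theta] = 0 \tag{38}$$

Using a recurrence argument, we can then calculate the commutator of $(L_-/\hbar)^k$ and $\cos\theta$, 
since, if we assume that: 

$$\left[\left(\frac{L_-}{\hbar}\right)^{k-1}, \cos\theta\right] = (k-1)\,e^{-i\varphi}\sin\theta\left(\frac{L_-}{\hbar}\right)^{k-2} \tag{39}$$

we obtain: 

$$\left[\left(\frac{L_-}{\hbar}\right)^{k}, \cos\theta\right] = \left(\frac{L_-}{\hbar}\right)^{k-1}\left[\frac{L_-}{\hbar}, \cos\theta\right] + \left[\left(\frac{L_-}{\hbar}\right)^{k-1}, \cos\theta\right]\frac{L_-}{\hbar}$$
$$= e^{-i\varphi}\sin\theta\left(\frac{L_-}{\hbar}\right)^{k-1} + (k-1)\,e^{-i\varphi}\sin\theta\left(\frac{L_-}{\hbar}\right)^{k-1} \tag{40}$$

that is: 

$$\left[\left(\frac{L_-}{\hbar}\right)^{k}, \cos\theta\right] = k\,e^{-i\varphi}\sin\theta\left(\frac{L_-}{\hbar}\right)^{k-1} \tag{41}$$

This relation has therefore been established by recurrence. We can use it to write (36) in the 
form: 

$$\cos\theta\;Y_l^m = \sqrt{\frac{(l+m)!}{(2l)!\,(l-m)!}}\left[\left(\frac{L_-}{\hbar}\right)^{l-m}\cos\theta\;Y_l^l - (l-m)\left(\frac{L_-}{\hbar}\right)^{l-m-1} e^{-i\varphi}\sin\theta\;Y_l^l\right] \tag{42}$$

Using (4) and (14), we can easily show that: 

$$e^{-i\varphi}\sin\theta\;Y_l^l = -\sqrt{\frac{2l+1}{2l}}\;(1 - \cos^2\theta)\,Y_{l-1}^{l-1} \tag{43}$$

If we then calculate the explicit expressions for $Y_{l+1}^l$ and $Y_{l+1}^{l-1}$ from the general expression (26), 
we find that: 

$$\cos\theta\;Y_l^l = \frac{1}{\sqrt{2l+3}}\;Y_{l+1}^l \tag{44-a}$$
$$\cos^2\theta\;Y_{l-1}^{l-1} = \frac{2}{2l+1}\sqrt{\frac{l}{2l+3}}\;Y_{l+1}^{l-1} + \frac{1}{2l+1}\;Y_{l-1}^{l-1} \tag{44-b}$$

Substituting relations (43) and (44) into (42) and using (23), we obtain (35). 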
b. ORTHONORMALIZATION AND CLOSURE RELATIONS 

Because of the way we constructed them, the spherical harmonics constitute 
a set of normalized functions; they are also orthogonal, since they are 
eigenfunctions of the Hermitian operators $\mathbf{L}^2$ and $L_z$ with different eigenvalues. The 
corresponding orthonormalization relation is: 

$$\int_0^{2\pi} d\varphi \int_0^\pi \sin\theta\,d\theta\;Y_l^{m*}(\theta, \varphi)\,Y_{l'}^{m'}(\theta, \varphi) = \delta_{ll'}\,\delta_{mm'} \tag{45}$$

It can be shown (here, we shall assume) that any square-integrable function 
of $\theta$ and $\varphi$ can be expanded in one and only one way on the spherical harmonics: 

$$f(\theta, \varphi) = \sum_{l=0}^{\infty} \sum_{m=-l}^{+l} c_{l,m}\,Y_l^m(\theta, \varphi) \tag{46}$$

with: 

$$c_{l,m} = \int_0^{2\pi} d\varphi \int_0^\pi \sin\theta\,d\theta\;Y_l^{m*}(\theta, \varphi)\,f(\theta, \varphi) \tag{47}$$

The set of spherical harmonics therefore constitutes an orthonormal basis of the 
space of square-integrable functions of $\theta$ and $\varphi$. This can be expressed by the 
closure relation: 

$$\sum_{l=0}^{\infty} \sum_{m=-l}^{+l} Y_l^m(\theta, \varphi)\,Y_l^{m*}(\theta', \varphi') = \delta(\cos\theta - \cos\theta')\,\delta(\varphi - \varphi') \tag{48}$$
c. PARITY 

The parity operation on a function defined in ordinary space (cf. complement 
$F_{II}$) consists of replacing in this function the coordinates of any point in space 
by those of the point symmetrical to it with respect to the origin of the reference 
frame: 

$$\mathbf{r} \;\Longrightarrow\; -\mathbf{r} \tag{49}$$

In spherical coordinates, this operation is expressed by the substitutions (fig. 2 
of chapter VI): 

$$\theta \;\Longrightarrow\; \pi - \theta$$
$$\varphi \;\Longrightarrow\; \pi + \varphi \tag{50}$$

Consequently, if we are using a standard basis for the wave function space of 
a spinless particle (§D-1-d of chapter VI), the radial part of the basis 
functions $\psi_{k,l,m}(\mathbf{r})$ is unchanged by the parity operation. The only transformation 
is that of the spherical harmonics, which we shall now describe. 

First, note that in the substitution of (50): 

$$\sin\theta \;\Longrightarrow\; \sin\theta$$
$$\cos\theta \;\Longrightarrow\; -\cos\theta \tag{51}$$
$$e^{im\varphi} \;\Longrightarrow\; (-1)^m\,e^{im\varphi}$$

Under these conditions, the function $Y_l^l(\theta, \varphi)$ which we calculated in § 1-a is 
transformed into: 

$$Y_l^l(\pi - \theta, \pi + \varphi) = (-1)^l\,Y_l^l(\theta, \varphi) \tag{52}$$

Moreover: 

$$\frac{\partial}{\partial\theta} \;\Longrightarrow\; -\frac{\partial}{\partial\theta}$$
$$\frac{\partial}{\partial\varphi} \;\Longrightarrow\; \frac{\partial}{\partial\varphi} \tag{53}$$

Relations (51) and (53) show that the operators $L_+$ and $L_-$ [formulas (1)] remain 
unchanged [which means that $L_+$ and $L_-$ are even operators, in the sense defined 
in complement $F_{II}$ (§ 2-a)]. Consequently, according to result (52) and formula (25), 
which enables us to calculate $Y_l^m(\theta, \varphi)$: 

$$Y_l^m(\pi - \theta, \pi + \varphi) = (-1)^l\,Y_l^m(\theta, \varphi) \tag{54}$$

The spherical harmonics are therefore functions whose parity is well-defined 
and independent of $m$: they are even for $l$ even and odd for $l$ odd. 

d. COMPLEX CONJUGATION 


Because of their $\varphi$-dependence, the spherical harmonics are complex-valued 
functions. It can be seen directly, by comparing (26) and (30), that: 

$$\left[Y_l^m(\theta, \varphi)\right]^* = (-1)^m\,Y_l^{-m}(\theta, \varphi) \tag{55}$$
e. RELATION BETWEEN THE SPHERICAL HARMONICS 
AND THE LEGENDRE POLYNOMIALS AND ASSOCIATED LEGENDRE FUNCTIONS 

The $\theta$-dependence of the spherical harmonics resides in functions known 
as Legendre polynomials and associated Legendre functions. We shall neither 
prove nor even enumerate all the properties of these functions, but shall simply 
indicate their relation with the spherical harmonics. 


α. $Y_l^0(\theta)$ is proportional to a Legendre polynomial 

For $m = 0$, formulas (26) and (30) yield: 

$$Y_l^0(\theta) = \frac{(-1)^l}{2^l\,l!}\sqrt{\frac{2l+1}{4\pi}}\;\frac{d^l}{d(\cos\theta)^l}\,(\sin\theta)^{2l} \tag{56}$$

which can be written in the form: 

$$Y_l^0(\theta) = \sqrt{\frac{2l+1}{4\pi}}\;P_l(\cos\theta) \tag{57}$$

setting: 

$$P_l(u) = \frac{(-1)^l}{2^l\,l!}\,\frac{d^l}{du^l}\,(1 - u^2)^l \tag{58}$$

According to its definition (58), $P_l(u)$ is an $l$th order polynomial in $u$ 
of parity $(-1)^l$*: 

$$P_l(-u) = (-1)^l\,P_l(u) \tag{59}$$

$P_l(u)$ is the $l$th order Legendre polynomial. It is easy to show that it has $l$ zeros 
in the interval $[-1, +1]$, and that the numerical coefficient in (58) insures that: 

$$P_l(1) = 1 \tag{60}$$

It can also be proven that the Legendre polynomials form a set of orthogonal 
functions: 

$$\int_{-1}^{+1} du\;P_l(u)\,P_{l'}(u) = \int_0^\pi \sin\theta\,d\theta\;P_l(\cos\theta)\,P_{l'}(\cos\theta) = \frac{2}{2l+1}\,\delta_{ll'} \tag{61}$$

on which can be expanded functions of $\theta$ alone: 

$$f(\theta) = \sum_l c_l\,P_l(\cos\theta) \tag{62}$$

with: 

$$c_l = \frac{2l+1}{2} \int_0^\pi \sin\theta\,d\theta\;P_l(\cos\theta)\,f(\theta) \tag{63}$$

* Parity with respect to the variable $u$. Note, however, that the parity operation in space 
[formulas (50)] amounts to changing $\cos\theta$ to $-\cos\theta$; property (59) can be expressed by: 

$$Y_l^0(\pi - \theta) = (-1)^l\,Y_l^0(\theta)$$

which is a special case of (54). 

COMMENT: 

According to (57) and (60): 

$$Y_l^0(0) = \sqrt{\frac{2l+1}{4\pi}} \tag{64}$$

As we pointed out in § 1-a, the phase convention chosen for $Y_l^l(\theta, \varphi)$ gives 
a real positive value to $Y_l^0(0)$. 

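β. $Y_l^m(\theta, \varphi)$ is proportional to an associated Legendre function

For $m$ positive, $Y_l^m(\theta, \varphi)$ can be obtained by applying $L_+$ to $Y_l^0(\theta)$; using (23):

$$Y_l^m(\theta, \varphi) = \sqrt{\frac{(l-m)!}{(l+m)!}} \left( \frac{L_+}{\hbar} \right)^m Y_l^0(\theta) \qquad (m \geq 0) \tag{65}$$

Using formulas (1) and (16), we then find:

$$Y_l^m(\theta, \varphi) = (-1)^m \sqrt{\frac{2l+1}{4\pi}\, \frac{(l-m)!}{(l+m)!}}\; P_l^m(\cos\theta)\, e^{im\varphi} \qquad (m \geq 0) \tag{66}$$

where $P_l^m$ is an associated Legendre function, defined by:

$$P_l^m(u) = \sqrt{(1 - u^2)^m}\; \frac{d^m}{du^m}\, P_l(u) \qquad (-1 \leq u \leq +1) \tag{67}$$

$P_l^m(u)$ is the product of $\sqrt{(1 - u^2)^m}$ and a polynomial of degree $(l - m)$ and
parity $(-1)^{l-m}$; $P_l^0(u)$ is the Legendre polynomial $P_l(u)$. The set of $P_l^m$ for fixed $m$
constitutes an orthogonal system of functions:

$$\int_{-1}^{+1} du\, P_l^m(u)\, P_{l'}^m(u) = \int_0^{\pi} \sin\theta\, d\theta\, P_l^m(\cos\theta)\, P_{l'}^m(\cos\theta) = \frac{2}{2l+1}\, \frac{(l+m)!}{(l-m)!}\, \delta_{ll'} \tag{68}$$

on which can be expanded functions of $\theta$ alone.

Formula (66) is valid for $m$ positive (or zero); for negative $m$, it suffices to use
relation (55) to obtain:

$$Y_l^m(\theta, \varphi) = \sqrt{\frac{2l+1}{4\pi}\, \frac{(l+m)!}{(l-m)!}}\; P_l^{-m}(\cos\theta)\, e^{im\varphi} \qquad (m < 0) \tag{69}$$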
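As an illustration (a worked case added here, not taken from the original text), formulas (57), (66), (67) and (69) can be written out for $l = 1$. With $P_1(u) = u$ and $P_1^1(u) = \sqrt{1 - u^2}$, so that $P_1^1(\cos\theta) = \sin\theta$ for $0 \leq \theta \leq \pi$, one finds:

```latex
\begin{aligned}
Y_1^{0}(\theta)           &= \sqrt{\frac{3}{4\pi}}\, \cos\theta \\
Y_1^{1}(\theta, \varphi)  &= -\sqrt{\frac{3}{4\pi} \cdot \frac{0!}{2!}}\; \sin\theta\, e^{i\varphi}
                           = -\sqrt{\frac{3}{8\pi}}\, \sin\theta\, e^{i\varphi} \\
Y_1^{-1}(\theta, \varphi) &= \sqrt{\frac{3}{4\pi} \cdot \frac{0!}{2!}}\; \sin\theta\, e^{-i\varphi}
                           = \sqrt{\frac{3}{8\pi}}\, \sin\theta\, e^{-i\varphi}
\end{aligned}
```

These three functions satisfy relation (55): $[Y_1^{1}]^* = -Y_1^{-1}$ and $[Y_1^{0}]^* = Y_1^{0}$.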




γ. Spherical harmonic addition theorem

Consider two arbitrary directions in space, $Ou'$ and $Ou''$, defined respectively
by the polar angles $(\theta', \varphi')$ and $(\theta'', \varphi'')$, and call the angle between them $\alpha$. The
following relation can be proven:

$$\frac{2l+1}{4\pi}\, P_l(\cos\alpha) = \sum_{m=-l}^{+l} (-1)^m\, Y_l^m(\theta', \varphi')\, Y_l^{-m}(\theta'', \varphi'') \tag{70}$$

(where $P_l$ is the $l$th-order Legendre polynomial). It is known as the “spherical
harmonics addition theorem”.


We shall indicate the main steps of an elementary proof of relation (70). First of all,
note that, if $\cos\alpha$ is expressed in terms of the polar angles $(\theta', \varphi')$ and $(\theta'', \varphi'')$, the left-hand
side of (70) can be considered to be a function of $\theta'$ and $\varphi'$; it can therefore be expanded on the
spherical harmonics $Y_{l'}^{m'}(\theta', \varphi')$. The coefficients of this expansion, which are, of course,
functions of the other two variables, $\theta''$ and $\varphi''$, can also be expanded on the spherical harmonics
$Y_{l''}^{m''}(\theta'', \varphi'')$. We must therefore have:

$$\frac{2l+1}{4\pi}\, P_l(\cos\alpha) = \sum_{l', m'} \sum_{l'', m''} c_{l', m';\, l'', m''}\; Y_{l'}^{m'}(\theta', \varphi')\, Y_{l''}^{m''}(\theta'', \varphi'') \tag{71}$$

where the problem is to calculate the coefficients $c_{l', m';\, l'', m''}$. They can be obtained by the
following process:

(i) in the first place, these coefficients are different from zero only for:

$$l' = l'' = l \tag{72}$$

To show this, first fix the direction $Ou''$; $P_l(\cos\alpha)$ then depends only on $\theta'$ and $\varphi'$. If the $Oz$ axis
is chosen along $Ou''$, $\cos\alpha = \cos\theta'$ and $P_l(\cos\alpha)$ is proportional to $Y_l^0(\theta')$ [cf. relation (57)].
To generalize to the case in which the direction of $Ou''$ is arbitrary, we perform a rotation which
takes $Oz$ onto this direction: $\cos\alpha$ remains unchanged, as does $P_l(\cos\alpha)$. Since the rotation
operators (complement B_VI, §3-c-γ) commute with $\mathbf{L}^2$, the transform of $Y_l^0(\theta')$ remains an
eigenfunction of $\mathbf{L}^2$ with the eigenvalue $l(l+1)\hbar^2$, that is, a linear combination of the spherical
harmonics $Y_l^{m'}(\theta', \varphi')$; we therefore have $l' = l$. Similarly, it can be established that $l'' = l$.

(ii) under a rotation of both directions $Ou'$ and $Ou''$ through an angle $\beta$ about $Oz$, the
angle $\alpha$ is not changed, and neither are $\theta'$ and $\theta''$, while $\varphi'$ and $\varphi''$ become $\varphi' + \beta$ and $\varphi'' + \beta$.
The left-hand side of (71) therefore does not change in value, and each term of the right-hand
side is multiplied by $e^{i(m' + m'')\beta}$. Consequently, the only non-zero coefficients in the sum of the
right-hand side are those which satisfy:

$$m' + m'' = 0 \tag{73}$$

(iii) combining results (72) and (73), we see that formula (71) can be written in the form:

$$\frac{2l+1}{4\pi}\, P_l(\cos\alpha) = \sum_{m=-l}^{+l} (-1)^m\, c_m\, Y_l^m(\theta', \varphi')\, Y_l^{-m}(\theta'', \varphi'') \tag{74}$$

If we set $\theta' = \theta''$ and $\varphi' = \varphi''$, we obtain, according to (60):

$$\frac{2l+1}{4\pi} = \sum_{m=-l}^{+l} (-1)^m\, c_m\, Y_l^m(\theta', \varphi')\, Y_l^{-m}(\theta', \varphi') \tag{75}$$

Since $(-1)^m\, Y_l^{-m}$ is simply $Y_l^{m*}$, the integration of (75) with respect to $d\Omega' = \sin\theta'\, d\theta'\, d\varphi'$
yields, with the orthonormalization equation (45):

$$2l + 1 = \sum_{m=-l}^{+l} c_m \tag{76}$$

We now take the square of the modulus of both sides of (74) and integrate over $d\Omega'$
and $d\Omega''$. Using relation (45), it is easy to see that the right-hand side yields $\sum_{m=-l}^{+l} |c_m|^2$.
As far as the left-hand side is concerned, we can again take advantage of the invariance of
the angle $\alpha$ with respect to a rotation in order to show that $\int d\Omega'\, |P_l(\cos\alpha)|^2$ is actually
independent of $(\theta'', \varphi'')$. If we then choose $Ou''$ along $Oz$ to evaluate this integral, we find,
according to relation (61):

$$\int d\Omega'\, |P_l(\cos\alpha)|^2 = \int d\Omega\, |P_l(\cos\theta)|^2 = 2\pi \times \frac{2}{2l+1} \tag{77}$$

Integrating over $d\Omega''$, we find a second relation between the coefficients $c_m$:

$$2l + 1 = \sum_{m=-l}^{+l} |c_m|^2 \tag{78}$$

(iv) equations (76) and (78) suffice for the determination of the $(2l+1)$ coefficients $c_m$:
they are all equal to 1. To show this, consider, in a normed $(2l+1)$-dimensional vector space, the
vector $X$ of components $x_m = c_m / \sqrt{2l+1}$ and the vector $Y$ of components $y_m = 1 / \sqrt{2l+1}$.
The Schwarz inequality indicates that:

$$(X^* \cdot X)(Y^* \cdot Y) \geq |Y^* \cdot X|^2 \tag{79}$$

where there is equality if and only if $X$ and $Y$ are proportional. Now, (76) and (78) show that
this is the case (they give $X^* \cdot X = Y^* \cdot Y = Y^* \cdot X = 1$): $x_m$ and $c_m$ are therefore
independent of $m$, as is $y_m$, and we have, necessarily, $c_m = 1$. This concludes the proof of formula (70).
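Relation (70) can also be tested numerically. The sketch below is an illustration added here, under the following assumptions: the spherical harmonics are built from (66), (67) and (69), the Legendre polynomials from (58), and the function names and the sample angles are arbitrary choices. Both sides of (70) are returned so that they can be compared.

```python
import cmath
import math

def legendre_coeffs(l):
    """Coefficients of P_l(u) in increasing powers of u, from formula (58)."""
    coeffs = [0.0] * (l + 1)
    for k in range(l + 1):
        power = 2 * k - l
        if power >= 0:
            coeffs[power] += ((-1) ** (l + k) * math.comb(l, k)
                              * math.factorial(2 * k) / math.factorial(power)
                              / (2 ** l * math.factorial(l)))
    return coeffs

def assoc_legendre(l, m, u):
    """Associated Legendre function P_l^m(u) for 0 <= m <= l, formula (67)."""
    coeffs = legendre_coeffs(l)
    for _ in range(m):  # differentiate the polynomial m times
        coeffs = [n * a for n, a in enumerate(coeffs)][1:]
    poly = sum(a * u ** n for n, a in enumerate(coeffs))
    return (1.0 - u * u) ** (m / 2) * poly

def sph_harm(l, m, theta, phi):
    """Y_l^m from (66) for m >= 0 and from (69) for m < 0."""
    am = abs(m)
    norm = math.sqrt((2 * l + 1) / (4 * math.pi)
                     * math.factorial(l - am) / math.factorial(l + am))
    sign = (-1) ** m if m >= 0 else 1
    return sign * norm * assoc_legendre(l, am, math.cos(theta)) * cmath.exp(1j * m * phi)

def addition_theorem_sides(l, th1, ph1, th2, ph2):
    """Left-hand and right-hand sides of (70) for the two given directions."""
    cos_alpha = (math.cos(th1) * math.cos(th2)
                 + math.sin(th1) * math.sin(th2) * math.cos(ph1 - ph2))
    lhs = (2 * l + 1) / (4 * math.pi) * assoc_legendre(l, 0, cos_alpha)
    rhs = sum((-1) ** m * sph_harm(l, m, th1, ph1) * sph_harm(l, -m, th2, ph2)
              for m in range(-l, l + 1))
    return lhs, rhs
```

A call such as `addition_theorem_sides(3, 0.4, 1.1, 2.0, -0.7)` returns two numbers that agree to rounding error, the right-hand side having a negligible imaginary part.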


References : 

Messiah (1.17), App. B, §IV; Arfken (10.4), chap. 12; Edmonds (2.21), Table 1;
Butkov (10.8), chap. 9, §§5 and 8; Whittaker and Watson (10.12), chap. XV;
Bateman (10.39), chap. III; Bass (10.1), vol. I, §17-7.

Complement B_VI

ANGULAR MOMENTUM AND ROTATIONS


1. Introduction 

2. Brief study of geometrical rotations $\mathcal{R}$

a. Definition. Parametrization

b. Infinitesimal rotations 

3. Rotation operators in state space. Example of a spinless particle 

a. Existence and definition of rotation operators 

b. Properties of rotation operators R 

c. Expression for rotation operators in terms of angular momentum observables 

4. Rotation operators in the state space of an arbitrary system 

a. System of several spinless particles 

b. Arbitrary system 

5. Rotation of observables 

a. General transformation law 

b. Scalar observables 

c. Vector observables 

6. Rotation invariance 

a. Invariance of physical laws 

b. Consequence: conservation of angular momentum 

c. Applications 


1. Introduction 

We indicated in chapter VI (§B-2) that the commutation relations between
the components of an angular momentum are actually the expression of the geometrical
properties of rotations in ordinary three-dimensional space. This is what we
intend to show in this complement, where we investigate the relation between
rotations and angular momentum operators.

Consider a physical system (S) whose quantum mechanical state, at a given
time, is characterized by the ket $|\psi\rangle$ of the state space $\mathcal{E}$. We perform a rotation
$\mathcal{R}$ on this system; in this new position, the state of the system is described
by a ket $|\psi'\rangle$ which is different from $|\psi\rangle$. Given the geometrical transformation $\mathcal{R}$,
the problem is to determine $|\psi'\rangle$ from $|\psi\rangle$. We shall see that it has the following
solution: with every geometrical rotation $\mathcal{R}$ can be associated a linear operator $R$
acting in the state space $\mathcal{E}$ such that:

$$|\psi'\rangle = R\, |\psi\rangle \tag{1}$$

Let us immediately stress the necessity of distinguishing between the geometrical
rotation $\mathcal{R}$, which operates in ordinary space, and its “image” $R$, which acts in the
state space:

$$\mathcal{R} \Longrightarrow R \tag{2}$$

We shall begin (§2) by reviewing the principal properties of geometrical
rotations $\mathcal{R}$. We shall not embark upon a detailed study of them; rather, we
shall simply note some results which will be useful to us later. Then, in §3, we shall
use the example of a spinless particle to define the rotation operators $R$ precisely,
to study their most important properties, and to determine their relation to the
angular momentum operators $\mathbf{L}$. We shall then be able to interpret the commutation
relations amongst the components of the angular momentum $\mathbf{L}$ as the
image, in the space $\mathcal{E}_{\mathbf{r}}$, of purely geometrical characteristics of rotations $\mathcal{R}$. We shall
then generalize (§4) these concepts to arbitrary quantum mechanical systems.
In §5, we shall examine the behavior of the observables describing the physical
quantities which are measurable in this system under a rotation of the system.
This will lead us to classify observables according to how they transform under
a rotation (scalar, vector, tensor observables). Finally, in §6, we shall briefly consider
the problem of rotation invariance and indicate some important consequences of
this invariance.


2. Brief study of geometrical rotations $\mathcal{R}$

a. DEFINITION. PARAMETRIZATION

A rotation $\mathcal{R}$ is a one-to-one transformation of three-dimensional space
which conserves a point of this space, the angles and the distances, as well as the
handedness of the reference frames*. We shall be concerned here with the set of
rotations which conserve a given point $O$, which we shall choose as the origin of the
reference frame. A rotation can then be characterized by the axis of rotation (given
by its unit vector $\mathbf{u}$ or its polar angles $\theta$ and $\varphi$) and the angle of rotation $\alpha$ ($0 \leq \alpha < 2\pi$).
To determine a rotation, three parameters are required; they can be chosen to be
the components of the vector:

$$\boldsymbol{\alpha} = \alpha\, \mathbf{u} \tag{3}$$

whose absolute value is equal to the angle of rotation and whose direction defines
the axis of rotation. Note that a rotation can also be characterized by three
angles, called Euler angles. We shall denote by $\mathcal{R}_{\mathbf{u}}(\alpha)$ the geometrical rotation
through an angle $\alpha$ about the axis defined by the unit vector $\mathbf{u}$.

The set of rotations constitutes a group: the product of two rotations
(that is, the transformation resulting from the successive application of these two
rotations) is also a rotation; there exists an identity rotation (rotation through a zero
angle about an arbitrary axis); for every rotation $\mathcal{R}_{\mathbf{u}}(\alpha)$ there is an inverse rotation,
$\mathcal{R}_{-\mathbf{u}}(\alpha)$. The group of rotations is not commutative: in general, the product
of two rotations depends on the order in which they are performed**:

$$\mathcal{R}_{\mathbf{u}}(\alpha)\, \mathcal{R}_{\mathbf{u}'}(\alpha') \neq \mathcal{R}_{\mathbf{u}'}(\alpha')\, \mathcal{R}_{\mathbf{u}}(\alpha) \tag{4}$$

Recall, however, that two rotations performed about the same axis always commute:

$$\mathcal{R}_{\mathbf{u}}(\alpha)\, \mathcal{R}_{\mathbf{u}}(\alpha') = \mathcal{R}_{\mathbf{u}}(\alpha')\, \mathcal{R}_{\mathbf{u}}(\alpha) = \mathcal{R}_{\mathbf{u}}(\alpha + \alpha') \tag{5}$$

(if necessary, $2\pi$ is subtracted from $\alpha + \alpha'$, to keep it within the interval $[0, 2\pi]$).

* This last property is imposed in order to exclude reflections with respect to a point or a plane.

** When one writes $\mathcal{R}_2 \mathcal{R}_1$, this means that rotation $\mathcal{R}_1$ must be performed first, $\mathcal{R}_2$ being applied
subsequently to the result obtained.
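Relations (4) and (5) are easy to observe on explicit matrices. The following sketch is an illustration added here: it represents $\mathcal{R}_{\mathbf{u}}(\alpha)$ by the standard Rodrigues formula, which the text does not use, and the helper names are arbitrary.

```python
import math

def rotation(u, alpha):
    """3x3 matrix of the rotation through alpha about the unit vector u (Rodrigues formula)."""
    ux, uy, uz = u
    c, s = math.cos(alpha), math.sin(alpha)
    cross = [[0, -uz, uy], [uz, 0, -ux], [-uy, ux, 0]]  # matrix of v -> u x v
    return [[c * (i == j) + s * cross[i][j] + (1 - c) * u[i] * u[j]
             for j in range(3)] for i in range(3)]

def matmul(a, b):
    """Product a b of two 3x3 matrices: b acts first, then a."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def max_diff(a, b):
    """Largest absolute difference between corresponding matrix elements."""
    return max(abs(a[i][j] - b[i][j]) for i in range(3) for j in range(3))

ex, ey = (1.0, 0.0, 0.0), (0.0, 1.0, 0.0)
u = (1 / 3, 2 / 3, 2 / 3)

# Different axes: the order matters, relation (4).
different_axes = max_diff(matmul(rotation(ex, 0.7), rotation(ey, 0.4)),
                          matmul(rotation(ey, 0.4), rotation(ex, 0.7)))

# Same axis: the rotations commute and the angles add, relation (5).
same_axis = max_diff(matmul(rotation(u, 0.3), rotation(u, 0.5)), rotation(u, 0.8))
```

Here `different_axes` is of order 0.25, while `same_axis` vanishes to rounding error.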

b. INFINITESIMAL ROTATIONS 

An infinitesimal rotation is defined as a rotation which is infinitesimally
close to the identity rotation, that is, a rotation $\mathcal{R}_{\mathbf{u}}(d\alpha)$ through an infinitesimal
angle $d\alpha$ about an arbitrary axis $\mathbf{u}$. It is easy to see that the transform of a vector $\mathbf{OM}$
under the infinitesimal rotation $\mathcal{R}_{\mathbf{u}}(d\alpha)$ can be written, to first order in $d\alpha$:

$$\mathcal{R}_{\mathbf{u}}(d\alpha)\, \mathbf{OM} = \mathbf{OM} + d\alpha\, \mathbf{u} \times \mathbf{OM} \tag{6}$$

Every finite rotation can be decomposed into an infinite number of infinitesimal
rotations, since the angle of rotation can vary continuously and since, according
to (5):

$$\mathcal{R}_{\mathbf{u}}(\alpha + d\alpha) = \mathcal{R}_{\mathbf{u}}(\alpha)\, \mathcal{R}_{\mathbf{u}}(d\alpha) = \mathcal{R}_{\mathbf{u}}(d\alpha)\, \mathcal{R}_{\mathbf{u}}(\alpha) \tag{7}$$

where $\mathcal{R}_{\mathbf{u}}(d\alpha)$ is an infinitesimal rotation. Thus, the study of the rotation group
can be reduced to an examination of infinitesimal rotations*.

Before ending this rapid survey of the properties of geometrical rotations,
we note the following relation which will be useful to us later:

$$\mathcal{R}_{\mathbf{e}_y}(-d\alpha')\, \mathcal{R}_{\mathbf{e}_x}(d\alpha)\, \mathcal{R}_{\mathbf{e}_y}(d\alpha')\, \mathcal{R}_{\mathbf{e}_x}(-d\alpha) = \mathcal{R}_{\mathbf{e}_z}(d\alpha\, d\alpha') \tag{8}$$

where $\mathbf{e}_x$, $\mathbf{e}_y$ and $\mathbf{e}_z$ denote the unit vectors of the three coordinate axes $Ox$, $Oy$
and $Oz$ respectively. If $d\alpha$ and $d\alpha'$ are first-order infinitesimal angles, this relation is
correct to the second order. It describes, in a special case, the non-commutative
structure of the rotation group.

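To prove relation (8), let us apply its left-hand side to an arbitrary vector $\mathbf{OM}$. We use formula (6)
to find the vector $\mathbf{OM}'$, the transform of $\mathbf{OM}$ under the successive action of the four
infinitesimal rotations. It can be seen immediately that if $d\alpha$ is zero, the left-hand side of (8)
reduces to the product $\mathcal{R}_{\mathbf{e}_y}(-d\alpha')\, \mathcal{R}_{\mathbf{e}_y}(d\alpha')$, which is equal to the identity rotation [see (5)];
the vector $\mathbf{OM}' - \mathbf{OM}$ must therefore be proportional to $d\alpha$. For an analogous reason, it
must also be proportional to $d\alpha'$. Consequently, the difference $\mathbf{OM}' - \mathbf{OM}$ is proportional
to $d\alpha\, d\alpha'$.

Therefore, to calculate $\mathbf{OM}'$ to second order, we may restrict ourselves to first order in
each of the two infinitesimal angles $d\alpha$ and $d\alpha'$. First of all, according to (6):

$$\mathcal{R}_{\mathbf{e}_x}(-d\alpha)\, \mathbf{OM} = \mathbf{OM} - d\alpha\, \mathbf{e}_x \times \mathbf{OM} \tag{9}$$

We must then apply $\mathcal{R}_{\mathbf{e}_y}(d\alpha')$ to this vector; this can be done by again using (6):

$$\begin{aligned}
\mathcal{R}_{\mathbf{e}_y}(d\alpha')\, \mathcal{R}_{\mathbf{e}_x}(-d\alpha)\, \mathbf{OM}
&= (\mathbf{OM} - d\alpha\, \mathbf{e}_x \times \mathbf{OM}) + d\alpha'\, \mathbf{e}_y \times (\mathbf{OM} - d\alpha\, \mathbf{e}_x \times \mathbf{OM}) \\
&= \mathbf{OM} - d\alpha\, \mathbf{e}_x \times \mathbf{OM} + d\alpha'\, \mathbf{e}_y \times \mathbf{OM} - d\alpha\, d\alpha'\, \mathbf{e}_y \times (\mathbf{e}_x \times \mathbf{OM})
\end{aligned} \tag{10}$$

The action of $\mathcal{R}_{\mathbf{e}_x}(d\alpha)$ on the vector appearing on the right-hand side of (10) results in the
addition of the following infinitesimal terms to this vector:

$$d\alpha\, \mathbf{e}_x \times \mathbf{OM} + d\alpha\, d\alpha'\, \mathbf{e}_x \times (\mathbf{e}_y \times \mathbf{OM}) \tag{11}$$

obtained by the vector multiplication of the right-hand side of (10) by $d\alpha\, \mathbf{e}_x$, with only the first-order
terms in $d\alpha$ being retained. Therefore:

$$\mathcal{R}_{\mathbf{e}_x}(d\alpha)\, \mathcal{R}_{\mathbf{e}_y}(d\alpha')\, \mathcal{R}_{\mathbf{e}_x}(-d\alpha)\, \mathbf{OM}
= \mathbf{OM} + d\alpha'\, \mathbf{e}_y \times \mathbf{OM} + d\alpha\, d\alpha'\, [\mathbf{e}_x \times (\mathbf{e}_y \times \mathbf{OM}) - \mathbf{e}_y \times (\mathbf{e}_x \times \mathbf{OM})] \tag{12}$$

Finally, $\mathbf{OM}'$ is equal to the sum of the vector just obtained and its vector product
by $-d\alpha'\, \mathbf{e}_y$. To first order in $d\alpha'$, this vector product can be written simply:

$$-d\alpha'\, \mathbf{e}_y \times \mathbf{OM} \tag{13}$$

which means that:

$$\mathcal{R}_{\mathbf{e}_y}(-d\alpha')\, \mathcal{R}_{\mathbf{e}_x}(d\alpha)\, \mathcal{R}_{\mathbf{e}_y}(d\alpha')\, \mathcal{R}_{\mathbf{e}_x}(-d\alpha)\, \mathbf{OM}
= \mathbf{OM} + d\alpha\, d\alpha'\, [\mathbf{e}_x \times (\mathbf{e}_y \times \mathbf{OM}) - \mathbf{e}_y \times (\mathbf{e}_x \times \mathbf{OM})] \tag{14}$$

It is then easy to transform the double vector products; we find:

$$\mathcal{R}_{\mathbf{e}_y}(-d\alpha')\, \mathcal{R}_{\mathbf{e}_x}(d\alpha)\, \mathcal{R}_{\mathbf{e}_y}(d\alpha')\, \mathcal{R}_{\mathbf{e}_x}(-d\alpha)\, \mathbf{OM}
= \mathbf{OM} + d\alpha\, d\alpha'\, \mathbf{e}_z \times \mathbf{OM} = \mathcal{R}_{\mathbf{e}_z}(d\alpha\, d\alpha')\, \mathbf{OM} \tag{15}$$

Since this relation is true for any vector $\mathbf{OM}$, expression (8) is verified.

* However, limiting ourselves to infinitesimal rotations, we lose sight of a “global” property
of the finite rotation group: the fact that a rotation through an angle of $2\pi$ is the identity transformation.
The rotation operators (see §3) constructed from infinitesimal operators do not always have this global
property. In certain cases (see complement A_IX), the operator associated with a $2\pi$ rotation is not the
unit operator but its opposite.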
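Relation (8) can be checked with finite but small angles. The sketch below is an illustration added here; it uses exact rotation matrices built from the Rodrigues formula (an assumption of this example, not a construction used in the text) and compares the product of the four rotations with $\mathcal{R}_{\mathbf{e}_z}(d\alpha\, d\alpha')$.

```python
import math

def rotation(u, alpha):
    """3x3 matrix of the rotation through alpha about the unit vector u (Rodrigues formula)."""
    ux, uy, uz = u
    c, s = math.cos(alpha), math.sin(alpha)
    cross = [[0, -uz, uy], [uz, 0, -ux], [-uy, ux, 0]]  # matrix of v -> u x v
    return [[c * (i == j) + s * cross[i][j] + (1 - c) * u[i] * u[j]
             for j in range(3)] for i in range(3)]

def matmul(a, b):
    """Product a b of two 3x3 matrices: b acts first, then a."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def max_diff(a, b):
    """Largest absolute difference between corresponding matrix elements."""
    return max(abs(a[i][j] - b[i][j]) for i in range(3) for j in range(3))

EX, EY, EZ = (1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0)
IDENTITY = rotation(EZ, 0.0)

def four_rotations(da, dap):
    """Left-hand side of (8): R_ey(-da') R_ex(da) R_ey(da') R_ex(-da)."""
    product = rotation(EX, -da)
    for axis, angle in ((EY, dap), (EX, da), (EY, -dap)):
        product = matmul(rotation(axis, angle), product)
    return product

def residual(da, dap):
    """Distance between the two sides of (8) for the given angles."""
    return max_diff(four_rotations(da, dap), rotation(EZ, da * dap))
```

With both angles equal to $10^{-2}$, the product differs from the identity by about $10^{-4}$ (second order), while `residual` is of order $10^{-6}$ (third order), in agreement with the statement that (8) is correct to the second order.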


3. Rotation operators in state space. 

Example: a spinless particle 

In this section, we consider a physical system composed of a single 
(spinless) particle in three-dimensional space. 


a. EXISTENCE AND DEFINITION OF ROTATION OPERATORS 

At a given time, the quantum mechanical state of the particle is characterized,
in the state space $\mathcal{E}_{\mathbf{r}}$, by the ket $|\psi\rangle$ with which is associated the wave
function $\psi(\mathbf{r}) = \langle \mathbf{r} | \psi \rangle$. Let us perform a rotation $\mathcal{R}$ on this system which
associates with the point $\mathbf{r}_0\, (x_0, y_0, z_0)$ of space the point $\mathbf{r}'_0\, (x'_0, y'_0, z'_0)$ such that:

$$\mathbf{r}'_0 = \mathcal{R}\, \mathbf{r}_0 \tag{16}$$

Let $|\psi'\rangle$ be the state vector of the system after rotation, and $\psi'(\mathbf{r}) = \langle \mathbf{r} | \psi' \rangle$,
the corresponding wave function. It is natural to assume that the value of the
initial wave function $\psi(\mathbf{r})$ at the point $\mathbf{r}_0$ will be found, after rotation, to be the
value of the final wave function $\psi'(\mathbf{r})$ at the point $\mathbf{r}'_0$ given by (16):

$$\psi'(\mathbf{r}'_0) = \psi(\mathbf{r}_0) \tag{17}$$

that is:

$$\psi'(\mathbf{r}'_0) = \psi(\mathcal{R}^{-1}\, \mathbf{r}'_0) \tag{18}$$

Since this equation is valid for any point $\mathbf{r}'_0$ in space, it can be written in the form:

$$\psi'(\mathbf{r}) = \psi(\mathcal{R}^{-1}\, \mathbf{r}) \tag{19}$$

By definition, the operator $R$ in the state space $\mathcal{E}_{\mathbf{r}}$ associated with the
geometrical rotation $\mathcal{R}$ being treated is the one which acts on the state $|\psi\rangle$ before
rotation to yield the state $|\psi'\rangle$ after the rotation $\mathcal{R}$:

$$|\psi'\rangle = R\, |\psi\rangle \tag{20}$$

$R$ is called a “rotation operator”. Relation (19) characterizes its action in the $\{ |\mathbf{r}\rangle \}$
representation:

$$\langle \mathbf{r} | R | \psi \rangle = \langle \mathcal{R}^{-1}\mathbf{r} | \psi \rangle \tag{21}$$

where $|\mathcal{R}^{-1}\mathbf{r}\rangle$ is the basis ket of this representation determined by the components
of the vector $\mathcal{R}^{-1}\mathbf{r}$.

COMMENT:

If the state of the particle after rotation were $e^{i\theta} |\psi'\rangle$ (where $\theta$ is an arbitrary real
number) instead of $|\psi'\rangle$, its physical properties would not be modified. In other words,
relation (17) could be replaced by:

$$\psi'(\mathbf{r}'_0) = e^{i\theta}\, \psi(\mathbf{r}_0) \tag{22}$$

$\theta$ would obviously be independent of $\mathbf{r}_0$ but could depend on the rotation $\mathcal{R}$. We shall
not treat this difficulty here.
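The content of relations (17) and (19) can be made concrete with a short sketch, added here as an illustration. The choice of a rotation about $Oz$, the Gaussian test function and the names `rotate_z` and `rotated_wavefunction` are assumptions of this example.

```python
import math

def rotate_z(r, alpha):
    """Image of the point r = (x, y, z) under the rotation through alpha about Oz."""
    x, y, z = r
    c, s = math.cos(alpha), math.sin(alpha)
    return (c * x - s * y, s * x + c * y, z)

def rotated_wavefunction(psi, alpha):
    """Return psi' such that psi'(r) = psi(R^{-1} r), relation (19)."""
    # The inverse of a rotation through alpha about Oz is the rotation through -alpha.
    return lambda r: psi(rotate_z(r, -alpha))

# Test wave function: a Gaussian centered at r0 (not normalized).
r0 = (1.0, 0.0, 0.5)

def psi(r):
    return math.exp(-sum((a - b) ** 2 for a, b in zip(r, r0)))

alpha = 0.9
psi_prime = rotated_wavefunction(psi, alpha)

# Relation (17): the value of psi at r0 is found again in psi' at the rotated point.
value_at_rotated_point = psi_prime(rotate_z(r0, alpha))
```

The maximum of the rotated wave function sits at $\mathcal{R}\, \mathbf{r}_0$: `value_at_rotated_point` equals `psi(r0)` to rounding error.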


b. PROPERTIES OF ROTATION OPERATORS R 

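α. $R$ is a linear operator

This essential property of rotation operators follows from their very definition.
If the state $|\psi\rangle$ before the rotation is a linear superposition of states, for example:

$$|\psi\rangle = \lambda_1\, |\psi_1\rangle + \lambda_2\, |\psi_2\rangle \tag{23}$$

formula (21) indicates that:

$$\begin{aligned}
\langle \mathbf{r} | R | \psi \rangle = \langle \mathcal{R}^{-1}\mathbf{r} | \psi \rangle
&= \lambda_1\, \langle \mathcal{R}^{-1}\mathbf{r} | \psi_1 \rangle + \lambda_2\, \langle \mathcal{R}^{-1}\mathbf{r} | \psi_2 \rangle \\
&= \lambda_1\, \langle \mathbf{r} | R | \psi_1 \rangle + \lambda_2\, \langle \mathbf{r} | R | \psi_2 \rangle
\end{aligned} \tag{24}$$

Since this relation is true for any ket of the $\{ |\mathbf{r}\rangle \}$ basis, we deduce that $R$ is a
linear operator:

$$R\, |\psi\rangle = R\, [\lambda_1\, |\psi_1\rangle + \lambda_2\, |\psi_2\rangle] = \lambda_1 R\, |\psi_1\rangle + \lambda_2 R\, |\psi_2\rangle \tag{25}$$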

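β. $R$ is unitary

In formula (21), the ket $|\psi\rangle$ can be arbitrary. The action of the operator $R$
on the bra $\langle \mathbf{r} |$ is therefore given by:

$$\langle \mathbf{r} |\, R = \langle \mathcal{R}^{-1}\mathbf{r} | \tag{26}$$

Taking the Hermitian conjugate of both sides of equation (26), we obtain:

$$R^\dagger\, |\mathbf{r}\rangle = |\mathcal{R}^{-1}\mathbf{r}\rangle \tag{27}$$

Moreover, if we recall that the ket $|\mathbf{r}\rangle$ represents a state in which the particle
is perfectly localized at the point $\mathbf{r}$, we see that:

$$R\, |\mathbf{r}\rangle = |\mathcal{R}\mathbf{r}\rangle \tag{28}$$

This equation simply expresses the fact that if the particle was localized at the
point $\mathbf{r}$ before the rotation, it will be localized at the point $\mathbf{r}' = \mathcal{R}\mathbf{r}$ after the
rotation. To get (28) from (21), we choose a basis state $|\mathbf{r}_0\rangle$ for $|\psi\rangle$:

$$\langle \mathbf{r} | R | \mathbf{r}_0 \rangle = \langle \mathcal{R}^{-1}\mathbf{r} | \mathbf{r}_0 \rangle = \delta[(\mathcal{R}^{-1}\mathbf{r}) - \mathbf{r}_0] \tag{29}$$

where we have used the orthonormalization relation of the $\{ |\mathbf{r}\rangle \}$ basis.
Furthermore*:

$$\delta[(\mathcal{R}^{-1}\mathbf{r}) - \mathbf{r}_0] = \delta[\mathbf{r} - (\mathcal{R}\mathbf{r}_0)] \tag{30}$$

Substituting (30) into (29), we indeed find that:

$$\langle \mathbf{r} | R | \mathbf{r}_0 \rangle = \delta[\mathbf{r} - (\mathcal{R}\mathbf{r}_0)] = \langle \mathbf{r} | \mathcal{R}\mathbf{r}_0 \rangle \tag{31}$$

that is, since $\{ |\mathbf{r}\rangle \}$ is a basis of $\mathcal{E}_{\mathbf{r}}$:

$$R\, |\mathbf{r}_0\rangle = |\mathcal{R}\mathbf{r}_0\rangle \tag{32}$$

Starting with formulas (27) and (28), it is easy to show that:

$$R R^\dagger = R^\dagger R = 1 \tag{33}$$

since the action of $R R^\dagger$ or $R^\dagger R$ on any vector of the $\{ |\mathbf{r}\rangle \}$ basis yields the same
vector; for example:

$$R R^\dagger\, |\mathbf{r}\rangle = R\, |\mathcal{R}^{-1}\mathbf{r}\rangle = |\mathcal{R}\mathcal{R}^{-1}\mathbf{r}\rangle = |\mathbf{r}\rangle \tag{34}$$

The operator $R$ is therefore unitary.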

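COMMENT:

The operator $R$ therefore conserves the scalar product and the norm of
vectors that it transforms:

$$\left. \begin{array}{l} |\psi'\rangle = R\, |\psi\rangle \\ |\varphi'\rangle = R\, |\varphi\rangle \end{array} \right\} \Longrightarrow \langle \psi' | \varphi' \rangle = \langle \psi | \varphi \rangle \tag{35}$$

This property is very important from the physical point of view since the
probability amplitudes which yield physical predictions appear in the form of
scalar products of two kets.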

* Relation (30) can easily be established by using the definition of delta “functions” and the fact
that a rotation conserves the infinitesimal volume element.

γ. The set of operators $R$ constitutes a representation of the rotation group

We have pointed out (§2) that the geometrical rotations form a group; in
particular, the product of two rotations $\mathcal{R}_1$ and $\mathcal{R}_2$ is always a rotation:

$$\mathcal{R}_2\, \mathcal{R}_1 = \mathcal{R}_3 \tag{36}$$

With the three geometrical rotations $\mathcal{R}_1$, $\mathcal{R}_2$ and $\mathcal{R}_3$ are associated, in the
state space $\mathcal{E}_{\mathbf{r}}$, three rotation operators $R_1$, $R_2$ and $R_3$, respectively. If the three
geometrical rotations satisfy relation (36), we shall show that the corresponding
rotation operators are such that:

$$R_2\, R_1 = R_3 \tag{37}$$

($R_2 R_1$ is a product of operators of $\mathcal{E}_{\mathbf{r}}$ as defined in chapter II, §B-3-a).

Consider a particle whose state is described by an arbitrary ket $|\mathbf{r}\rangle$ of the
basis characterizing the $\{ |\mathbf{r}\rangle \}$ representation. If we perform the rotation $\mathcal{R}_1$
on this particle, its state becomes:

$$R_1\, |\mathbf{r}\rangle = |\mathcal{R}_1 \mathbf{r}\rangle \tag{38}$$

by definition of $R_1$. Now, we perform the rotation $\mathcal{R}_2$ on the new state we have
just obtained; the state of the particle after this second rotation is, according to (38)
and the definition of $R_2$:

$$R_2 R_1\, |\mathbf{r}\rangle = R_2\, |\mathcal{R}_1 \mathbf{r}\rangle = |\mathcal{R}_2 \mathcal{R}_1 \mathbf{r}\rangle \tag{39}$$

If we take (36) into account, we see that relation (39) is equivalent to:

$$R_2 R_1\, |\mathbf{r}\rangle = |\mathcal{R}_3 \mathbf{r}\rangle \tag{40}$$

Now, the operator $R_3$, associated with the rotation $\mathcal{R}_3$, is such that:

$$R_3\, |\mathbf{r}\rangle = |\mathcal{R}_3 \mathbf{r}\rangle \tag{41}$$

Relation (37) is therefore proven, since the ket $|\mathbf{r}\rangle$ under consideration can be
chosen in an arbitrary way from the kets of the $\{ |\mathbf{r}\rangle \}$ basis.

To express the important result which we have just established, one says
that the correspondence $\mathcal{R} \Longrightarrow R$ between geometrical rotations and rotation
operators conserves the group law, or that the set of operators $R$ constitutes a
“representation” of the rotation group. Of course, with the identity rotation,
we associate the identity operator in $\mathcal{E}_{\mathbf{r}}$, and with the rotation $\mathcal{R}^{-1}$ (the inverse
of a rotation $\mathcal{R}$), the operator $R^{-1}$, which is the inverse of the one which corresponds
to $\mathcal{R}$ (we showed in §3-b-β that $R^{-1} = R^\dagger$).


c. EXPRESSION FOR ROTATION OPERATORS 

IN TERMS OF ANGULAR MOMENTUM OBSERVABLES 


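α. Infinitesimal rotation operators

First of all, let us consider an infinitesimal rotation about the $Oz$ axis,
$\mathcal{R}_{\mathbf{e}_z}(d\alpha)$. If we apply it to a particle whose state is described by the wave function $\psi(\mathbf{r})$,
we know from (19) that the wave function $\psi'(\mathbf{r})$ associated with the state of the
particle after rotation satisfies:

$$\psi'(\mathbf{r}) = \psi[\mathcal{R}_{\mathbf{e}_z}^{-1}(d\alpha)\, \mathbf{r}] \tag{42}$$

But if $(x, y, z)$ are the components of $\mathbf{r}$, those of $\mathcal{R}_{\mathbf{e}_z}^{-1}(d\alpha)\, \mathbf{r}$ can easily be calculated
from (6):

$$\mathcal{R}_{\mathbf{e}_z}^{-1}(d\alpha)\, \mathbf{r} = \mathcal{R}_{\mathbf{e}_z}(-d\alpha)\, \mathbf{r} = (\mathbf{r} - d\alpha\, \mathbf{e}_z \times \mathbf{r}) = \begin{pmatrix} x + y\, d\alpha \\ y - x\, d\alpha \\ z \end{pmatrix} \tag{43}$$

Equation (42) can then be written in the form:

$$\psi'(x, y, z) = \psi(x + y\, d\alpha,\; y - x\, d\alpha,\; z) \tag{44}$$

which yields, to first order in $d\alpha$:

$$\begin{aligned}
\psi'(x, y, z) &= \psi(x, y, z) + d\alpha \left[ y\, \frac{\partial \psi}{\partial x} - x\, \frac{\partial \psi}{\partial y} \right] \\
&= \psi(x, y, z) - d\alpha \left[ x\, \frac{\partial}{\partial y} - y\, \frac{\partial}{\partial x} \right] \psi(x, y, z)
\end{aligned} \tag{45}$$

Within the brackets, we recognize, to within a factor of $i/\hbar$, the expression in
the $\{ |\mathbf{r}\rangle \}$ representation for the operator $L_z = X P_y - Y P_x$. We therefore obtain
the result:

$$\psi'(\mathbf{r}) = \langle \mathbf{r} | \psi' \rangle = \langle \mathbf{r} | \left( 1 - \frac{i}{\hbar}\, d\alpha\, L_z \right) | \psi \rangle \tag{46}$$

Now, by definition of the operator $R_{\mathbf{e}_z}(d\alpha)$ associated with the rotation $\mathcal{R}_{\mathbf{e}_z}(d\alpha)$:

$$|\psi'\rangle = R_{\mathbf{e}_z}(d\alpha)\, |\psi\rangle \tag{47}$$

Therefore, since the original state $|\psi\rangle$ is arbitrary, we find, finally, that:

$$R_{\mathbf{e}_z}(d\alpha) = 1 - \frac{i}{\hbar}\, d\alpha\, L_z \tag{48}$$

The preceding argument can easily be generalized to an infinitesimal rotation
about an arbitrary axis. We therefore have, in general:

$$R_{\mathbf{u}}(d\alpha) = 1 - \frac{i}{\hbar}\, d\alpha\, \mathbf{L} \cdot \mathbf{u} \tag{49}$$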
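The first-order expansion leading from (44) to (45) can be compared with the exact rotated wave function. The sketch below is an illustration added here; the test function $\psi = (x + 2y)\, e^{-(x^2 + 2y^2 + z^2)}$, chosen because it is not invariant under rotations about $Oz$, and the function names are assumptions of this example.

```python
import math

def psi(x, y, z):
    """Test wave function, deliberately not symmetric about the Oz axis."""
    return (x + 2 * y) * math.exp(-(x * x + 2 * y * y + z * z))

def grad_xy(x, y, z):
    """Analytic partial derivatives (d psi/dx, d psi/dy) of the test function."""
    g = math.exp(-(x * x + 2 * y * y + z * z))
    s = x + 2 * y
    return g * (1 - 2 * x * s), g * (2 - 4 * y * s)

def rotated_exact(x, y, z, dalpha):
    """psi'(r) = psi(R^{-1} r) for an exact rotation through dalpha about Oz, relation (42)."""
    c, s = math.cos(dalpha), math.sin(dalpha)
    return psi(c * x + s * y, -s * x + c * y, z)

def rotated_first_order(x, y, z, dalpha):
    """Right-hand side of (45): psi - dalpha (x d/dy - y d/dx) psi."""
    dpsi_dx, dpsi_dy = grad_xy(x, y, z)
    return psi(x, y, z) - dalpha * (x * dpsi_dy - y * dpsi_dx)

def truncation_error(point, dalpha):
    """Difference between the exact rotated value and its first-order approximation."""
    return abs(rotated_exact(*point, dalpha) - rotated_first_order(*point, dalpha))
```

At a sample point such as `(0.3, -0.2, 0.1)`, `truncation_error` decreases like $d\alpha^2$ when $d\alpha$ is reduced, as expected for an expansion limited to first order.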


COMMENT:

(46) can also be quickly established by using the spherical coordinates $(r, \theta, \varphi)$,
since $L_z$ then corresponds to the differential operator $\dfrac{\hbar}{i} \dfrac{\partial}{\partial \varphi}$.

β. Interpretation of the commutation relations for the components of the angular momentum $\mathbf{L}$

What then is the “image” in the state space of relation (8)? According
to the results of §3-b-γ and the expressions we have just obtained, this relation
implies that, to first order with respect to each of the angles $d\alpha$ and $d\alpha'$:

$$\left[ 1 + \frac{i}{\hbar}\, d\alpha'\, L_y \right] \left[ 1 - \frac{i}{\hbar}\, d\alpha\, L_x \right] \left[ 1 - \frac{i}{\hbar}\, d\alpha'\, L_y \right] \left[ 1 + \frac{i}{\hbar}\, d\alpha\, L_x \right] = 1 - \frac{i}{\hbar}\, d\alpha\, d\alpha'\, L_z \tag{50}$$

Expanding the left-hand side and setting the coefficients of $d\alpha\, d\alpha'$ equal, we easily
find that relation (50) reduces to:

$$[L_x, L_y] = i\hbar\, L_z \tag{51}$$

Of course, the two other commutation relations for the components of $\mathbf{L}$ can be
found, by an analogous argument, from the formulas obtained from (8) by
cyclic permutation of the vectors $\mathbf{e}_x$, $\mathbf{e}_y$ and $\mathbf{e}_z$.
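For the reader who wishes to follow the expansion of (50) explicitly, here is a sketch of the intermediate step (added here; only the terms in $d\alpha\, d\alpha'$ are written, since (50) holds to first order in each angle separately). The terms linear in $d\alpha$ or in $d\alpha'$ cancel in pairs, and the four products containing one factor $d\alpha$ and one factor $d\alpha'$ give:

```latex
\begin{aligned}
\frac{d\alpha\, d\alpha'}{\hbar^2}
  \left[ L_y L_x - L_y L_x - L_x L_y + L_y L_x \right]
  &= -\frac{d\alpha\, d\alpha'}{\hbar^2}\, [L_x, L_y] \\
-\frac{1}{\hbar^2}\, [L_x, L_y] = -\frac{i}{\hbar}\, L_z
  \quad &\Longrightarrow \quad [L_x, L_y] = i\hbar\, L_z
\end{aligned}
```

The four products come, in this order, from the pairs of factors (first, second), (first, fourth), (second, third) and (third, fourth) on the left-hand side of (50).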

Thus, the commutation relations of the orbital angular momentum of a particle 
can be seen to be consequences of the non-commutative structure of the geometrical 
rotation group. 
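The commutation relation (51) and its cyclic permutations can be verified on explicit matrices. The sketch below is an illustration added here; it builds the matrices of $L_x$, $L_y$, $L_z$ in a subspace of fixed $l$ from the standard matrix elements of $L_+$ (a result of chapter VI that is assumed, not rederived, in this example), with $\hbar$ set to 1 by default.

```python
import math

def angular_momentum_matrices(l, hbar=1.0):
    """Matrices of L_x, L_y, L_z in the basis |l, m>, ordered m = l, l-1, ..., -l."""
    ms = [l - k for k in range(2 * l + 1)]
    dim = len(ms)
    lplus = [[0j] * dim for _ in range(dim)]
    for col in range(1, dim):
        m = ms[col]  # L+ |l, m> = hbar sqrt(l(l+1) - m(m+1)) |l, m+1>
        lplus[col - 1][col] = complex(hbar * math.sqrt(l * (l + 1) - m * (m + 1)))
    lminus = [[lplus[c][r].conjugate() for c in range(dim)] for r in range(dim)]
    lx = [[(lplus[r][c] + lminus[r][c]) / 2 for c in range(dim)] for r in range(dim)]
    ly = [[(lplus[r][c] - lminus[r][c]) / 2j for c in range(dim)] for r in range(dim)]
    lz = [[complex(hbar * ms[r]) if r == c else 0j for c in range(dim)]
          for r in range(dim)]
    return lx, ly, lz

def matmul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def commutator(a, b):
    """Matrix of [a, b] = ab - ba."""
    ab, ba = matmul(a, b), matmul(b, a)
    n = len(a)
    return [[ab[i][j] - ba[i][j] for j in range(n)] for i in range(n)]

def commutation_residual(l, hbar=1.0):
    """Largest element of [L_x, L_y] - i hbar L_z and of its two cyclic permutations."""
    lx, ly, lz = angular_momentum_matrices(l, hbar)
    worst = 0.0
    for a, b, c in ((lx, ly, lz), (ly, lz, lx), (lz, lx, ly)):
        comm = commutator(a, b)
        n = len(comm)
        worst = max(worst, max(abs(comm[i][j] - 1j * hbar * c[i][j])
                               for i in range(n) for j in range(n)))
    return worst
```

For $l = 1$ or $l = 2$, `commutation_residual` vanishes to rounding error.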

γ. Finite rotation operators

Now, consider a rotation $\mathcal{R}_{\mathbf{e}_z}(\alpha)$ through an arbitrary angle $\alpha$ about the $Oz$
axis. According to formula (7), the operator $R_{\mathbf{e}_z}(\alpha)$ associated with such a rotation
must satisfy (again using the results of §3-b-γ):

$$R_{\mathbf{e}_z}(\alpha + d\alpha) = R_{\mathbf{e}_z}(\alpha)\, R_{\mathbf{e}_z}(d\alpha) \tag{52}$$

where the two operators of the right-hand side commute. But we know the
expression for $R_{\mathbf{e}_z}(d\alpha)$, so we have:

$$R_{\mathbf{e}_z}(\alpha + d\alpha) = R_{\mathbf{e}_z}(\alpha) \left[ 1 - \frac{i}{\hbar}\, d\alpha\, L_z \right] \tag{53}$$

that is:

$$R_{\mathbf{e}_z}(\alpha + d\alpha) - R_{\mathbf{e}_z}(\alpha) = -\frac{i}{\hbar}\, d\alpha\, R_{\mathbf{e}_z}(\alpha)\, L_z \tag{54}$$

Here again, $R_{\mathbf{e}_z}(\alpha)$ and $L_z$ must commute. Although we are dealing with operators,
the solution of equation (54) is formally the same as it would be if we were considering
an ordinary function of the variable $\alpha$:

$$R_{\mathbf{e}_z}(\alpha) = e^{-\frac{i}{\hbar} \alpha L_z} \tag{55}$$

Indeed, if we recall (cf. complement B_II, §4) that the exponential of an operator is defined
by the corresponding power series expansion, it is easy to verify that expression (55) is the
solution of equation (54). Moreover, the “integration constant” is equal to 1, since we know
that:

$$R_{\mathbf{e}_z}(0) = 1 \tag{56}$$

As in §3-c-α above, it is easy to generalize this result to a finite rotation
about an arbitrary axis:

$$R_{\mathbf{u}}(\alpha) = e^{-\frac{i}{\hbar} \alpha\, \mathbf{L} \cdot \mathbf{u}} \tag{57}$$

COMMENTS : 


(i) Formula (57) can be written explicitly in the form:

$$R_{\mathbf{u}}(\alpha) = e^{-\frac{i}{\hbar} \alpha (L_x u_x + L_y u_y + L_z u_z)} \tag{58}$$

where $u_x$, $u_y$ and $u_z$ are the components of the unit vector $\mathbf{u}$. Recall, however,
that, since $L_x$, $L_y$ and $L_z$ do not commute:

$$R_{\mathbf{u}}(\alpha) \neq e^{-\frac{i}{\hbar} \alpha L_x u_x}\; e^{-\frac{i}{\hbar} \alpha L_y u_y}\; e^{-\frac{i}{\hbar} \alpha L_z u_z} \tag{59}$$

(ii) It can be seen from expression (57) that the operator $R_{\mathbf{u}}(\alpha)$ is unitary. Since
the components of $\mathbf{L}$ are Hermitian:

$$R_{\mathbf{u}}^\dagger(\alpha) = e^{\frac{i}{\hbar} \alpha\, \mathbf{L} \cdot \mathbf{u}} \tag{60}$$

we have (as $\mathbf{L} \cdot \mathbf{u}$ obviously commutes with itself):

$$R_{\mathbf{u}}(\alpha)\, R_{\mathbf{u}}^\dagger(\alpha) = R_{\mathbf{u}}^\dagger(\alpha)\, R_{\mathbf{u}}(\alpha) = 1 \tag{61}$$

(iii) In the special case envisaged in this section, we find that:

$$R_{\mathbf{u}}(2\pi) = 1 \tag{62}$$

We shall confine ourselves to proving this result for the rotation through $2\pi$
about the $Oz$ axis (the generalization of this proof involves no difficulties).
To this end, consider an arbitrary ket $|\psi\rangle$, and expand it on a basis composed
of eigenvectors of the observable $L_z$:

$$|\psi\rangle = \sum_{m, \tau} c_{m, \tau}\, |m, \tau\rangle \tag{63}$$

with:

$$L_z\, |m, \tau\rangle = m\hbar\, |m, \tau\rangle \tag{64}$$

($\tau$ symbolizes the indices other than $m$ which are necessary in order to
specify the vectors of the basis used, which may, for example, be a “standard”
basis $\{ |k, l, m\rangle \}$ such as those introduced in §C-3 of chapter VI). The
action of $R_{\mathbf{e}_z}(\alpha)$ on $|\psi\rangle$ is then easy to obtain:

$$R_{\mathbf{e}_z}(\alpha)\, |\psi\rangle = \sum_{m, \tau} c_{m, \tau}\, e^{-\frac{i}{\hbar} \alpha L_z}\, |m, \tau\rangle = \sum_{m, \tau} c_{m, \tau}\, e^{-i \alpha m}\, |m, \tau\rangle \tag{65}$$

But we know that, for the orbital angular momentum of a particle, $m$ is always
integral. Consequently, when $\alpha$ attains the value $2\pi$, all the factors $e^{-i \alpha m}$
become equal to 1, and:

$$R_{\mathbf{e}_z}(2\pi)\, |\psi\rangle = \sum_{m, \tau} c_{m, \tau}\, |m, \tau\rangle = |\psi\rangle \tag{66}$$

Since this relation is satisfied for all $|\psi\rangle$, we deduce that $R_{\mathbf{e}_z}(2\pi)$ is the
identity operator.

The preceding argument clearly indicates that formula (62) would not be
valid if half-integral values of $m$ were not excluded. Indeed, we shall see in
complement A_IX that, for a spin 1/2, the operator associated with a rotation
of $2\pi$ is equal to $-1$ and not 1; this result is related to the fact that we
constructed the finite rotations from infinitesimal rotations (cf. the footnote of §2-b).
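Comments (i) to (iii) can be illustrated on finite matrices. The sketch below is an addition to the text, under stated assumptions: $\hbar = 1$, the exponential (57) is summed as a truncated power series, the matrices of the angular momentum in a subspace of fixed $j$ are built from the standard matrix elements of $J_+$ (chapter VI), and the case $j = 1/2$ is included only to display the sign mentioned in the last paragraph above. All function names are illustrative.

```python
import math

def angular_momentum_matrices(j):
    """J_x, J_y, J_z (in units of hbar) in the basis |j, m>, ordered m = j, j-1, ..., -j."""
    dim = int(round(2 * j + 1))
    ms = [j - k for k in range(dim)]
    jplus = [[0j] * dim for _ in range(dim)]
    for col in range(1, dim):
        m = ms[col]  # J+ |j, m> = sqrt(j(j+1) - m(m+1)) |j, m+1>
        jplus[col - 1][col] = complex(math.sqrt(j * (j + 1) - m * (m + 1)))
    jminus = [[jplus[c][r].conjugate() for c in range(dim)] for r in range(dim)]
    jx = [[(jplus[r][c] + jminus[r][c]) / 2 for c in range(dim)] for r in range(dim)]
    jy = [[(jplus[r][c] - jminus[r][c]) / 2j for c in range(dim)] for r in range(dim)]
    jz = [[complex(ms[r]) if r == c else 0j for c in range(dim)] for r in range(dim)]
    return jx, jy, jz

def matmul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def dagger(a):
    """Hermitian conjugate of a square matrix."""
    n = len(a)
    return [[a[c][r].conjugate() for c in range(n)] for r in range(n)]

def expm(a, terms=60):
    """Exponential of a square matrix, summed as a truncated power series."""
    n = len(a)
    result = [[complex(r == c) for c in range(n)] for r in range(n)]
    term = [row[:] for row in result]
    for k in range(1, terms):
        term = [[x / k for x in row] for row in matmul(term, a)]
        result = [[result[r][c] + term[r][c] for c in range(n)] for r in range(n)]
    return result

def rotation_operator(j, u, alpha):
    """Matrix of exp(-i alpha J.u), formula (57) with hbar = 1, in the subspace of fixed j."""
    jx, jy, jz = angular_momentum_matrices(j)
    n = len(jx)
    generator = [[-1j * alpha * (u[0] * jx[r][c] + u[1] * jy[r][c] + u[2] * jz[r][c])
                  for c in range(n)] for r in range(n)]
    return expm(generator)

def distance(a, b):
    """Largest modulus of the difference between corresponding matrix elements."""
    n = len(a)
    return max(abs(a[r][c] - b[r][c]) for r in range(n) for c in range(n))
```

For $j = 1$ and any unit vector `u`, `rotation_operator(1, u, 2 * math.pi)` is the identity matrix to rounding error, whereas `rotation_operator(0.5, u, 2 * math.pi)` is its opposite. The product of three exponentials on the right-hand side of (59) can be formed with `matmul` and differs from `rotation_operator(1, u, alpha)` as soon as two components of `u` are non-zero.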


4. Rotation operators 

in the state space of an arbitrary system 

We shall now generalize the concepts we introduced and the results we 
obtained for a special case (in §3). 


a. SYSTEM OF SEVERAL SPINLESS PARTICLES 

First of all, the arguments of §3 can be extended without difficulty to systems 
composed of several spinless particles. We shall quickly demonstrate this, choosing 
as an example a system of two spinless particles, (1) and (2). 

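The state space $\mathcal{E}$ of such a system is the tensor product of the state spaces $\mathcal{E}_{\mathbf{r}_1}$
and $\mathcal{E}_{\mathbf{r}_2}$ of the two particles:

$$\mathcal{E} = \mathcal{E}_{\mathbf{r}_1} \otimes \mathcal{E}_{\mathbf{r}_2} \tag{67}$$

We shall use the same notation as in §F-4-b of chapter II. Starting from the
position and momentum observables ($\mathbf{R}_1$ and $\mathbf{P}_1$ on the one hand, $\mathbf{R}_2$ and $\mathbf{P}_2$ on the
other), we can define an orbital angular momentum for each of the particles:

$$\mathbf{L}_1 = \mathbf{R}_1 \times \mathbf{P}_1 \qquad \mathbf{L}_2 = \mathbf{R}_2 \times \mathbf{P}_2 \tag{68}$$

The components of $\mathbf{L}_1$, as well as those of $\mathbf{L}_2$, satisfy the commutation relations
characteristic of angular momenta.

Consider a vector which is a tensor product of a vector of $\mathcal{E}_{\mathbf{r}_1}$ and a vector
of $\mathcal{E}_{\mathbf{r}_2}$:

$$|\psi\rangle = |\varphi(1)\rangle \otimes |\chi(2)\rangle \tag{69}$$

$|\psi\rangle$ represents the state of the system formed by particle (1) in the state $|\varphi(1)\rangle$
and particle (2) in the state $|\chi(2)\rangle$. If we perform a rotation through an angle $\alpha$
about $\mathbf{u}$ on the two-particle system, the state of the system after the rotation
corresponds to the two particles in the “rotated states” $|\varphi'(1)\rangle$ and $|\chi'(2)\rangle$
respectively:

$$|\psi'\rangle = |\varphi'(1)\rangle \otimes |\chi'(2)\rangle = [{}^{1}\!R_{\mathbf{u}}(\alpha)\, |\varphi(1)\rangle] \otimes [{}^{2}\!R_{\mathbf{u}}(\alpha)\, |\chi(2)\rangle] \tag{70}$$

where ${}^{1}\!R_{\mathbf{u}}(\alpha)$ and ${}^{2}\!R_{\mathbf{u}}(\alpha)$ are the rotation operators in $\mathcal{E}_{\mathbf{r}_1}$ and $\mathcal{E}_{\mathbf{r}_2}$:

$${}^{1}\!R_{\mathbf{u}}(\alpha) = e^{-\frac{i}{\hbar} \alpha\, \mathbf{L}_1 \cdot \mathbf{u}} \tag{71-a}$$

$${}^{2}\!R_{\mathbf{u}}(\alpha) = e^{-\frac{i}{\hbar} \alpha\, \mathbf{L}_2 \cdot \mathbf{u}} \tag{71-b}$$

Relation (70) can also be written, by definition of the tensor product of two
operators (chap. II, §F-2-b):

$$|\psi'\rangle = [{}^{1}\!R_{\mathbf{u}}(\alpha) \otimes {}^{2}\!R_{\mathbf{u}}(\alpha)]\; |\varphi(1)\rangle \otimes |\chi(2)\rangle \tag{72}$$

Since every vector of $\mathcal{E}$ is a linear combination of vectors analogous to (69), the
rotation transform $|\psi'\rangle$ of an arbitrary vector $|\psi\rangle$ of $\mathcal{E}$ is:

$$|\psi'\rangle = [{}^{1}\!R_{\mathbf{u}}(\alpha) \otimes {}^{2}\!R_{\mathbf{u}}(\alpha)]\, |\psi\rangle \tag{73}$$

Using formula (F-14) of chapter II and the fact that $\mathbf{L}_1$ and $\mathbf{L}_2$ commute (they are
operators which relate to different particles), we obtain for the rotation operators
in $\mathcal{E}$:

$${}^{1}\!R_{\mathbf{u}}(\alpha) \otimes {}^{2}\!R_{\mathbf{u}}(\alpha) = e^{-\frac{i}{\hbar} \alpha\, \mathbf{L}_1 \cdot \mathbf{u}}\; e^{-\frac{i}{\hbar} \alpha\, \mathbf{L}_2 \cdot \mathbf{u}} = e^{-\frac{i}{\hbar} \alpha\, \mathbf{L} \cdot \mathbf{u}} \tag{74}$$

where:

$$\mathbf{L} = \mathbf{L}_1 + \mathbf{L}_2 \tag{75}$$

is the total angular momentum of the two-particle system. All the formulas of the
preceding section therefore remain valid as long as $\mathbf{L}$ represents the total
angular momentum.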


COMMENTS:

(i) $\mathbf{L}$ is an operator which acts in $\mathcal{E}$. In (75), $\mathbf{L}_1$ is, rigorously, the extension
of the operator $\mathbf{L}_1$ acting in $\mathcal{E}_{\mathbf{r}_1}$ into $\mathcal{E}$ (an analogous comment could be made
for $\mathbf{L}_2$). To simplify the notation, we shall not use different symbols for $\mathbf{L}_1$
and its extension into $\mathcal{E}$ (cf. chap. II, §F-2-c).

(ii) We might consider performing a rotation on only one of the two particles,
for example, particle (1). In the course of such a “partial rotation”, a vector
such as (69) transforms into:

$$[{}^{1}\!R_{\mathbf{u}}(\alpha)\, |\varphi(1)\rangle] \otimes |\chi(2)\rangle \tag{76}$$

where only the state of particle (1) is modified. As above, it can be shown
that the effect of a rotation performed only on particle (1) on an arbitrary
state $|\psi\rangle$ of $\mathcal{E}$ is described by the operator:

$${}^{1}\!R_{\mathbf{u}}(\alpha) \otimes \mathbb{1}(2) = e^{-\frac{i}{\hbar} \alpha\, \mathbf{L}_1 \cdot \mathbf{u}} \tag{77}$$

where $\mathbb{1}(2)$ is the unit operator in $\mathcal{E}_{\mathbf{r}_2}$ [in (77), $\mathbf{L}_1$ acts in $\mathcal{E}$].

b. AN ARBITRARY SYSTEM

The starting point of the arguments elaborated thus far is equation (19), 
which gives the transformation law of the state vector of the system in terms 
of that of its wave function. In the case of an arbitrary quantum mechanical 
system (which does not necessarily have a classical analogue), one cannot use 
the same method. For example, for a particle with spin, the operators X, Y and Z 
no longer form a C.S.C.O., and the state of the particle can no longer be defined 
by a wave function *p(x, y, z) (cf. chap. IX). One must reason directly in the state 
space £ of the system. Without going into detail, we shall assume here that an 
operator R acting in £ can be associated with any geometrical rotation 0t\ if the 
system is initially in the state | ^ ), the rotation 01 takes it into the state: 

|*'> = *|*> (78) 

where the operator R is linear and unitary (cf. comment of §3-b-(i). 

As far as the group law of the rotations 01 is concerned, it is conserved by the 
operators R, but only locally: the product of two geometrical rotations, at least 
one of which is infinitesimal, is represented in the state space £ by the product 
of the corresponding operators R (which implies, in particular, that the “image” 
of a rotation through an angle of zero is the identity operator). However, the 
operator associated with a geometrical rotation through an angle 2n is not necessarily 
the identity operator [ cf. comment (iii) of § 3-c-y and complement A IX ]. 

Now, let us consider an infinitesimal rotation $\mathscr{R}_{\mathbf{e}_z}(d\alpha)$ about the $Oz$ axis. 
Since the group law is conserved for infinitesimal rotations, the operator $R_{\mathbf{e}_z}(d\alpha)$ 
is necessarily of the form: 

$$ R_{\mathbf{e}_z}(d\alpha) = 1 - \frac{i}{\hbar}\, d\alpha\, J_z \tag{79} $$

where $J_z$ is a Hermitian operator since $R_{\mathbf{e}_z}(d\alpha)$ is unitary (cf. complement $C_{II}$, §3). 
This relation is the definition of $J_z$. Similarly, the Hermitian operators $J_x$ and $J_y$ 
can be introduced by starting with infinitesimal rotations about the $Ox$ and $Oy$ axes. 
The total angular momentum $\mathbf{J}$ of the system is then defined in terms of its three 
components $J_x$, $J_y$ and $J_z$. 

Now we can use the reasoning of §3-c-β: the geometrical relation (8) implies 
that the components of J satisfy commutation relations which are identical to 
those of orbital angular momenta. Thus, the total angular momentum of any 
quantum mechanical system is related to the corresponding rotation operators; the 
commutation relations amongst its components follow directly, which enables us 
to use them, as in chapter VI (§B-2), to characterize any angular momentum. 

Finally, let us show that, with $J_x$, $J_y$ and $J_z$ defined as we have just indicated, 
the operator $R_{\mathbf{u}}(d\alpha)$ associated with an arbitrary infinitesimal rotation is written 
($u_x$, $u_y$ and $u_z$ being the components of the unit vector $\mathbf{u}$): 

$$ R_{\mathbf{u}}(d\alpha) = 1 - \frac{i}{\hbar}\, d\alpha\, (J_x u_x + J_y u_y + J_z u_z) \tag{80} $$

which can then be condensed into the form: 

$$ R_{\mathbf{u}}(d\alpha) = 1 - \frac{i}{\hbar}\, d\alpha\, \mathbf{J}\cdot\mathbf{u} \tag{81} $$

Formula (80) is simply a consequence of the geometrical relation: 

$$ \mathscr{R}_{\mathbf{u}}(d\alpha) = \mathscr{R}_{\mathbf{e}_x}(u_x\, d\alpha)\; \mathscr{R}_{\mathbf{e}_y}(u_y\, d\alpha)\; \mathscr{R}_{\mathbf{e}_z}(u_z\, d\alpha) \tag{82} $$

valid to first order in $d\alpha$, and which can be obtained directly from formula (6). 
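
As a short worked step (a sketch that only uses the operators already defined by (79) and its analogues for the $Ox$ and $Oy$ axes), one can replace each geometrical factor of (82) by the corresponding operator and keep only the terms of first order in $d\alpha$:

```latex
\begin{aligned}
R_{\mathbf{u}}(d\alpha)
  &= \Bigl(1 - \tfrac{i}{\hbar}\, u_x\, d\alpha\, J_x\Bigr)
     \Bigl(1 - \tfrac{i}{\hbar}\, u_y\, d\alpha\, J_y\Bigr)
     \Bigl(1 - \tfrac{i}{\hbar}\, u_z\, d\alpha\, J_z\Bigr) \\
  &= 1 - \tfrac{i}{\hbar}\, d\alpha\, (J_x u_x + J_y u_y + J_z u_z) + O(d\alpha^2)
\end{aligned}
```

The discarded terms are of second order in $d\alpha$, so the order in which the three factors are written does not matter at this level of approximation.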

We have thus generalized expressions (48) and (49) for infinitesimal rotation 
operators. Since the group law is conserved locally (see above), relation (52) and 
the argument following it remain valid. Consequently, the finite rotation operators 
have expressions analogous to (55) and (57): 


$$ R_{\mathbf{u}}(\alpha) = e^{-\frac{i}{\hbar}\, \alpha\, \mathbf{J}\cdot\mathbf{u}} \tag{83} $$
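
The properties stated above can be checked numerically on the simplest example. The sketch below assumes a spin 1/2, for which $\mathbf{J} = (\hbar/2)\,\boldsymbol{\sigma}$ (this identification comes from the theory of spin 1/2, not from the present section), and uses the closed form of the exponential (83) valid in that case. The helper names (`rotation_operator`, `distance`, and so on) are illustrative choices, not notation from the text.

```python
import math

# Pauli matrices; for spin 1/2 we assume J = (hbar / 2) * sigma.
SIGMA_X = [[0, 1], [1, 0]]
SIGMA_Y = [[0, -1j], [1j, 0]]
SIGMA_Z = [[1, 0], [0, -1]]
IDENTITY = [[1, 0], [0, 1]]


def rotation_operator(alpha, u):
    """exp(-i * alpha * J.u / hbar) for spin 1/2 and a unit vector u.

    Since (sigma.u)^2 = 1, the exponential series sums to
    cos(alpha / 2) * 1 - i * sin(alpha / 2) * sigma.u.
    """
    ux, uy, uz = u
    c, s = math.cos(alpha / 2), math.sin(alpha / 2)
    return [
        [
            c * IDENTITY[i][j]
            - 1j * s * (ux * SIGMA_X[i][j] + uy * SIGMA_Y[i][j] + uz * SIGMA_Z[i][j])
            for j in range(2)
        ]
        for i in range(2)
    ]


def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]


def dagger(a):
    n = len(a)
    return [[complex(a[j][i]).conjugate() for j in range(n)] for i in range(n)]


def distance(a, b):
    """Largest absolute difference between corresponding matrix elements."""
    n = len(a)
    return max(abs(a[i][j] - b[i][j]) for i in range(n) for j in range(n))


e_z = (0.0, 0.0, 1.0)
u = (0.6, 0.0, 0.8)  # an arbitrary unit vector

# Rotation through an angle of zero: the identity operator.
err_zero = distance(rotation_operator(0.0, e_z), IDENTITY)

# Rotations about the same axis compose as the geometrical rotations do.
err_group = distance(
    matmul(rotation_operator(0.3, u), rotation_operator(0.4, u)),
    rotation_operator(0.7, u),
)

# R is unitary.
r = rotation_operator(1.1, u)
err_unitary = distance(matmul(r, dagger(r)), IDENTITY)

# Rotation through 2*pi: minus the identity for spin 1/2, not the identity.
minus_identity = [[-1, 0], [0, -1]]
err_two_pi = distance(rotation_operator(2 * math.pi, e_z), minus_identity)

print(err_zero, err_group, err_unitary, err_two_pi)
```

All four printed numbers should be at the level of rounding errors. The last one illustrates the remark made above: the operator associated with a rotation through $2\pi$ need not be the identity.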


5. Rotation of observables 

We now know how the vector representing the state of a quantum mechanical 
system transforms under rotation. But in quantum mechanics, the state of a system 
and the physical quantities are described independently. Therefore, we shall now 
indicate what happens to observables upon rotation. 

a. GENERAL TRANSFORMATION LAW 

Consider an observable $A$, relating to a given physical system; we shall assume, 
to simplify the notation, the spectrum of $A$ to be discrete and non-degenerate: 

$$ A\,|u_n\rangle = a_n\,|u_n\rangle \tag{84} $$

In order to understand how this observable is affected by a rotation, we shall 
imagine that we have a device which can measure $A$ in the physical system under 
consideration. Now, the observable $A'$, the transform of $A$ with respect to the 
geometrical rotation $\mathscr{R}$, is by definition what is measured by the device when 
it has been subjected to the rotation $\mathscr{R}$. 

Let us assume the system to be in the eigenstate $|u_n\rangle$ of $A$: the device for 
measuring $A$ in this system will give the result $a_n$ without fail. But just before 
performing the measurement, we apply a rotation $\mathscr{R}$ to the physical system and, 
simultaneously, to the measurement device; their relative positions are unchanged. 
Consequently, if the observable $A$ which we are considering describes a physical 
quantity attached only to the system which we have rotated (that is, independent 
of other systems or devices which we have not rotated), then, in its new position, 
the measurement device will still give the same result $a_n$ without fail. Now, after 
rotation, the device, by definition, measures $A'$, and the system is in the state: 

$$ |u'_n\rangle = R\,|u_n\rangle \tag{85} $$

We must therefore have: 

$$ A\,|u_n\rangle = a_n\,|u_n\rangle \;\Longrightarrow\; A'\,|u'_n\rangle = a_n\,|u'_n\rangle \tag{86} $$

Combining (85) and (86), we find: 

$$ A' R\,|u_n\rangle = a_n R\,|u_n\rangle \tag{87} $$

that is: 

$$ R^\dagger A' R\,|u_n\rangle = a_n\,|u_n\rangle \tag{88} $$

since the inverse of $R$ is $R^\dagger$. The set of vectors $|u_n\rangle$ constitutes a basis in the state 
space ($A$ is an observable), so we have: 

$$ R^\dagger A' R = A \tag{89} $$

that is: 

$$ A' = R\, A\, R^\dagger \tag{90} $$


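In the special case of an infinitesimal rotation $\mathscr{R}_{\mathbf{u}}(d\alpha)$, the general expression 
(81), substituted into (90), gives, to first order in $d\alpha$: 

$$ A' = \left(1 - \frac{i}{\hbar}\, d\alpha\, \mathbf{J}\cdot\mathbf{u}\right) A \left(1 + \frac{i}{\hbar}\, d\alpha\, \mathbf{J}\cdot\mathbf{u}\right) = A - \frac{i}{\hbar}\, d\alpha\, [\mathbf{J}\cdot\mathbf{u}, A] \tag{91} $$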
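
The transformation law (90) and its first-order form (91) can be illustrated with a small numerical sketch. It assumes a spin 1/2 with $J_y = (\hbar/2)\,\sigma_y$ and takes $A = \sigma_z$ (that is, $S_z$ in units of $\hbar/2$); the function names are hypothetical helpers introduced only for this example.

```python
import math

SIGMA_X = [[0, 1], [1, 0]]
SIGMA_Y = [[0, -1j], [1j, 0]]
SIGMA_Z = [[1, 0], [0, -1]]
IDENTITY = [[1, 0], [0, 1]]


def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]


def dagger(a):
    n = len(a)
    return [[complex(a[j][i]).conjugate() for j in range(n)] for i in range(n)]


def combine(a, b, coeff):
    """Return a + coeff * b, element by element."""
    n = len(a)
    return [[a[i][j] + coeff * b[i][j] for j in range(n)] for i in range(n)]


def distance(a, b):
    n = len(a)
    return max(abs(a[i][j] - b[i][j]) for i in range(n) for j in range(n))


def rotation_about_y(alpha):
    """R = exp(-i * alpha * J_y / hbar) for spin 1/2 (closed form)."""
    c, s = math.cos(alpha / 2), math.sin(alpha / 2)
    return combine([[c, 0], [0, c]], SIGMA_Y, -1j * s)


def transform(a, r):
    """A' = R A R^dagger, formula (90)."""
    return matmul(matmul(r, a), dagger(r))


# A quarter turn about Oy carries e_z onto e_x, so S_z should become S_x.
err_quarter_turn = distance(transform(SIGMA_Z, rotation_about_y(math.pi / 2)), SIGMA_X)

# The spectrum is unchanged: A'^2 = 1, so the eigenvalues are still +1 and -1.
a_prime = transform(SIGMA_Z, rotation_about_y(0.9))
err_spectrum = distance(matmul(a_prime, a_prime), IDENTITY)

# First-order formula (91): A' ~ A - (i / hbar) * d_alpha * [J_y, A],
# with J_y / hbar = sigma_y / 2.
d_alpha = 1e-3
commutator = combine(matmul(SIGMA_Y, SIGMA_Z), matmul(SIGMA_Z, SIGMA_Y), -1)
first_order = combine(SIGMA_Z, commutator, -1j * d_alpha / 2)
err_first_order = distance(transform(SIGMA_Z, rotation_about_y(d_alpha)), first_order)

print(err_quarter_turn, err_spectrum, err_first_order)
```

The first two numbers should vanish up to rounding errors, while the third is of second order in `d_alpha`, as expected for an expansion truncated at first order.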


COMMENTS: 

(i) In the case of a spinless particle, relation (90) implies that: 

$$ \langle \mathbf{r}\,|\,A'\,|\,\mathbf{r}'\rangle = \langle \mathbf{r}\,|\,R A R^\dagger\,|\,\mathbf{r}'\rangle \tag{92} $$

Using (26) and (27), we therefore obtain: 

$$ \langle \mathbf{r}\,|\,A'\,|\,\mathbf{r}'\rangle = \langle \mathscr{R}^{-1}\mathbf{r}\,|\,A\,|\,\mathscr{R}^{-1}\mathbf{r}'\rangle \tag{93} $$

The transformation which enables us to obtain $A'$ from $A$ is therefore 
completely analogous to the one which gives $|\psi'\rangle$ in terms of $|\psi\rangle$ [cf. (19)]. 

(ii) Consider the case in which the observable $A$ is associated with a classical quantity 
$\mathscr{A}$. $\mathscr{A}$ is then a function of the positions $\mathbf{r}_i$ and momenta $\mathbf{p}_i$ of the 
particles which constitute the system; the operator $A$ is obtained from this 
function by applying the quantization rules given in chapter III. We know how 
to find the quantity $\mathscr{A}'$ associated with $\mathscr{A}$ by a rotation $\mathscr{R}$ in classical 
mechanics: for example, if $\mathscr{A}$ is a scalar, $\mathscr{A}'$ is the same as $\mathscr{A}$; if $\mathscr{A}$ is the 
component along an axis $Ou$ of a vectorial quantity, $\mathscr{A}'$ is the component 
of this same vectorial quantity along the axis which is the result of the 
transformation of $Ou$ by the rotation $\mathscr{R}$. We can also construct the quantum 
mechanical operator corresponding to $\mathscr{A}'$, by applying the same quantization 
rules as above. It can be shown that this operator is the same as the 
operator $A'$ given in (90); this is what is shown in figure 1. 



FIGURE 1 

Behavior, under a rotation $\mathscr{R}$, of a classical physical quantity $\mathscr{A}$ 
and of the associated observable $A$. 




b. SCALAR OBSERVABLES 

An observable $A$ is said to be scalar if: 

$$ A' = A \tag{94} $$

for all $R$. According to (91), this implies that: 

$$ [A, \mathbf{J}] = 0 \tag{95} $$

A scalar observable commutes with the three components of the total angular 
momentum. 

There are numerous examples of scalar observables. $\mathbf{J}^2$ is always a scalar 
(this results, as we saw in §B-2 of chapter VI, from the commutation relations 
which characterize an angular momentum). For a spinless particle, $\mathbf{R}^2$, $\mathbf{P}^2$ and $\mathbf{R}\cdot\mathbf{P}$, 
which correspond to classical scalar quantities, are scalars. It is easy to show, 
moreover (cf. §5-c below), that $\mathbf{R}^2$, $\mathbf{P}^2$ and $\mathbf{R}\cdot\mathbf{P}$ satisfy (95). We shall also see 
later (§6) that the Hamiltonian of an isolated physical system is a scalar. 

c. VECTOR OBSERVABLES 

A vector observable $\mathbf{V}$ is a set of three observables $V_x$, $V_y$, $V_z$ (its 
Cartesian components) which is transformed by rotation according to the 
characteristic law of vectors. The transform, under a rotation $\mathscr{R}$, of the component 
$V_u = \mathbf{V}\cdot\mathbf{u}$ of $\mathbf{V}$ along a given axis $Ou$ (of unit vector $\mathbf{u}$) must be the component 
$V_{u'} = \mathbf{V}\cdot\mathbf{u}'$ of $\mathbf{V}$ along the axis $Ou'$ derived from $Ou$ by the rotation $\mathscr{R}$. 

Consider, for example, the component $V_x$ of such an observable. We shall 
examine its behavior under infinitesimal rotations about each of the coordinate 
axes. $V_x$ is obviously unchanged by a rotation about $Ox$; according to (91), this 
can be expressed in the form: 

$$ [J_x, V_x] = 0 \tag{96} $$

If we perform a rotation $\mathscr{R}_{\mathbf{e}_y}(d\alpha)$ about the $Oy$ axis, the transform of $V_x$ is the 
observable $(V_x)'$ given by (91) as: 

$$ (V_x)' = V_x - \frac{i}{\hbar}\, d\alpha\, [J_y, V_x] \tag{97} $$

But $V_x$ is the component of $\mathbf{V}$ along the $Ox$ axis, of unit vector $\mathbf{e}_x$. The rotation 
$\mathscr{R}_{\mathbf{e}_y}(d\alpha)$ takes $\mathbf{e}_x$ onto $\mathbf{e}'_x$ such that [formula (6)]: 

$$ \mathbf{e}'_x = \mathbf{e}_x + d\alpha\; \mathbf{e}_y \times \mathbf{e}_x = \mathbf{e}_x - d\alpha\; \mathbf{e}_z \tag{98} $$

Consequently, if $\mathbf{V}$ is a vector observable, $(V_x)'$ must be the same as $\mathbf{V}\cdot\mathbf{e}'_x$: 

$$ (V_x)' = \mathbf{V}\cdot\mathbf{e}_x - d\alpha\; \mathbf{V}\cdot\mathbf{e}_z = V_x - d\alpha\, V_z \tag{99} $$

Comparing (97) and (99), we see that: 

$$ [V_x, J_y] = i\hbar\, V_z \tag{100} $$

For an infinitesimal rotation $\mathscr{R}_{\mathbf{e}_z}(d\alpha)$ about the $Oz$ axis, an argument analogous to 
the one above leads to the relation: 

$$ [J_z, V_x] = i\hbar\, V_y \tag{101} $$

By studying the behavior of $V_y$ and $V_z$ under infinitesimal rotations, one can prove 
the formulas which can be derived from (96), (100) and (101) by cyclic permutation 
of the indices $x$, $y$, $z$. The set of relations obtained in this way is characteristic of 
a vector observable: they imply that an arbitrary infinitesimal rotation transforms 
$\mathbf{V}\cdot\mathbf{u}$ into $\mathbf{V}\cdot\mathbf{u}'$, where $\mathbf{u}'$ is the transform of $\mathbf{u}$ with respect to the rotation under 
consideration. 

It is clear that the angular momentum J itself is a vector observable; (96), 
(100) and (101) then follow from the commutation relations characterizing angular 
momenta. For a system composed of a single spinless particle, R and P are vector 
observables, as can easily be verified from the canonical commutation relations. 
Thus the vector notation we use for R, P, L and J is justified. 
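
As a concrete check of relations (96), (100) and (101) in the case $\mathbf{V} = \mathbf{J}$, the following sketch uses the standard matrices of an angular momentum $j = 1$ in the basis $m = +1, 0, -1$, with $\hbar = 1$ (an assumed explicit representation, chosen only for illustration). It also verifies that $\mathbf{J}^2$ commutes with the three components, as a scalar observable must.

```python
import math

S = 1 / math.sqrt(2)

# Matrices of J_x, J_y, J_z for j = 1 (basis m = +1, 0, -1; hbar = 1).
JX = [[0, S, 0], [S, 0, S], [0, S, 0]]
JY = [[0, -1j * S, 0], [1j * S, 0, -1j * S], [0, 1j * S, 0]]
JZ = [[1, 0, 0], [0, 0, 0], [0, 0, -1]]


def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]


def combine(a, b, coeff):
    """Return a + coeff * b, element by element."""
    n = len(a)
    return [[a[i][j] + coeff * b[i][j] for j in range(n)] for i in range(n)]


def commutator(a, b):
    return combine(matmul(a, b), matmul(b, a), -1)


def largest_element(a):
    return max(abs(x) for row in a for x in row)


# Vector-observable relations with V = J:
err_96 = largest_element(commutator(JX, JX))                     # [J_x, V_x] = 0
err_100 = largest_element(combine(commutator(JX, JY), JZ, -1j))  # [V_x, J_y] = i V_z
err_101 = largest_element(combine(commutator(JZ, JX), JY, -1j))  # [J_z, V_x] = i V_y

# J^2 is a scalar: it commutes with every component of J.
j_squared = combine(combine(matmul(JX, JX), matmul(JY, JY), 1), matmul(JZ, JZ), 1)
err_scalar = max(largest_element(commutator(j_squared, j)) for j in (JX, JY, JZ))

print(err_96, err_100, err_101, err_scalar)
print(j_squared[0][0])  # j(j + 1) = 2 for j = 1, up to rounding
```

For $j = 1$ the matrix of $\mathbf{J}^2$ is simply $2$ times the unit matrix, so its commutation with $J_x$, $J_y$, $J_z$ is immediate; the same helper functions can be reused with any other set of angular momentum matrices.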


COMMENTS: 

(i) The scalar product $\mathbf{V}\cdot\mathbf{W}$ of two vector observables, defined by the customary 
formula: 

$$ \mathbf{V}\cdot\mathbf{W} = V_x W_x + V_y W_y + V_z W_z \tag{102} $$

is a scalar operator. To verify this, we can calculate, for example, the commutator 
of $\mathbf{V}\cdot\mathbf{W}$ with $J_x$: 

$$ \begin{aligned} [\mathbf{V}\cdot\mathbf{W}, J_x] &= [V_y W_y, J_x] + [V_z W_z, J_x] \\ &= V_y [W_y, J_x] + [V_y, J_x] W_y + V_z [W_z, J_x] + [V_z, J_x] W_z \\ &= -i\hbar\, V_y W_z - i\hbar\, V_z W_y + i\hbar\, V_z W_y + i\hbar\, V_y W_z \\ &= 0 \end{aligned} \tag{103} $$

We have already pointed out that $\mathbf{J}^2$, $\mathbf{R}^2$, $\mathbf{P}^2$ and $\mathbf{R}\cdot\mathbf{P}$ are scalar observables. 

(ii) It is the total angular momentum of the system under study which appears in 
relations (96), (100) and (101). The following example illustrates the importance 
of this fact: if, for a two-particle system, we were to use $\mathbf{L}_1$ instead of 
$\mathbf{L} = \mathbf{L}_1 + \mathbf{L}_2$, $\mathbf{R}_2$ would appear to be a set of three scalar observables and 
not a vector observable. 


6. Rotation invariance 

The discussion presented in the preceding sections does not have as its sole 
purpose the justification of the definition of angular momenta in terms of the 
commutation relations. The importance of rotations in physics is essentially 
related to the fact that physical laws are rotation invariant. We are going to explain 
in this section exactly what this means, and we shall indicate some of the 
consequences of this fundamental property. 
a. INVARIANCE OF PHYSICAL LAWS 

Consider a physical system (S), classical or quantum mechanical, which we 
subject to a rotation $\mathscr{R}$ at some given time. If we take the precaution of rotating, 
at the same time as the system (S) under consideration, all other systems or devices 
which can influence it, the physical properties and behavior of (S) are not 
modified. This means that the physical laws governing the system have remained 
the same: the physical laws are said to be rotation invariant. Note that this property 
is not at all obvious: there exist transformations (those of similarity*, for 
example) with respect to which the physical laws are not invariant**. It is therefore 
advisable to consider rotation invariance to be a postulate which is justified by 
the experimental verification of its consequences. 

When we say that the physical properties and behavior of a system are 
unchanged by a rotation performed at the time $t_0$, this statement covers two 
observations: 

(i) the properties of the system at this time are not modified (although the 
description of the state of the system and the physical quantities are; see preceding 
sections). In quantum mechanics, this implies that the transform A' of an 
arbitrary observable A has the same spectrum, and that the probability of finding 
one of the eigenvalues of this spectrum in a measurement of A' on the system after 
rotation is the same as it was for the measurement of A on the system before rotation. 
From this, it can be deduced that the operators R describing rotations in state space 
are linear and unitary, or antilinear and unitary (that is, anti-unitary***). 

(ii) the time evolution of the system is not affected. To state this point more 
precisely, let us denote the state of the system by $|\psi(t_0)\rangle$; under a rotation performed 
at $t_0$, this state becomes: 

$$ |\psi'(t_0)\rangle = R\,|\psi(t_0)\rangle \tag{104} $$

We now let the system evolve freely and compare its state $|\psi'(t)\rangle$ at a subsequent 
time $t$ to the state $|\psi(t)\rangle$ which it would have attained if it had been allowed to 
evolve freely from $|\psi(t_0)\rangle$. If the behavior of the system is not modified, we must 
have: 

$$ |\psi'(t)\rangle = R\,|\psi(t)\rangle \tag{105} $$

that is, for all $t$, the state $|\psi'(t)\rangle$ must be obtained from $|\psi(t)\rangle$ by the same 
rotation as in (104). Therefore, if $|\psi(t)\rangle$ is a solution of the Schrödinger equation, 
$R\,|\psi(t)\rangle$ is also a solution of this equation: the transform of a possible motion of 
the system is also a possible motion. We shall see in §b that this implies that the 
Hamiltonian $H$ of the system is a scalar observable. 


* Consider, for example, a hydrogen atom. If we multiply the distance between the proton and 
the electron by a constant $k \neq 1$ (without modifying the charges and masses of the particles), we obtain a 
system whose evolution no longer obeys either classical or quantum mechanical physical laws. 

** Let us also point out that experiments have shown that the laws governing the β-decay of 
nuclei are not invariant under reflection with respect to a plane (non-conservation of parity). 

*** All transformations which leave the physical laws invariant are described by unitary operators, 
except for time reversal, with which is associated an anti-unitary operator. 
The invariance of physical laws under rotation is related to the symmetry 
properties of the equations which state these laws mathematically. To understand 
the origin of these symmetries, consider, for example, a system composed of a single 
(spinless) particle. The expression of the physical laws governing such a system 
explicitly involves the parameters $\mathbf{r}(x, y, z)$ and $\mathbf{p}(p_x, p_y, p_z)$ which characterize the 
position of the particle and its momentum: in classical mechanics, $\mathbf{r}$ and $\mathbf{p}$ define 
at each instant the state of the particle; in quantum mechanics, although the 
meaning of these parameters is a little less simple, they appear in the wave 
function $\psi(\mathbf{r})$ and its Fourier transform $\bar{\psi}(\mathbf{p})$. When the particle is subjected to an 
instantaneous rotation $\mathscr{R}$, $\mathbf{r}$ and $\mathbf{p}$ are transformed into $\mathbf{r}'$ and $\mathbf{p}'$ such that: 

$$ \mathbf{r}' = \mathscr{R}\,\mathbf{r}, \qquad \mathbf{p}' = \mathscr{R}\,\mathbf{p} \tag{106} $$

If we replace $\mathbf{r}$ by $\mathscr{R}^{-1}\mathbf{r}'$ and $\mathbf{p}$ by $\mathscr{R}^{-1}\mathbf{p}'$ in the equations which express the 
physical laws, we obtain relations which now involve $\mathbf{r}'$ and $\mathbf{p}'$. The invariance 
of physical laws under the rotation $\mathscr{R}$ thus implies that the form of the equations 
for $\mathbf{r}'$ and $\mathbf{p}'$ is the same as that of the equations for $\mathbf{r}$ and $\mathbf{p}$: simply omitting the primes 
labeling the new parameters must give us back the initial equations. It is clear that 
this considerably restricts the number of possible forms of these equations. 

COMMENTS: 

(i) What happens when we perform a rotation on a system which is not isolated? Consider, 
for example, a particle in an external potential. If we rotate this system, without 
simultaneously rotating the sources of the external potential, the subsequent evolution 
of the system is, in general, modified*. In classical mechanics, the forces exerted on the 
particle are not the same in its new position. In quantum mechanics, $\psi'(\mathbf{r}, t) = \psi(\mathscr{R}^{-1}\mathbf{r}, t)$ 
is a solution of a Schrödinger equation in which the potential $V(\mathbf{r})$ is replaced 
by $V(\mathscr{R}^{-1}\mathbf{r})$, which is, in general, different from $V(\mathbf{r})$. Therefore, the transform of a 
possible motion is no longer a possible motion. The presence of the external potential 
destroys, so to speak, the homogeneity of the space in which the system under study 
evolves. 

However, the external potential may present certain symmetries which allow 
certain rotations to be performed on the physical system without its behavior being 
modified. If there exist rotations $\mathscr{R}_0$ such that $V(\mathscr{R}_0^{-1}\mathbf{r})$ is the same as $V(\mathbf{r})$, the 
properties of the system are unchanged by one of these rotations $\mathscr{R}_0$. This is the case, for 
example, for central potentials, that is, those which depend only on the distance to a fixed 
point $O$: the rotations $\mathscr{R}_0$ are then all rotations which conserve the point $O$ (cf. chap. VII). 

(ii) Let us return to the case of isolated physical systems. Thus far, we have adopted an 
“active” viewpoint: the observer remains fixed, and the physical system is rotated. 


* If the particle is placed in a vector potential, its properties immediately after the rotation may be 
profoundly modified. Consider, for example, a spinless particle in an external magnetic field. According 
to the transformation law (19), the probability current given by formula (D-20) of chapter III cannot in 
general be derived by rotation of the initial current, as it depends on the vector potential describing the 
magnetic field. 

The physical interpretation of this phenomenon is as follows. We can imagine that, instead of 
rotating the particle, we rotate the magnetic field rapidly and in the opposite direction. The wave function 
has not had the time to change : this corresponds to formula (19). If the physical properties are modified, 
it is because an induced electromotive field has appeared and acted on the particle. This action does 
not depend on the exact way in which we rotate the magnetic field, provided that we do so quickly enough. 
We can also define a “passive” viewpoint: the observer rotates, and, without touching 
the system being studied, uses a new coordinate frame, derived from the initial frame 
by the given rotation. Rotation invariance is then expressed in the following way : in 
his new position (that is, using his new coordinate axes), the observer describes physical 
phenomena by laws which have the same form as in the old frame. Nothing allows him 
to assert that one of his positions is more fundamental than the other : it is impossible 
to define an absolute orientation in space by the study of any physical phenomenon. 
It is clear that, for an isolated system, a “passive” rotation is equivalent to the “active” 
rotation through an equal angle about the opposite axis. 

b. CONSEQUENCE: CONSERVATION OF ANGULAR MOMENTUM 

We indicated in §6-a that rotation invariance is related to symmetry properties 
of the equations which express the physical laws. Here, we shall study the case of 
the Schrödinger equation, and we shall show that the Hamiltonian of an isolated 
physical system is a scalar observable. 

Consider an isolated system in the state $|\psi(t_0)\rangle$. We perform an arbitrary 
rotation $\mathscr{R}$ at time $t_0$; the state of the system becomes: 

$$ |\psi'(t_0)\rangle = R\,|\psi(t_0)\rangle \tag{107} $$

where $R$ is the “image” of the rotation $\mathscr{R}$. If we now let the system evolve freely 
from $|\psi'(t_0)\rangle$, its state at the instant $t_0 + dt$, according to the Schrödinger 
equation, will be: 

$$ |\psi'(t_0 + dt)\rangle = |\psi'(t_0)\rangle + \frac{dt}{i\hbar}\, H\,|\psi'(t_0)\rangle \tag{108} $$

Now, if we had not performed the rotation, the state of the system at time $t_0 + dt$ 
would have been: 

$$ |\psi(t_0 + dt)\rangle = |\psi(t_0)\rangle + \frac{dt}{i\hbar}\, H\,|\psi(t_0)\rangle \tag{109} $$

Rotation invariance implies (cf. §6-a) that: 

$$ |\psi'(t_0 + dt)\rangle = R\,|\psi(t_0 + dt)\rangle \tag{110} $$

where $R$ is the same as in (107). According to the two preceding equations, this 
implies that: 

$$ R H\,|\psi(t_0)\rangle = H\,|\psi'(t_0)\rangle \tag{111} $$

that is: 

$$ R H\,|\psi(t_0)\rangle = H R\,|\psi(t_0)\rangle \tag{112} $$


Since $|\psi(t_0)\rangle$ is arbitrary, it follows that $H$ commutes with all rotation operators. 
For this to be so, it is necessary and sufficient that $H$ commute with the infinitesimal 
rotation operators, that is, with the three components of the total angular momentum 
$\mathbf{J}$ of the system: 

$$ [H, \mathbf{J}] = 0 \tag{113} $$

H is therefore a scalar observable. 
Rotation invariance is therefore related to the fact that the total angular 
momentum of an isolated system is a constant of the motion : conservation of angular 
momentum can be seen to be a consequence of rotation invariance. 

COMMENTS: 

(i) The Hamiltonian of a non-isolated system is not, in general, a scalar. However, if certain 
rotations exist which leave the system invariant [comment (i) of §6-a], the Hamiltonian 
commutes with the corresponding operators. Thus, the Hamiltonian of a particle in a 
central potential commutes with the operator L associated with the angular momentum 
of the particle with respect to the center of forces. 

(ii) For an isolated system composed of several interacting particles, the Hamiltonian 
commutes with the total angular momentum. However, it does not generally commute 
with the individual angular momentum of each particle. For the transform of a possible 
motion to remain a possible motion, the rotation must be performed on the whole system, 
not on only some of the particles. 
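
Comment (ii) can be illustrated on a toy model. The sketch below assumes two spins 1/2 coupled by a Hamiltonian of the form $H = \mathbf{S}_1\cdot\mathbf{S}_2$ (with $\hbar = 1$ and a coupling constant set to 1); this model and the helper names are assumptions made for the example, not something taken from the text. It checks that $H$ commutes with the components of the total angular momentum $\mathbf{J} = \mathbf{S}_1 + \mathbf{S}_2$ but not with those of $\mathbf{S}_1$ alone.

```python
# Spin-1/2 operators S = sigma / 2 (hbar = 1).
HALF_SIGMA = {
    "x": [[0, 0.5], [0.5, 0]],
    "y": [[0, -0.5j], [0.5j, 0]],
    "z": [[0.5, 0], [0, -0.5]],
}
IDENTITY_2 = [[1, 0], [0, 1]]
AXES = ("x", "y", "z")


def kron(a, b):
    """Tensor (Kronecker) product of two square matrices."""
    return [
        [a[i][j] * b[k][l] for j in range(len(a)) for l in range(len(b))]
        for i in range(len(a))
        for k in range(len(b))
    ]


def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]


def combine(a, b, coeff):
    """Return a + coeff * b, element by element."""
    n = len(a)
    return [[a[i][j] + coeff * b[i][j] for j in range(n)] for i in range(n)]


def commutator(a, b):
    return combine(matmul(a, b), matmul(b, a), -1)


def largest_element(a):
    return max(abs(x) for row in a for x in row)


# Operators acting in the four-dimensional state space of the two spins.
S1 = {k: kron(HALF_SIGMA[k], IDENTITY_2) for k in AXES}
S2 = {k: kron(IDENTITY_2, HALF_SIGMA[k]) for k in AXES}
J = {k: combine(S1[k], S2[k], 1) for k in AXES}

# Toy Hamiltonian H = S1 . S2 (illustrative choice).
H = [[0] * 4 for _ in range(4)]
for k in AXES:
    H = combine(H, matmul(S1[k], S2[k]), 1)

total_commutators = {k: largest_element(commutator(H, J[k])) for k in AXES}
single_commutators = {k: largest_element(commutator(H, S1[k])) for k in AXES}

print(total_commutators)   # all zero up to rounding: [H, J] = 0
print(single_commutators)  # all non-zero: H does not commute with S1 alone
```

In this model, rotating only one of the two spins changes the energy of the pair, which is why only the total angular momentum is a constant of the motion.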

c. APPLICATIONS 

We have just shown that rotation invariance means that the total angular 
momentum $\mathbf{J}$ of an isolated system is a constant of the motion in the quantum 
mechanical sense. It is therefore useful to determine the stationary states of 
such a system (eigenstates of the Hamiltonian) which are also eigenstates of $\mathbf{J}^2$ 
and $J_z$. We can then choose for the state space a standard basis, $\{\,|k, j, m\rangle\,\}$, 
composed of eigenstates common to $H$, $\mathbf{J}^2$ and $J_z$: 

$$ \begin{aligned} H\,|k, j, m\rangle &= E\,|k, j, m\rangle \\ \mathbf{J}^2\,|k, j, m\rangle &= j(j + 1)\hbar^2\,|k, j, m\rangle \\ J_z\,|k, j, m\rangle &= m\hbar\,|k, j, m\rangle \end{aligned} \tag{114} $$

α. Essential rotational degeneracy 

Since the Hamiltonian $H$ is a scalar observable, it commutes with $J_+$ and 
$J_-$. From this fact, we deduce that $|k, j, m + 1\rangle$ and $|k, j, m - 1\rangle$, which 
are respectively proportional to $J_+\,|k, j, m\rangle$ and $J_-\,|k, j, m\rangle$, are eigenstates 
of $H$ with the same eigenvalue as $|k, j, m\rangle$ [the argument is the same as for 
formula (C-48) of chapter VI]. Thus it can be shown by iteration that the $(2j + 1)$ 
vectors of the standard basis characterized by the given values of $k$ and $j$ have 
the same energy. The corresponding degeneracy of the eigenvalues of $H$ is called 
“essential” because it arises from rotation invariance and occurs for any form 
of the Hamiltonian $H$. Of course, in certain cases, the energy levels can present 
additional degeneracies, which are called “accidental”. We shall see an example of 
this in chapter VII, §C. 
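
Written out explicitly, the step used in this argument is the following (with $J_\pm = J_x \pm i J_y$, so that $[H, J_\pm] = 0$ follows from the fact that $H$ commutes with $J_x$ and $J_y$):

```latex
\begin{aligned}
H\,\bigl(J_{\pm}\,|k, j, m\rangle\bigr)
  &= J_{\pm}\,H\,|k, j, m\rangle && \text{since } [H, J_{\pm}] = 0 \\
  &= E\,\bigl(J_{\pm}\,|k, j, m\rangle\bigr)
\end{aligned}
```

Whenever $J_\pm\,|k, j, m\rangle$ is not the null vector, it is therefore an eigenvector of $H$ with the same eigenvalue $E$, and repeating the step from $m = j$ down to $m = -j$ covers all $(2j + 1)$ vectors.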

β. Matrix elements of observables in a standard basis 

When we study a physical quantity in an isolated system, the knowledge 
of the behavior of the associated observable under a rotation enables us to 
establish some of its properties, without having to consider its precise form. 
We can predict that only some of its matrix elements in a standard basis such 
as $\{\,|k, j, m\rangle\,\}$ will be different from zero, and we can give the relations between 
them. Thus, a scalar observable has matrix elements only between two basis 
vectors whose values of $j$ are equal, as are their values of $m$ [this results from 
the fact that this observable commutes with $\mathbf{J}^2$ and $J_z$; cf. theorems on 
commuting observables, §D-3-a of chapter II]. Moreover, these non-zero elements 
are independent of $m$ (since the scalar observable also commutes with $J_+$ and $J_-$). 
For vector or tensor observables, these properties are contained in the Wigner-Eckart 
theorem, which we shall prove later in a special case (cf. complement $D_X$), 
and which is frequently used in the areas of physics in which phenomena are 
treated by quantum mechanics (atomic, molecular, and nuclear physics, elementary 
particle physics, etc.). 


References and suggestions for further reading: 

Symmetry and conservation laws: Feynman III (1.2), chap. 17; Schiff (1.18), 
chap. 7; Messiah (1.17), chap. XV; see also the articles by Morrison (2.28), Feinberg 
and Goldhaber (2.29), Wigner (2.30). 

Relation with group theory: Messiah (1.17), appendix D; Meijer and Bauer (2.18), 
chaps. 5 and 6; Bacry (10.31), chap. 6; Wigner (2.23), chaps. 14 and 15. 
Complement $C_{VI}$ 

ROTATION OF DIATOMIC MOLECULES 


1. Introduction 

2. The rigid rotator. Classical study 

a. Notation 

b. Motion of the rotator. Angular momentum and energy 

c. The fictitious particle associated with the rotator 

3. Quantization of the rigid rotator 

a. The quantum state and observables of the rotator 

b. Eigenstates and eigenvalues of the Hamiltonian 

c. Study of the observable Z 

4. Experimental evidence for the rotation of molecules 

a. Heteropolar molecules. Pure rotational spectra 

b. Homopolar molecules. Raman rotational spectra 


1. Introduction 

In §1 of complement $A_V$, we studied the vibrations of the two nuclei of 
a diatomic molecule about their equilibrium position, neglecting the rotation of 
these two nuclei about their center of mass. Thus we obtained stationary vibrational 
states of energies $E_v$ whose wave functions $\varphi_v(r)$ depended only on the 
distance $r$ between the nuclei. 

Here, we shall adopt a complementary point of view: we shall study the 
rotation of the two nuclei about their center of mass, neglecting their vibrations. 
That is, we shall assume that the distance $r$ between them remains fixed and 
equal to $r_e$ (where $r_e$ represents the distance between the two nuclei in the stable 
equilibrium position of the molecule; see figure 1 of complement $A_V$). The wave 
functions of the stationary rotational states then can depend only on the polar 
angles $\theta$ and $\varphi$ which define the direction of the molecular axis. We shall see that 
these wave functions are the spherical harmonics $Y_l^m(\theta, \varphi)$ [studied in chapter VI 
(§D-1) and in complement $A_{VI}$], and that they correspond to a rotational 
energy $E_l$ which depends only on $l$. 

Actually, in the center of mass frame, the molecule both rotates and vibrates, 
and the wave functions of its stationary states must be functions of the three 
variables $r$, $\theta$ and $\varphi$. We shall show in complement $F_{VII}$ that, to a first 
approximation, these wave functions are of the form $\varphi_v(r)\, Y_l^m(\theta, \varphi)$ and correspond 
to the energy $E_v + E_l$. This result justifies the approach adopted here, which 
consists of considering only one degree of freedom (rotational or vibrational) 
at a time*. We shall begin in §2 by presenting the classical study of a system 
of two masses separated by a fixed distance (rigid rotator). The quantum 
mechanical treatment of this problem will then be taken up in §3, where we shall 
use the results of chapter VI concerning orbital angular momentum. Finally, 
in §4, we shall describe some experimental manifestations of the rotation of 
diatomic molecules (pure and Raman rotational spectra). 

* In complement $F_{VII}$, we shall also study the corrections introduced by the coupling between 
the vibrational and rotational degrees of freedom. 


2. Rigid rotator. Classical study 
a. NOTATION 

Two particles, of masses $m_1$ and $m_2$, are separated by a fixed distance $r_e$. 
Their center of mass $O$ is chosen as the origin of a coordinate frame $Oxyz$ with 
respect to which the direction of the axis connecting them is defined by means 
of the polar angles $\theta$ and $\varphi$ (fig. 1). The distances $OM_1$ and $OM_2$ are denoted 
respectively by $r_1$ and $r_2$; by definition of the center of mass: 

$$ m_1 r_1 = m_2 r_2 \tag{1} $$

which allows us to write: 

$$ \frac{r_1}{m_2} = \frac{r_2}{m_1} = \frac{r_e}{m_1 + m_2} \tag{2} $$



FIGURE 1 

Parameters defining the position of the rigid 
rotator $M_1M_2$ whose center of mass is at the 
origin $O$ of the reference frame; the distances $r_1$ 
and $r_2$ are fixed; only the polar angles $\theta$ and $\varphi$ 
can vary. 


The moment of inertia $I$ of the system with respect to $O$ is equal to: 

$$ I = m_1 r_1^2 + m_2 r_2^2 \tag{3} $$

Introducing the reduced mass: 

$$ \mu = \frac{m_1 m_2}{m_1 + m_2} \tag{4} $$

and using (2), we can put $I$ in the form: 

$$ I = \mu\, r_e^2 \tag{5} $$


b. MOTION OF THE ROTATOR. ANGULAR MOMENTUM AND ENERGY 


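If no external force acts on the rotator, the total angular momentum $\boldsymbol{\mathscr{L}}$ of 
the system with respect to the point $O$ is a constant of the motion. The rotator 
therefore rotates about $O$ in a plane perpendicular to the fixed vector $\boldsymbol{\mathscr{L}}$ with 
a constant angular velocity $\omega_R$. The modulus of $\boldsymbol{\mathscr{L}}$ is related to $\omega_R$ by: 

$$ |\boldsymbol{\mathscr{L}}| = m_1 r_1 \cdot r_1 \omega_R + m_2 r_2 \cdot r_2 \omega_R = I\, \omega_R \tag{6} $$

that is, using (5): 

$$ |\boldsymbol{\mathscr{L}}| = \mu\, r_e^2\, \omega_R \tag{7} $$

The rotational frequency of the system $\nu_R = \omega_R / 2\pi$ is proportional to the angular 
momentum $|\boldsymbol{\mathscr{L}}|$ and inversely proportional to the moment of inertia $I$. 

In the center of mass frame, the total energy $\mathscr{H}$ of the system reduces to the 
rotational kinetic energy: 

$$ \mathscr{H} = \frac{1}{2}\, I\, \omega_R^2 \tag{8} $$

which can also be written, using (6) and (5): 

$$ \mathscr{H} = \frac{\boldsymbol{\mathscr{L}}^2}{2I} = \frac{\boldsymbol{\mathscr{L}}^2}{2\mu\, r_e^2} \tag{9} $$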


c. THE FICTITIOUS PARTICLE ASSOCIATED WITH THE ROTATOR 

Formulas (5), (7) and (9) show that the problem we are studying here is 
formally equivalent to that of a fictitious particle of mass $\mu$ forced to remain 
at a fixed distance $r_e$ from the point $O$, about which it rotates with the angular 
velocity $\omega_R$. $\boldsymbol{\mathscr{L}}$ is the angular momentum of this fictitious particle with respect to $O$. 
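
The classical relations of this section are easy to check numerically. The following sketch uses arbitrary illustrative values for the masses, the distance and the angular velocity (in arbitrary consistent units); the function names are hypothetical helpers. It verifies that the moment of inertia computed from the two masses coincides with that of the fictitious particle of reduced mass, and that the two expressions of the energy agree.

```python
import math


def reduced_mass(m1, m2):
    """Reduced mass of the two particles, formula (4)."""
    return m1 * m2 / (m1 + m2)


def distances_to_center_of_mass(m1, m2, r_e):
    """Distances r1 = OM1 and r2 = OM2, from formula (2)."""
    total = m1 + m2
    return m2 * r_e / total, m1 * r_e / total


def moment_of_inertia(m1, m2, r_e):
    """Moment of inertia with respect to O, formula (3)."""
    r1, r2 = distances_to_center_of_mass(m1, m2, r_e)
    return m1 * r1 ** 2 + m2 * r2 ** 2


# Arbitrary illustrative values (not data for any real molecule).
m1, m2, r_e, omega = 1.0, 3.0, 2.0, 2.0

r1, r2 = distances_to_center_of_mass(m1, m2, r_e)
inertia = moment_of_inertia(m1, m2, r_e)
inertia_fictitious = reduced_mass(m1, m2) * r_e ** 2      # formula (5)

angular_momentum = inertia * omega                        # formula (6)
energy_from_omega = 0.5 * inertia * omega ** 2            # formula (8)
energy_from_l = angular_momentum ** 2 / (2 * inertia)     # formula (9)

print(r1, r2, inertia, inertia_fictitious)
print(angular_momentum, energy_from_omega, energy_from_l)
```

With these values, `r1 + r2` gives back `r_e`, and both the two moments of inertia and the two energies coincide, as formulas (5) and (9) require.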


3. Quantization of the rigid rotator 


a. THE QUANTUM MECHANICAL STATE AND OBSERVABLES OF THE ROTATOR 


Since $r_e$ is fixed, the parameters defining the position of the rotator (or 
that of the associated fictitious particle) are the polar angles $\theta$ and $\varphi$ of figure 1. 
The quantum mechanical state of the rotator will then be described by a wave 
function $\psi(\theta, \varphi)$ which depends only on these two parameters. $\psi(\theta, \varphi)$ is square-integrable; 
we shall assume it to be normalized: 

$$ \int_0^{2\pi} d\varphi \int_0^{\pi} \sin\theta\, d\theta\; |\psi(\theta, \varphi)|^2 = 1 \tag{10} $$
The physical interpretation of $\psi(\theta, \varphi)$ is the following: $|\psi(\theta, \varphi)|^2 \sin\theta\, d\theta\, d\varphi$ 
represents the probability of finding the axis of the rotator pointing in the solid 
angle element $d\Omega = \sin\theta\, d\theta\, d\varphi$ about the direction of polar angles $\theta$ and $\varphi$. 

Using Dirac notation, we associate with every square-integrable function 
$\psi(\theta, \varphi)$ a ket $|\psi\rangle$ of the state space $\mathscr{E}_\Omega$: 

$$ \psi(\theta, \varphi) \;\Longleftrightarrow\; |\psi\rangle \in \mathscr{E}_\Omega \tag{11} $$

The scalar product of $|\psi\rangle$ and $|\chi\rangle$ is, by definition: 

$$ \langle \chi\,|\,\psi\rangle = \int d\Omega\; \chi^*(\theta, \varphi)\, \psi(\theta, \varphi) \tag{12} $$

where $\chi(\theta, \varphi)$ and $\psi(\theta, \varphi)$ are the wave functions associated with $|\chi\rangle$ and $|\psi\rangle$. 
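
The scalar product over the angles can be estimated numerically. The sketch below is a minimal illustration that uses a simple midpoint rule and the standard explicit forms of the spherical harmonics $Y_0^0$, $Y_1^0$ and $Y_1^1$ (quoted here as an assumption, with the usual phase convention); the function names and the grid sizes are choices made for this example.

```python
import cmath
import math


def y00(theta, phi):
    return complex(1 / math.sqrt(4 * math.pi))


def y10(theta, phi):
    return complex(math.sqrt(3 / (4 * math.pi)) * math.cos(theta))


def y11(theta, phi):
    return -math.sqrt(3 / (8 * math.pi)) * math.sin(theta) * cmath.exp(1j * phi)


def scalar_product(chi, psi, n_theta=100, n_phi=100):
    """Midpoint-rule estimate of the integral of chi* psi over all directions."""
    d_theta = math.pi / n_theta
    d_phi = 2 * math.pi / n_phi
    total = 0j
    for i in range(n_theta):
        theta = (i + 0.5) * d_theta
        weight = math.sin(theta) * d_theta * d_phi  # solid-angle element
        for j in range(n_phi):
            phi = (j + 0.5) * d_phi
            total += chi(theta, phi).conjugate() * psi(theta, phi) * weight
    return total


norm_y10 = scalar_product(y10, y10)
norm_y11 = scalar_product(y11, y11)
overlap_00_10 = scalar_product(y00, y10)
overlap_10_11 = scalar_product(y10, y11)

print(norm_y10, norm_y11, overlap_00_10, overlap_10_11)
```

The two norms come out close to 1 and the two overlaps close to 0, within the accuracy of the quadrature. A finer grid or a better integration rule would reduce the residual error.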

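The quantum mechanical Hamiltonian $H$ of the rotator (or of the associated 
fictitious particle) can be obtained by replacing $\boldsymbol{\mathscr{L}}^2$ in expression (9) for the 
classical energy by the operator $\mathbf{L}^2$ studied in §D of chapter VI: 

$$ H = \frac{\mathbf{L}^2}{2\mu\, r_e^2} \tag{13} $$

$H$ is an operator acting in $\mathscr{E}_\Omega$. According to formula (D-6-a) of chapter VI, 
if $|\psi\rangle$ is represented by the wave function $\psi(\theta, \varphi)$, $H\,|\psi\rangle$ is represented by: 

$$ H\,|\psi\rangle \;\Longleftrightarrow\; -\frac{\hbar^2}{2\mu\, r_e^2} \left[ \frac{\partial^2}{\partial\theta^2} + \frac{1}{\tan\theta}\, \frac{\partial}{\partial\theta} + \frac{1}{\sin^2\theta}\, \frac{\partial^2}{\partial\varphi^2} \right] \psi(\theta, \varphi) \tag{14} $$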


Other observables of interest, which we shall study later, are those which 
correspond to the three algebraic projections $x$, $y$, $z$ of the segment $M_1M_2$ 
($x$, $y$, $z$ are also the coordinates of the fictitious particle): 

$$ \begin{aligned} x &= r_e \sin\theta \cos\varphi \\ y &= r_e \sin\theta \sin\varphi \\ z &= r_e \cos\theta \end{aligned} \tag{15} $$

The importance of these variables will be seen in §4-a. The observables $X$, $Y$, $Z$ 
corresponding to $x$, $y$, $z$ act in $\mathscr{E}_\Omega$. With the kets $X\,|\psi\rangle$, $Y\,|\psi\rangle$, $Z\,|\psi\rangle$ are 
associated the functions: 

$$ \begin{aligned} X\,|\psi\rangle &\;\Longleftrightarrow\; r_e \sin\theta \cos\varphi\; \psi(\theta, \varphi) \\ Y\,|\psi\rangle &\;\Longleftrightarrow\; r_e \sin\theta \sin\varphi\; \psi(\theta, \varphi) \\ Z\,|\psi\rangle &\;\Longleftrightarrow\; r_e \cos\theta\; \psi(\theta, \varphi) \end{aligned} \tag{16} $$

COMMENT: 

As we have already pointed out in the introduction, the true wave functions of the 
molecule depend on $r$, $\theta$, $\varphi$. Similarly, the observables of this molecule, obtained from the 
corresponding classical quantities by the quantization rules of chapter III, act on these 
functions of three variables and not solely on the functions of $\theta$ and $\varphi$. In complement $F_{VII}$, 
we shall justify the point of view we are adopting here, namely, ignoring the radial 
part of the wave functions and considering $r$ to be a fixed parameter which is equal 
to $r_e$ [cf. formulas (14) and (16)]. 
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b. EIGENSTATES AND EIGENVALUES OF THE HAMILTONIAN 

We determined the eigenvalues of the operator $\mathbf{L}^2$ in §D of chapter VI:
they are of the form $l(l+1)\hbar^2$, where $l$ is any non-negative integer. Furthermore,
we know an orthonormal system of eigenfunctions of $\mathbf{L}^2$: the spherical harmonics
$Y_l^m(\theta, \varphi)$, which constitute a basis in the space of functions which are square-integrable
in $\theta$ and $\varphi$ (§D-1-c-β of chapter VI). We shall denote by $|l, m\rangle$ the
ket of $\mathscr{E}_\Omega$ associated with $Y_l^m(\theta, \varphi)$:

$$ Y_l^m(\theta, \varphi) \Longleftrightarrow |l, m\rangle \tag{17} $$

We see from (13) that:

$$ H\,|l, m\rangle = \frac{l(l+1)\hbar^2}{2\mu r_e^2}\,|l, m\rangle \tag{18} $$

It is customary to set:

$$ B = \frac{\hbar}{4\pi I} = \frac{\hbar}{4\pi\mu r_e^2} \tag{19} $$

$B$ is called the “rotational constant” and has the dimensions of a frequency*.
The eigenvalues of $H$ are thus of the form:

$$ E_l = Bh\,l(l+1) \tag{20} $$

Since, for a given value of $l$, there exist $(2l+1)$ spherical harmonics $Y_l^m(\theta, \varphi)$
($m = -l, -l+1, \ldots, l$), we see that each eigenvalue $E_l$ is $(2l+1)$-fold degenerate.
Figure 2 represents the first energy levels of the rotator. The separation of two
adjacent levels, $l$ and $l-1$, is equal to:

$$ E_l - E_{l-1} = Bh\,[l(l+1) - l(l-1)] = 2Bhl \tag{21} $$

and increases linearly with $l$.
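Formulas (20) and (21) can be checked with a few lines of code. The following is only an illustrative sketch: the function names `level` and `spacing` are not part of the text, and energies are expressed in units of $Bh$ so that no numerical value of $B$ has to be assumed.

```python
def level(l):
    """Energy (in units of B*h) and degeneracy of the rotational level l, from (20)."""
    return l * (l + 1), 2 * l + 1


def spacing(l):
    """Separation E_l - E_{l-1} in units of B*h, for l >= 1, from (21)."""
    return level(l)[0] - level(l - 1)[0]


# First six levels (l = 0..5) and the five separations between them.
levels = [level(l) for l in range(6)]
spacings = [spacing(l) for l in range(1, 6)]
print(levels)
print(spacings)
```

The separations come out as $2l$ in units of $Bh$, which is the linear growth stated after (21).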


FIGURE 2

First levels of the rigid rotator, of energies $E_l = Bh\,l(l+1)$ (with $l = 0, 1, 2, \ldots$).
Each level $E_l$ for which $l \geq 1$ is separated from the next lower level by an
energy $2Bhl$ (the separations shown are $2Bh$, $4Bh$, $6Bh$, $8Bh$ and $10Bh$ for $l = 1$ to $5$).


* The speed of light $c$ is sometimes placed in the denominator of the right-hand side of (19).
$B$ then has the dimensions of an inverse length and is expressed in cm$^{-1}$ (in the CGS system).

The eigenstates of $H$ satisfy the following orthogonality and closure relations
(deduced from those satisfied by the spherical harmonics, §D-1-c-β of chapter VI):

$$ \begin{aligned} \langle l, m \,|\, l', m' \rangle &= \delta_{ll'}\,\delta_{mm'} \\ \sum_{l=0}^{\infty}\sum_{m=-l}^{+l} |l, m\rangle\langle l, m| &= 1 \end{aligned} \tag{22} $$

The most general quantum state of the rotator can be expanded on the states
$|l, m\rangle$:

$$ |\psi(t)\rangle = \sum_{l=0}^{\infty}\sum_{m=-l}^{+l} c_{l,m}(t)\,|l, m\rangle \tag{23} $$

The component:

$$ c_{l,m}(t) = \langle l, m \,|\, \psi(t) \rangle = \int \mathrm{d}\Omega\; Y_l^{m*}(\theta, \varphi)\,\psi(\theta, \varphi; t) \tag{24} $$

evolves in time in accordance with the equation:

$$ c_{l,m}(t) = c_{l,m}(0)\,e^{-iE_l t/\hbar} \tag{25} $$

c. STUDY OF THE OBSERVABLE Z

Earlier, we introduced the observables $X$, $Y$, $Z$ which correspond to the
projections onto the three axes of the segment $M_1M_2$. In this section, we shall
study the evolution of the mean values of these observables and compare the
results obtained with those predicted by classical mechanics. We shall confine
ourselves to the calculation of $\langle Z\rangle(t)$, since $\langle X\rangle(t)$ and $\langle Y\rangle(t)$ have analogous
properties.

A Bohr frequency $(E_l - E_{l'})/h$ can appear in the function $\langle Z\rangle(t)$ if $Z$ has a
non-zero matrix element between a state $|l, m\rangle$ of energy $E_l$ and a state $|l', m'\rangle$ of
energy $E_{l'}$. The first problem is therefore to find the non-zero matrix elements
of $Z$. To solve this problem, we shall use the following relation, which can be
established by using the mathematical properties of spherical harmonics
[complement A$_{\mathrm{VI}}$, formula (35)]:

$$ \cos\theta\; Y_l^m(\theta, \varphi) = \sqrt{\frac{l^2 - m^2}{4l^2 - 1}}\; Y_{l-1}^m(\theta, \varphi) + \sqrt{\frac{(l+1)^2 - m^2}{4(l+1)^2 - 1}}\; Y_{l+1}^m(\theta, \varphi) \tag{26} $$

From this, we deduce, using (16), (17) and (22):

$$ \langle l', m' \,|\, Z \,|\, l, m \rangle = r_e\,\delta_{mm'}\left[\delta_{l',l-1}\sqrt{\frac{l^2 - m^2}{4l^2 - 1}} + \delta_{l',l+1}\sqrt{\frac{(l+1)^2 - m^2}{4(l+1)^2 - 1}}\right] \tag{27} $$
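Formula (27) can be checked numerically in the special case $m = m' = 0$, where $Y_l^0(\theta, \varphi) = \sqrt{(2l+1)/4\pi}\,P_l(\cos\theta)$ and the matrix element of $\cos\theta$ reduces to a one-dimensional integral over $u = \cos\theta$. The sketch below is an illustration only: the helper names (`legendre`, `cos_element`, `formula`) and the choice of a simple midpoint quadrature are assumptions made here, not something taken from the text.

```python
import math


def legendre(l, u):
    """Legendre polynomial P_l(u), computed with the three-term recurrence."""
    p_prev, p = 1.0, u
    if l == 0:
        return p_prev
    for k in range(1, l):
        p_prev, p = p, ((2 * k + 1) * u * p - k * p_prev) / (k + 1)
    return p


def cos_element(lp, l, n=4000):
    """<l',0| cos(theta) |l,0> by midpoint quadrature over u = cos(theta)."""
    norm = math.sqrt((2 * lp + 1) * (2 * l + 1)) / 2.0
    h = 2.0 / n
    total = 0.0
    for i in range(n):
        u = -1.0 + (i + 0.5) * h
        total += legendre(lp, u) * u * legendre(l, u)
    return norm * total * h


def formula(lp, l, m=0):
    """Right-hand side of (27) divided by r_e, with m' = m already imposed."""
    if lp == l - 1:
        return math.sqrt((l * l - m * m) / (4 * l * l - 1))
    if lp == l + 1:
        return math.sqrt(((l + 1) ** 2 - m * m) / (4 * (l + 1) ** 2 - 1))
    return 0.0


# Largest disagreement between quadrature and formula for small l, l'.
max_error = max(
    abs(cos_element(lp, l) - formula(lp, l))
    for l in range(4)
    for lp in range(5)
)
print(max_error)
```

Within the quadrature error, the only non-vanishing elements are those with $l' = l \pm 1$, as (27) states.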


comment: 

According to (27), the selection rules satisfied by $Z$ are: $\Delta l = \pm 1$, $\Delta m = 0$.
It can be shown that for $X$ and $Y$ we have: $\Delta l = \pm 1$, $\Delta m = \pm 1$. Since the energies
depend only on $l$, the Bohr frequencies are the same for $\langle X\rangle$, $\langle Y\rangle$ and $\langle Z\rangle$.

The operator $Z$ can therefore connect only states belonging to two adjacent
levels of figure 2 (the corresponding transitions are represented by vertical arrows
in figure 2). The only Bohr frequencies which appear in the evolution of $\langle Z\rangle(t)$
are thus of the form:

$$ \nu_{l,l-1} = \frac{E_l - E_{l-1}}{h} = 2Bl \tag{28} $$

They form a series of equidistant frequencies, separated by the interval $2B$
(fig. 3).

FIGURE 3

Frequencies appearing in the evolution of the mean value of the observable $Z$. Because of the selection
rule $\Delta l = \pm 1$, only the Bohr frequencies $2Bl$ (with $l \geq 1$), associated with two adjacent
levels $E_l$ and $E_{l-1}$ in figure 2, are observed (lines at $2B$, $4B$, $6B$, $8B$, $10B$, labeled by the transitions $1 \leftrightarrow 0$, $2 \leftrightarrow 1$, $3 \leftrightarrow 2$, $4 \leftrightarrow 3$, $5 \leftrightarrow 4$).


The mean value $\langle Z\rangle(t)$ can evolve only at a well-defined series of frequencies.
This is unlike the classical case, in which the frequency of rotation $\nu_R$ of the
rotator can take on any value.

According to (27), if the system is in a stationary state $|l, m\rangle$, $\langle Z\rangle(t)$ is
always zero, even for large $l$. To obtain a quantum mechanical state in which
$\langle Z\rangle$ behaves like the corresponding classical variable $z$, one must superpose
a large number of states $|l, m\rangle$. If we assume that the state of the system is
given by formula (23), and that the numbers $|c_{l,m}(0)|^2$ have values which vary
with $l$ as is shown in figure 4, the most probable value of $l$, $l_M$, is very large;
the spread $\Delta l$ of the values of $l$ is also very large in absolute value, but very small
in relative value:

$$ l_M,\ \Delta l \gg 1 \tag{29-a} $$

$$ \frac{\Delta l}{l_M} \ll 1 \tag{29-b} $$

FIGURE 4

Square of the moduli of the expansion coefficients $|c_{l,m}(0)|^2$ of a “quasi-classical”
state on the stationary states $|l, m\rangle$ of the rigid rotator. The spread $\Delta l$ is
large; however, since the most probable value of $l$, $l_M$, is very large, we have
$\Delta l / l_M \ll 1$, and the relative accuracy with respect to $l$ is very good.

It can then be shown that, in such a state:

$$ \langle \mathbf{L} \rangle^2 \simeq \langle \mathbf{L}^2 \rangle \simeq l_M(l_M + 1)\hbar^2 \simeq l_M^2\hbar^2 \tag{30} $$

In addition, the Bohr frequencies which appear in $\langle Z\rangle(t)$ are then all very close
(in relative value) to:

$$ \nu_M \simeq 2Bl_M \tag{31} $$

Eliminating $l_M$ between (30) and (31), we obtain, according to (19):

$$ \nu_M \simeq \frac{2B\,|\langle \mathbf{L} \rangle|}{\hbar} = \frac{|\langle \mathbf{L} \rangle|}{2\pi I} \tag{32} $$

which is the equivalent of the classical relation (6).


comment: 


It is interesting to study in greater detail the motion of the wave packet corresponding
to the state of figure 4. It is represented by a function of $\theta$ and $\varphi$ and can be
considered to evolve on the sphere of unit radius. The preceding discussion shows
that this wave packet rotates on the sphere with the average frequency $\nu_M$. Because
of the spread $\Delta l$ of $l$ and the corresponding spread $2B\Delta l$ of the Bohr frequencies which
enter into $\langle X\rangle$, $\langle Y\rangle$ and $\langle Z\rangle$, the wave packet becomes distorted over time. This
distortion becomes appreciable after a time of the order of:

$$ \tau \simeq \frac{1}{2B\,\Delta l} \tag{33} $$

Since the spread of $l$ is small in relative value, we have:

$$ \nu_M\,\tau \simeq \frac{l_M}{\Delta l} \gg 1 \tag{34} $$

The distortion of the wave packet is therefore slow, relative to its rotation.

In fact, the Bohr frequencies of the system form a discrete series of equidistant
frequencies, separated by the interval $2B$. The motion which results from the superposition
of these frequencies is therefore periodic, of period:

$$ T = \frac{1}{2B} \tag{35} $$

with, according to (29-a):

$$ T \gg \tau \gg \frac{1}{\nu_M} \tag{36} $$

The distortion of the wave packet is therefore not irreversible; it follows a cycle which is
repeated periodically. This is related to the fact that the wave packet evolves on the
unit sphere, which is a bounded surface. This behavior should be compared with that
of free wave packets (irreversible spreading; complement G$_{\mathrm{I}}$) and that of the quasi-classical
states of the harmonic oscillator (oscillation without distortion; complement G$_{\mathrm{V}}$).
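The three time scales of (33) to (36) can be illustrated with a toy signal. The sketch below is an assumption-laden model rather than the actual $\langle Z\rangle(t)$: it simply sums cosines at the Bohr frequencies $2Bl$ with Gaussian weights centered on $l_M$ (the function name `mean_z`, the weights, and the values $B = 1$, $l_M = 50$, $\Delta l = 5$ are all chosen here for illustration and ignore the matrix elements of (27)).

```python
import math


def mean_z(t, B=1.0, l_M=50, dl=5.0):
    """Toy model of <Z>(t): Gaussian-weighted cosines at the Bohr frequencies 2*B*l."""
    total = 0.0
    for l in range(1, 2 * l_M):
        weight = math.exp(-((l - l_M) ** 2) / (2 * dl ** 2))
        total += weight * math.cos(2 * math.pi * (2 * B * l) * t)
    return total


B = 1.0
T = 1 / (2 * B)          # revival period, formula (35)
tau = 1 / (2 * B * 5.0)  # distortion time, formula (33), with the spread dl = 5

initial = mean_z(0.0)
after_tau = mean_z(tau)            # oscillation strongly damped by dephasing
after_period = mean_z(0.123 + T)   # same value as at t = 0.123
print(initial, after_tau, after_period - mean_z(0.123))
```

In this model the oscillation has essentially disappeared at $t = \tau$ but is restored exactly one period $T$ later, in line with the cyclic distortion described above.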

4. Experimental evidence for the rotation of molecules

a. HETEROPOLAR MOLECULES. PURE ROTATIONAL SPECTRUM 

α. Description of the spectrum

If the molecule is composed of two different atoms, the electrons are
attracted by the more electronegative atom, and the molecule generally has a
permanent electric dipole moment $d_0$, directed along the molecular axis. The
projection of the electric dipole moment onto $Oz$ becomes, in quantum mechanics,
an observable proportional to $Z$. We have seen that $\langle Z\rangle(t)$ evolves at all the
Bohr frequencies $2Bl$ ($l = 1, 2, 3, \ldots$) shown in figure 3. Thus we see how the
molecule is coupled to the electromagnetic field and can absorb or emit radiation
polarized parallel to $Oz$*, on the condition that the frequency of this radiation
is equal to one of the Bohr frequencies $2Bl$.

The corresponding absorption or emission spectrum of the molecule is
called the “pure rotational spectrum”. It is composed of a series of equidistant
lines, the frequency separation between two successive lines being equal to $2B$,
as in figure 3. The absorption (or emission) of the line of frequency $2Bl$ corresponds
to the passage of the molecule from the level $l-1$ to the level $l$ (or from the
level $l$ to the level $l-1$), at the same time that a photon of frequency $2Bl$ is
absorbed (or emitted). Figure 5 represents this process schematically [(5-a)
represents the absorption and (5-b), the emission of a photon of frequency $2Bl$].

The pure rotational spectra of diatomic molecules therefore provide direct
experimental proof of the quantization of the observable $\mathbf{L}^2$.



FIGURE 5

Schematic representation of the passage of the molecule from a
given rotational level to the neighboring level with absorption
(fig. a) or emission (fig. b) of a photon of frequency $2Bl$.


β. Comparison with the “pure vibrational” spectrum

In §1-c-α of complement A$_{\mathrm{V}}$, we studied the “pure vibrational” spectrum
of a heteropolar diatomic molecule. It is interesting to compare this spectrum
with the pure rotational spectrum which we are studying here.

(i) The rotational frequencies of a diatomic molecule are generally much
lower than the vibrational frequencies. The separation $2B/c$ of two rotational
lines varies between a few tenths of a cm$^{-1}$ and a few tens of cm$^{-1}$. For small
values of $l$, the rotational frequencies $2Bl$ therefore correspond to wavelengths of the
order of a centimeter or a millimeter. Thus for HCl, the separation $2B/c$ is equal to
20.8 cm$^{-1}$, while the vibration frequency, which corresponds to 2 886 cm$^{-1}$,
is more than a hundred times greater.

Pure rotational spectra therefore fall in the very far infrared or the microwave
domain.

* If we study the motion of $\langle X\rangle(t)$ and $\langle Y\rangle(t)$, we see that the molecule can also absorb or emit
radiation polarized parallel to $Ox$ or $Oy$.

comment: 

As we shall show in complement F$_{\mathrm{VII}}$, the rotation of molecules is also
responsible for a fine structure of vibrational spectra (vibration-rotation
spectra). $2B$ can then be measured in a domain of wavelengths which is
no longer that of microwaves. The same comment applies to the Raman
rotational effect (§4-b below), which appears as a rotational structure of an
optical line.

(ii) The “pure vibrational” spectrum studied in complement A$_{\mathrm{V}}$ has only
one vibrational line. This is due to the fact that the various vibrational levels are
equidistant (if the anharmonicity of the potential is neglected) and, consequently,
only one Bohr frequency appears in the dipole motion (selection rule $\Delta v = \pm 1$).
On the other hand, the pure rotational spectrum consists of a series of equidistant
lines.

(iii) We indicated in complement A$_{\mathrm{V}}$ that the permanent electric dipole
moment of the heteropolar molecule can be expanded in powers of $r - r_e$ in the
neighborhood of the stable equilibrium position of the molecule:

$$ D(r) = d_0 + d_1(r - r_e) + \ldots \tag{37} $$

For the pure vibrational spectrum to appear, $D(r)$ must vary with $r$: $d_1$ must
therefore be different from zero. On the other hand, even if $r$ remains fixed
and equal to $r_e$, the rotation of the molecule modulates the projection of the
electric dipole onto one of the axes, provided that $d_0$ is different from zero. Thus
we see that the study of the intensity of vibrational and rotational lines permits
the separate measurement of the coefficients $d_1$ and $d_0$ of (37).

γ. Applications

The study of pure rotational spectra has some interesting applications;
we shall mention three examples.

(i) Measurement of the separation $2B$ of two neighboring lines yields
the moment of inertia $I$ of the molecule, according to (19). If we know $m_1$ and $m_2$,
we can deduce $r_e$, the separation of the two nuclei in the stable equilibrium position
of the molecule [$r_e$ is the abscissa of the minimum of the curve $V(r)$ of figure 1
in complement A$_{\mathrm{V}}$]. Recall that measurement of the vibrational frequency yields
the curvature of $V(r)$ at $r = r_e$.
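As a numerical illustration of application (i), the sketch below goes from the line separation $2B/c = 20.8$ cm$^{-1}$ quoted above for HCl to an internuclear distance, using $I = \hbar/4\pi B = \mu r_e^2$ from (19). The atomic masses (1.00783 u for H and 34.96885 u for $^{35}$Cl), the choice of that isotope, and the function name `bond_length` are assumptions introduced here; they do not appear in the text.

```python
import math

H_PLANCK = 6.62607015e-34  # Planck's constant, J s
C_LIGHT = 2.99792458e8     # speed of light, m/s
AMU = 1.66053907e-27       # atomic mass unit, kg


def bond_length(line_spacing_cm, m1_u, m2_u):
    """r_e in meters from the line spacing 2B/c (cm^-1) and the two atomic masses (u)."""
    B = 0.5 * line_spacing_cm * 100.0 * C_LIGHT   # rotational constant in Hz
    inertia = H_PLANCK / (8 * math.pi ** 2 * B)   # I = hbar / (4 pi B), formula (19)
    mu = m1_u * m2_u / (m1_u + m2_u) * AMU        # reduced mass in kg
    return math.sqrt(inertia / mu)


# Assumed masses: H = 1.00783 u, 35Cl = 34.96885 u (not given in the text).
r_e = bond_length(20.8, 1.00783, 34.96885)
print(r_e)
```

With these inputs the result is close to 1.3 Å, the expected order of magnitude for an internuclear distance.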

(ii) Consider two diatomic molecules N-M and N-M', in which two
isotopes M and M' of the same element are bound to the same atom N. Since
the distances $r_e$ between the nuclei are equal in the two molecules, measurement
of the ratio of the corresponding coefficients $B$, which can be performed with
great accuracy, yields the ratio of the masses of the two isotopes M and M'.

One could also compare the vibrational frequencies of the two molecules,
but it is preferable to use the rotational spectrum, since the rotational frequencies
vary with $1/\mu$ [formula (19)], while the vibrational frequencies vary with $1/\sqrt{\mu}$
[formula (5) of A$_{\mathrm{V}}$].

(iii) In the study of a sample containing a great number of identical
molecules, the relative intensities of the lines (in absorption or emission) of the
pure rotational spectrum yield information about the distribution of the molecules
among the various levels $E_l$. Unlike what happens in the case of the vibrational
spectrum, transitions between two given adjacent levels (arrows of figure 2) occur
at a particular frequency, which is characteristic of these two levels. Thus, the
intensity of the corresponding line depends on the number of molecules which are
found in the two levels.

This information can be used to determine the temperature of a medium*.
If thermodynamic equilibrium has been attained, we know that the probability
that a given molecule is in a particular state of energy $E_l$ is proportional to
$e^{-E_l/kT}$; since the degeneracy of the rotational level $E_l$ is $(2l+1)$, the total
probability $\mathscr{P}_l$ of finding the molecule being considered in one of the states of the
level $E_l$ (the “population” of the level $E_l$) is:

$$ \mathscr{P}_l = \frac{1}{Z}\,(2l+1)\,e^{-E_l/kT} = \frac{1}{Z}\,(2l+1)\,e^{-l(l+1)hB/kT} \tag{38} $$

where:

$$ Z = \sum_{l=0}^{\infty} (2l+1)\,e^{-l(l+1)hB/kT} \tag{39} $$

is the partition function. If we are studying a system containing a large number
of molecules whose interactions can be neglected, $\mathscr{P}_l$ gives the fraction of them
whose energy is $E_l$.


FIGURE 6

Population $\mathscr{P}_l$ of the various rotational levels $E_l$ at thermodynamic equilibrium. The fact that $\mathscr{P}_l$
begins by increasing with $l$ arises from the $(2l+1)$-fold degeneracy of the levels $E_l$. When $l$ becomes
sufficiently large, the Boltzmann factor $e^{-E_l/kT}$ prevails and is responsible for the decrease in $\mathscr{P}_l$.

* Actually, the vibration-rotation or Raman rotational spectra are more often used, since they
fall into more convenient frequency ranges than does the pure rotational spectrum.

At ordinary temperatures, $hB$ is much smaller than $kT$, so that several rotational
levels are populated. Note that the presence of the factor $(2l+1)$ means that
it is not the lowest levels which are the most populated: figure 6 indicates the
shape of $\mathscr{P}_l$ as a function of $l$ for a temperature $T$ such that $hB/kT$ is of the
order of 1/10. Recall that the vibrational levels, on the other hand, are non-degenerate,
and that their separation is much greater than $hB$; consequently,
when the distribution of the molecules among the rotational levels is that
of figure 6, they are practically all in the vibrational ground state ($v = 0$).
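The shape of the distribution (38) is easy to reproduce numerically. The following sketch evaluates the populations for $hB/kT = 1/10$, the value quoted for figure 6; the function name `populations` and the truncation of the sum (39) at $l = 200$ are choices made here for illustration.

```python
import math


def populations(x, l_max=200):
    """Populations (38) of the levels l = 0..l_max, with x = h*B/(k*T)."""
    weights = [(2 * l + 1) * math.exp(-l * (l + 1) * x) for l in range(l_max + 1)]
    Z = sum(weights)  # partition function (39), truncated at l_max
    return [w / Z for w in weights]


pops = populations(0.1)
l_peak = max(range(len(pops)), key=pops.__getitem__)  # most populated level
print(l_peak, pops[:6])
```

For this value of $hB/kT$ the most populated level is $l = 2$ rather than the ground level, which is the effect of the degeneracy factor $(2l+1)$ noted above.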

b. HOMOPOLAR MOLECULES. RAMAN ROTATIONAL SPECTRA 

As we pointed out in §1-c-β of complement A$_{\mathrm{V}}$, a homopolar molecule (that
is, a molecule composed of two identical atoms) has no permanent electric
dipole moment: in formula (37), we have $d_0 = d_1 = \ldots = 0$. The vibration and
rotation of the molecule induce no coupling with the electromagnetic field, and
the molecule is consequently “inactive” in the near infrared (vibration) and the
microwave domain (rotation). Like the vibration (cf. §1-c-β of A$_{\mathrm{V}}$), the rotation
of the molecule can, however, be observed via the inelastic scattering of light
(the Raman effect).

α. The Raman rotational effect. Classical treatment

We have already introduced, in complement A$_{\mathrm{V}}$, the susceptibility $\chi$ of a
molecule in the optical domain: an incident light wave, whose electric field
is $\mathbf{E}\,e^{i\Omega t}$, sets the electrons of the molecule in forced motion and causes an
electric dipole $\mathbf{D}\,e^{i\Omega t}$, oscillating at the same frequency as the incident wave,
to appear. $\chi$ is the coefficient of proportionality between $\mathbf{D}$ and $\mathbf{E}$. If $\mathbf{E}$ is parallel
to the axis of the molecule, $\chi$ depends on the distance $r$ between the two nuclei:
when the molecule vibrates, $\chi$ vibrates at the same frequency. This is the origin
of the Raman vibrational effect described in §1-c-β of A$_{\mathrm{V}}$.

Actually, a diatomic molecule is an anisotropic system. When the angle
between the molecular axis and $\mathbf{E}$ is arbitrary, $\mathbf{D}$ is not generally parallel to $\mathbf{E}$:
the relation between $\mathbf{D}$ and $\mathbf{E}$ is tensorial ($\chi$ is the “susceptibility tensor”).
$\mathbf{D}$ is parallel to $\mathbf{E}$ in the two following simple cases: $\mathbf{E}$ parallel to the molecular
axis (we then have $\chi = \chi_{/\!/}$) and $\mathbf{E}$ perpendicular to this axis ($\chi = \chi_\perp$). In the
general case, we choose the $Oz$ axis along the electric field $\mathbf{E}$ of the light wave
(assumed to be polarized); we consider a molecule whose axis points in the direction
of polar angles $\theta$ and $\varphi$ and calculate the component along $Oz$ of the dipole induced
by $\mathbf{E}$ on this molecule. $\mathbf{E}$ can be decomposed into a component $\mathbf{E}_{/\!/}$, parallel to the
molecular axis, and a component $\mathbf{E}_\perp$, perpendicular to $M_1M_2$ and contained in
the plane formed by $Oz$ and $M_1M_2$ (fig. 7). The dipole induced on the molecule
by the field $\mathbf{E}\cos\Omega t$ is then equal to:

$$ \mathbf{D} = (\chi_{/\!/}\,\mathbf{E}_{/\!/} + \chi_\perp\,\mathbf{E}_\perp)\cos\Omega t \tag{40} $$

Its projection onto $Oz$ can be calculated immediately:

$$ \begin{aligned} D_z &= (\cos\theta\;\chi_{/\!/}\,|\mathbf{E}_{/\!/}| + \sin\theta\;\chi_\perp\,|\mathbf{E}_\perp|)\cos\Omega t \\ &= (\cos^2\theta\;\chi_{/\!/} + \sin^2\theta\;\chi_\perp)\,E\cos\Omega t \\ &= [\chi_\perp + (\chi_{/\!/} - \chi_\perp)\cos^2\theta]\,E\cos\Omega t \end{aligned} \tag{41} $$

We see that $D_z$ depends on $\theta$, since $\chi_{/\!/}$ and $\chi_\perp$ are not equal (the molecule is
anisotropic).

FIGURE 7

Decomposition of the electric field $\mathbf{E}$ into a component $\mathbf{E}_{/\!/}$
parallel to the molecular axis and a component $\mathbf{E}_\perp$ perpendicular
to this axis. These fields induce electric dipoles
$\chi_{/\!/}\mathbf{E}_{/\!/}$ and $\chi_\perp\mathbf{E}_\perp$ collinear with the corresponding fields.
However, since $\chi_{/\!/}$ and $\chi_\perp$ have different values (the molecule
is anisotropic), the induced dipole $\mathbf{D} = \chi_{/\!/}\mathbf{E}_{/\!/} + \chi_\perp\mathbf{E}_\perp$
is not collinear with $\mathbf{E}$.

To see what happens when the molecule rotates, we shall begin by reasoning
classically. The fact that the molecule rotates at the frequency $\omega_R/2\pi$ means that
$\cos\theta$ oscillates at the same frequency:

$$ \cos\theta = \alpha\cos(\omega_R t - \beta) \tag{42} $$

where $\alpha$ and $\beta$ depend on the initial conditions and the orientation of the angular
momentum $\boldsymbol{\mathscr{L}}$ (which is fixed). Thus we see that the term in $\cos^2\theta$ of (41)
gives rise to components of $D_z$ which oscillate at frequencies of $(\Omega \pm 2\omega_R)/2\pi$, in
addition to the component which varies at the frequency $\Omega/2\pi$. The fact that the
rotation of the molecule at the frequency $\omega_R/2\pi$ modulates its polarizability at twice
its frequency is easy to understand: after half a rotation, performed in half a
period, the molecule returns to the same geometrical position with respect to the
incident light wave. The light re-emitted with a polarization parallel to $Oz$ is that
which is radiated by $D_z$. It has an unshifted line of frequency $\Omega/2\pi$ (Rayleigh line),
as well as two shifted lines, one on each side of the Rayleigh line, of frequencies
$(\Omega - 2\omega_R)/2\pi$ (Raman-Stokes line) and $(\Omega + 2\omega_R)/2\pi$ (Raman-anti-Stokes line).
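The appearance of the three frequencies can be made explicit by substituting (42) into (41). The short worked expansion below is offered as a sketch of that step; it uses only the symbols already defined in (41) and (42) and the standard product-to-sum identities.

```latex
% Step 1: square (42)
\cos^2\theta = \alpha^2\cos^2(\omega_R t - \beta)
             = \frac{\alpha^2}{2}\left[1 + \cos(2\omega_R t - 2\beta)\right]

% Step 2: insert into (41) and use cos A cos B = [cos(A+B) + cos(A-B)]/2
D_z = \left[\chi_\perp + \frac{\alpha^2}{2}(\chi_{/\!/} - \chi_\perp)\right] E\cos\Omega t
    + \frac{\alpha^2}{4}(\chi_{/\!/} - \chi_\perp)\,E
      \left\{\cos\left[(\Omega + 2\omega_R)t - 2\beta\right]
           + \cos\left[(\Omega - 2\omega_R)t + 2\beta\right]\right\}
```

The first term is the Rayleigh component at $\Omega/2\pi$; the two others are the anti-Stokes and Stokes components, whose amplitudes are proportional to the anisotropy $\chi_{/\!/} - \chi_\perp$ and vanish for an isotropic system.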


β. Quantum mechanical selection rules. Form of the Raman spectrum

Quantum mechanically, Raman scattering corresponds to an inelastic scattering
process in which the molecule goes from level $E_l$ to level $E_{l'}$, while the
energy $\hbar\Omega$ of the photon becomes $\hbar\Omega + E_l - E_{l'}$ (the total energy of the system
is conserved during this process).

The quantum theory of the Raman effect (which we shall not discuss here)
indicates that the probability of such a process involves the matrix elements of
$(\chi_{/\!/} - \chi_\perp)\cos^2\theta + \chi_\perp$ between the initial state $Y_l^m(\theta, \varphi)$ and the final state
$Y_{l'}^{m'}(\theta, \varphi)$ of the molecule:

$$ \int \mathrm{d}\Omega\; Y_{l'}^{m'*}(\theta, \varphi)\,[(\chi_{/\!/} - \chi_\perp)\cos^2\theta + \chi_\perp]\,Y_l^m(\theta, \varphi) \tag{43} $$

It can be shown, using the properties of the spherical harmonics, that such a
matrix element is different from zero only if*:

$$ l' - l = 0,\ +2,\ -2 \tag{44} $$

There is only one Rayleigh line (which corresponds to $l = l'$). Since the rotational
levels are not equidistant, there are several Raman-anti-Stokes lines (which
correspond to $l' = l - 2$), of frequencies:

$$ \frac{\Omega}{2\pi} + \frac{E_{l'+2} - E_{l'}}{h} = \frac{\Omega}{2\pi} + 4B\left(l' + \frac{3}{2}\right) \qquad \text{with } l' = 0, 1, 2, \ldots \tag{45} $$

and several Raman-Stokes lines (which correspond to $l' = l + 2$), of frequencies:

$$ \frac{\Omega}{2\pi} + \frac{E_l - E_{l+2}}{h} = \frac{\Omega}{2\pi} - 4B\left(l + \frac{3}{2}\right) \qquad \text{with } l = 0, 1, 2, \ldots \tag{46} $$


FIGURE 8

Raman rotational spectrum of a molecule. This molecule, initially in the rotational level $E_l$, inelastically
scatters an incident photon of energy $\hbar\Omega$. After the scattering, the molecule has moved to the
rotational level $E_{l'}$, and the energy of the photon is $\hbar\Omega + E_l - E_{l'}$ (conservation of energy).

If $l = l'$, the scattered photon has the same frequency $\nu = \Omega/2\pi$ as the incident photon; this
yields the Rayleigh line. But it is also possible to have $l' - l = \pm 2$; if $l' = l + 2$, the frequency of
the scattered photon is lower (Stokes scattering); if $l' = l - 2$, it is higher (anti-Stokes scattering).
Since the rotational levels $E_l$ are not equidistant (cf. fig. 2), there are as many Stokes or anti-Stokes
lines as there are values of $l$. These lines are labeled in the figure by $l \to l'$ (with $l' = l \pm 2$).

* The integral (43) is also zero if $m \neq m'$. If we consider light re-emitted with a different polarization
state from that of the incident light wave, we obtain the following selection rules for $m$: $\Delta m = 0, \pm 1, \pm 2$.

The form of the Raman rotational spectrum is shown in figure 8. The Stokes
and anti-Stokes lines occur symmetrically with respect to the Rayleigh line.
The separation between two adjacent Stokes (or anti-Stokes) lines is equal to
$4B$, that is, to twice the separation which would be found between two adjacent
lines of the pure rotational spectrum if it existed. Moreover, since the vibrational
frequency is much larger than $B$, the Raman-Stokes and anti-Stokes vibrational
lines are situated much further to the left and to the right of the Rayleigh line than
the rotational Raman lines and hence do not appear in the figure (these lines also
have rotational structures analogous to that of figure 8).
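The positions of the Raman lines follow directly from the level scheme $E_l = Bh\,l(l+1)$ and the selection rule $l' - l = \pm 2$. The sketch below lists the shifts relative to the Rayleigh line in units of $B$; the function name `raman_shifts` and the cut-off `l_max` are illustrative choices, not notation from the text.

```python
def raman_shifts(l_max):
    """Stokes and anti-Stokes shifts from the Rayleigh line, in units of B."""

    def energy(l):
        # E_l / (B h) for the rigid rotator
        return l * (l + 1)

    # Stokes: l -> l + 2, the photon loses E_{l+2} - E_l
    stokes = [-(energy(l + 2) - energy(l)) for l in range(l_max + 1)]
    # anti-Stokes: l' + 2 -> l', the photon gains E_{l'+2} - E_{l'}
    anti_stokes = [energy(lp + 2) - energy(lp) for lp in range(l_max + 1)]
    return stokes, anti_stokes


stokes, anti_stokes = raman_shifts(3)
print(stokes)
print(anti_stokes)
```

Successive shifts differ by 4 (in units of $B$), as stated above, and the first line on each side lies at a distance $6B$ from the Rayleigh line.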


comments:

(i) Consider a wave packet like those studied in §3-c, that is, one for which
the values of $l$ are grouped about a very large integer $l_M$ (fig. 4). According
to (45) and (46), the frequencies of the various Stokes and anti-Stokes lines
will be very close (in relative value) to:

$$ \frac{\Omega}{2\pi} \pm 4Bl_M \tag{47} $$

that is, according to (31):

$$ \frac{\Omega}{2\pi} \pm 2\nu_M \tag{48} $$

where $\nu_M$ is the average rotational frequency of the molecule. Thus, the quantum
mechanical treatment, at the classical limit, yields the results of §4-b-α.

(ii) In Raman rotational spectra, the Stokes and anti-Stokes lines appear with
comparable intensities since levels of large $l$ have large populations, as $hB$
is much smaller than $kT$. This is necessary for the observation of anti-Stokes
lines, for which the initial state of the molecule must have $l$ at least equal to 2.
However, the anti-Stokes vibrational line has a much smaller intensity than
the Stokes line. The vibrational energy is much larger than $kT$; the population
of the vibrational ground state $v = 0$ is much larger than the others,
and Stokes processes $v = 0 \to v = 1$ are much more frequent than anti-Stokes
processes $v = 1 \to v = 0$.

(iii) The Raman rotational effect also exists for heteropolar molecules.


References and suggestions for further reading: 

Karplus and Porter (12.1), §7.4; Herzberg (12.4), Vol. I, chap. III, §§ 1 and 2;
Landau and Lifshitz (1.19), chaps. XI and XIII; Townes and Schawlow (12.10), chaps. 1
to 4.

Complement D$_{\mathrm{VI}}$

ANGULAR MOMENTUM OF STATIONARY STATES 
OF A TWO-DIMENSIONAL HARMONIC OSCILLATOR 


1. Introduction 

a. Review of the classical problem 

b. The problem in quantum mechanics 

2. Classification of the stationary states by the quantum numbers $n_x$ and $n_y$

a. Energies; stationary states

b. $H_{xy}$ does not constitute a C.S.C.O. in $\mathscr{E}_{xy}$

3. Classification of the stationary states in terms of their angular momenta

a. Significance and properties of the operator $L_z$

b. Right and left circular quanta

c. Stationary states of well-defined angular momentum

d. Wave functions associated with the eigenstates common to $H_{xy}$ and $L_z$

4. Quasi-classical states

a. Definition of the states $|\alpha_x, \alpha_y\rangle$ and $|\alpha_d, \alpha_g\rangle$

b. Mean values and root-mean-square deviations of the various observables 


In this complement, we shall be concerned with the quantum mechanical properties 
of a two-dimensional harmonic oscillator. The quantum mechanical problem is 
exactly soluble and does not involve complicated calculations. Furthermore, this 
subject provides an opportunity to study a simple application of the properties 
of the orbital angular momentum L, since, as we shall see, the stationary states 
of such an oscillator can be classified with respect to the possible values of the 
observable $L_z$. In addition, the results obtained will be useful in the next complement,
E$_{\mathrm{VI}}$.


1. Introduction 

a. REVIEW OF THE CLASSICAL PROBLEM 

A physical particle always moves in three-dimensional space. However, if
its potential energy depends only on $x$ and $y$, the problem can be treated in two
dimensions. We shall assume here that this potential energy can be written:

$$ V(x, y) = \frac{\mu}{2}\,\omega^2(x^2 + y^2) \tag{1} $$

where $\mu$ is the mass of the particle and $\omega$ is a constant. The classical Hamiltonian
of the system is then:

$$ \mathscr{H} = \mathscr{H}_{xy} + \mathscr{H}_z \tag{2} $$

with:

$$ \begin{aligned} \mathscr{H}_{xy} &= \frac{1}{2\mu}(p_x^2 + p_y^2) + \frac{1}{2}\mu\omega^2(x^2 + y^2) \\ \mathscr{H}_z &= \frac{p_z^2}{2\mu} \end{aligned} \tag{3} $$

where $p_x$, $p_y$, $p_z$ are the three components of the momentum $\mathbf{p}$ of the particle. $\mathscr{H}_{xy}$ is
a two-dimensional harmonic oscillator Hamiltonian.

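The equations of motion can easily be integrated to yield:

$$ \begin{cases} p_z(t) = p_0 \\ z(t) = \dfrac{p_0}{\mu}\,t + z_0 \end{cases} \tag{4} $$

$$ \begin{cases} x(t) = x_M\cos(\omega t - \varphi_x) \\ p_x(t) = -\mu\omega\,x_M\sin(\omega t - \varphi_x) \end{cases} \tag{5} $$

$$ \begin{cases} y(t) = y_M\cos(\omega t - \varphi_y) \\ p_y(t) = -\mu\omega\,y_M\sin(\omega t - \varphi_y) \end{cases} \tag{6} $$

where $p_0$, $z_0$, $x_M$, $\varphi_x$, $y_M$, $\varphi_y$ are constants which depend on the initial conditions (we
assume $x_M$ and $y_M$ to be positive).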



FIGURE 1

Projection of the classical trajectory of a particle in a two-dimensional harmonic potential onto
the $xOy$ plane; we obtain an ellipse inscribed in the rectangle ABCD.


We see that the projection of the particle onto $Oz$ describes a uniform motion
with a velocity of $p_0/\mu$. The projection onto the $xOy$ plane describes an ellipse
inscribed in the rectangle ABCD of figure 1. The direction the particle takes on this
ellipse depends on the phase difference $\varphi_y - \varphi_x$. When $\varphi_y - \varphi_x = \pm\pi$, the
ellipse reduces to the line AC. When $\varphi_y - \varphi_x$ is between $-\pi$ and 0, the particle
moves clockwise on the ellipse (“left-handed” motion), with the axes of the ellipse
parallel to $Ox$ and $Oy$ for $\varphi_y - \varphi_x = -\pi/2$. When $\varphi_y - \varphi_x = 0$, the ellipse
reduces to the line BD. Finally, when $\varphi_y - \varphi_x$ is between 0 and $\pi$, the particle moves
counterclockwise on the ellipse (“right-handed” motion), with the axes parallel to $Ox$
and $Oy$ for $\varphi_y - \varphi_x = +\pi/2$. Note that the ellipse reduces to a circle
if $\varphi_y - \varphi_x = \pm\pi/2$ and $x_M = y_M$.

It is easy to determine several constants of the motion related to the projection
of the motion onto the $xOy$ plane:

- the total energy $\mathscr{H}_{xy}$, which, according to (3), (5), (6), is equal to:

$$ \mathscr{H}_{xy} = \frac{1}{2}\mu\omega^2(x_M^2 + y_M^2) \tag{7} $$

- the energies:

$$ \mathscr{H}_x = \frac{1}{2}\mu\omega^2 x_M^2 \tag{8-a} $$

$$ \mathscr{H}_y = \frac{1}{2}\mu\omega^2 y_M^2 \tag{8-b} $$

of the projections of the motion onto $Ox$ and $Oy$;

- the component of the orbital angular momentum $\boldsymbol{\mathscr{L}}$ of the particle
along $Oz$:

$$ \mathscr{L}_z = x p_y - y p_x \tag{9} $$

which, according to (5) and (6), is equal to:

$$ \mathscr{L}_z = \mu\omega\,x_M y_M\sin(\varphi_y - \varphi_x) \tag{10} $$

We see that $\mathscr{L}_z$ is positive or negative depending on whether the motion is counterclockwise
($0 < \varphi_y - \varphi_x < \pi$) or clockwise ($-\pi < \varphi_y - \varphi_x < 0$). $\mathscr{L}_z$ is zero for the
two rectilinear motions ($\varphi_y - \varphi_x = \pm\pi$ and $\varphi_y - \varphi_x = 0$). Finally, for a motion
at a given energy, that is, according to (7), for a fixed value of $x_M^2 + y_M^2$, $|\mathscr{L}_z|$ is
maximal when $\varphi_y - \varphi_x = \pm\pi/2$ and the product $x_M y_M$ is maximal, which
implies $x_M = y_M$. Of all motions at a given energy, it is the counterclockwise
(clockwise) circular motion which corresponds to the maximal (minimal) algebraic value
of $\mathscr{L}_z$.
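The step from (9) to (10) can be verified numerically by evaluating $x p_y - y p_x$ along the trajectory (5), (6) at several times. The sketch below does this for one arbitrary set of parameters; the function names and the numerical values are chosen here purely for illustration.

```python
import math


def state(t, mu, omega, x_M, y_M, phi_x, phi_y):
    """Position and momentum in the xOy plane at time t, from (5) and (6)."""
    x = x_M * math.cos(omega * t - phi_x)
    y = y_M * math.cos(omega * t - phi_y)
    p_x = -mu * omega * x_M * math.sin(omega * t - phi_x)
    p_y = -mu * omega * y_M * math.sin(omega * t - phi_y)
    return x, y, p_x, p_y


def angular_momentum(t, *params):
    """L_z = x p_y - y p_x, formula (9), evaluated on the trajectory."""
    x, y, p_x, p_y = state(t, *params)
    return x * p_y - y * p_x


# Arbitrary illustrative values: mu, omega, x_M, y_M, phi_x, phi_y
params = (1.0, 2.0, 1.5, 0.7, 0.3, 1.1)
mu, omega, x_M, y_M, phi_x, phi_y = params
expected = mu * omega * x_M * y_M * math.sin(phi_y - phi_x)  # formula (10)
values = [angular_momentum(t, *params) for t in (0.0, 0.4, 1.7)]
print(expected, values)
```

The value is the same at every time and agrees with (10); here $0 < \varphi_y - \varphi_x < \pi$, so $\mathscr{L}_z$ is positive, as expected for a counterclockwise motion.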


b. THE PROBLEM IN QUANTUM MECHANICS 

The quantization rules of chapter III enable us to obtain $H$, $H_{xy}$, $H_z$ from $\mathscr{H}$,
$\mathscr{H}_{xy}$, $\mathscr{H}_z$. The stationary states $|\varphi\rangle$ of the particle are given by:

$$ H|\varphi\rangle = (H_{xy} + H_z)|\varphi\rangle = E|\varphi\rangle \tag{11} $$

with:

$$ H_{xy} = \frac{P_x^2 + P_y^2}{2\mu} + \frac{1}{2}\mu\omega^2(X^2 + Y^2) \tag{12-a} $$

$$ H_z = \frac{P_z^2}{2\mu} \tag{12-b} $$

According to the results of complement F,, we know that we can choose a
basis of eigenstates of $H$ composed of vectors of the form:

$$ |\varphi\rangle = |\varphi_{xy}\rangle \otimes |\varphi_z\rangle \tag{13} $$

where $|\varphi_{xy}\rangle$ is an eigenvector of $H_{xy}$ in the state space $\mathscr{E}_{xy}$ associated with the
variables $x$ and $y$:

$$ H_{xy}|\varphi_{xy}\rangle = E_{xy}|\varphi_{xy}\rangle \tag{14} $$

and $|\varphi_z\rangle$ is an eigenvector of $H_z$ in the space $\mathscr{E}_z$ associated with the variable $z$:

$$ H_z|\varphi_z\rangle = E_z|\varphi_z\rangle \tag{15} $$

The total energy associated with the state (13) is then:

$$ E = E_{xy} + E_z \tag{16} $$

Now, equation (15), which in fact describes the stationary states of a free
particle in a one-dimensional problem, can be solved immediately; it yields:

$$ \langle z\,|\,\varphi_z\rangle = \frac{1}{\sqrt{2\pi\hbar}}\,e^{ip_z z/\hbar} \tag{17} $$

(where $p_z$ is an arbitrary real constant), with:

$$ E_z = \frac{p_z^2}{2\mu} \tag{18} $$

The problem therefore reduces to the determination of the solutions of equation (14),
that is, the energies and stationary states of a two-dimensional harmonic oscillator.
This is the problem we shall now try to solve.

We shall see that the eigenvalues $E_{xy}$ of $H_{xy}$ are degenerate: $H_{xy}$ alone does
not constitute a C.S.C.O. in $\mathscr{E}_{xy}$. We must therefore add one or several other
observables to $H_{xy}$ in order to construct a C.S.C.O. In fact, we find in quantum
mechanics the same constants of the motion as in classical mechanics: $H_x$ and $H_y$,
the energies of the projection of the motion onto $Ox$ and $Oy$; and $L_z$, the component
along $Oz$ of the orbital angular momentum $\mathbf{L}$. Since $L_z$ commutes with neither $H_x$
nor $H_y$, we shall see that a C.S.C.O. can be formed of $H_{xy}$, $H_x$ and $H_y$ (§2) or
of $H_{xy}$ and $L_z$ (§3).


comments: 

(i) Formula (18) indicates that the eigenvalues $E_z$ of $H_z$ are all two-fold degenerate in the
space $\mathscr{E}_z$. Furthermore, the degeneracy in $\mathscr{E} = \mathscr{E}_{xy} \otimes \mathscr{E}_z$ of the eigenvalues (16) of the
total Hamiltonian $H$ is not due solely to the degeneracy of $E_{xy}$ in $\mathscr{E}_{xy}$ and of $E_z$ in $\mathscr{E}_z$:
two eigenvectors of $H$ of the form (13) can have the same total energy $E$ without
their corresponding values of $E_{xy}$ (and of $E_z$) being equal.

(ii) $H$ commutes with the component $L_z$ of $\mathbf{L}$, but not with $L_x$ and $L_y$. This results from the
fact that the potential energy written in (1) is rotation-invariant only about $Oz$.
Moreover, of the three operators $L_x$, $L_y$ and $L_z$, only one, $L_z$, acts only in $\mathscr{E}_{xy}$.
In the study of the two-dimensional harmonic oscillator, therefore, we shall use only
the observable $L_z$. In complement B$_{\mathrm{VII}}$, we shall study the isotropic three-dimensional
harmonic oscillator, whose potential energy is invariant with respect to any rotation
about an axis which passes through the origin; we shall see that all the components
of $\mathbf{L}$ then commute with the Hamiltonian.


2. Classification of the stationary states 
by the quantum numbers n x and n y 


a. ENERGIES; STATIONARY STATES 

To obtain the solutions of the eigenvalue equation (14), note that $H_{xy}$ can be
written:

$$ H_{xy} = H_x + H_y \tag{19} $$

where $H_x$ and $H_y$ are both Hamiltonians of one-dimensional harmonic oscillators:

$$ H_x = \frac{P_x^2}{2\mu} + \frac{1}{2}\mu\omega^2 X^2 \qquad H_y = \frac{P_y^2}{2\mu} + \frac{1}{2}\mu\omega^2 Y^2 \tag{20} $$

We know the eigenstates $|\varphi_{n_x}\rangle$ of $H_x$ in $\mathcal{E}_x$ and the eigenstates $|\varphi_{n_y}\rangle$ of $H_y$ in $\mathcal{E}_y$.
Their energies are, respectively, $E_x = (n_x + 1/2)\hbar\omega$ and $E_y = (n_y + 1/2)\hbar\omega$ (where $n_x$
and $n_y$ are positive integers or zero). The eigenstates of $H_{xy}$ can thus be chosen in
the form:

$$ |\varphi_{n_x,n_y}\rangle = |\varphi_{n_x}\rangle \otimes |\varphi_{n_y}\rangle \tag{21} $$

where the corresponding energy $E_{xy}$ is given by:

$$ E_{xy} = \left(n_x + \tfrac{1}{2}\right)\hbar\omega + \left(n_y + \tfrac{1}{2}\right)\hbar\omega = (n_x + n_y + 1)\,\hbar\omega \tag{22} $$

According to the properties of the one-dimensional harmonic oscillator, $E_x$ is
non-degenerate in $\mathcal{E}_x$, and $E_y$ in $\mathcal{E}_y$. Consequently, a vector $|\varphi_{n_x,n_y}\rangle$ of $\mathcal{E}_{xy}$ which is
unique to within a constant factor corresponds to a pair $\{n_x, n_y\}$: $H_x$ and $H_y$ form
a C.S.C.O. in $\mathcal{E}_{xy}$.

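It will prove convenient to use the operators $a_x$ and $a_y$ (destruction operators
of a quantum, relative to $Ox$ and $Oy$ respectively), defined by:

$$ a_x = \frac{1}{\sqrt{2}}\left(\beta X + i\frac{P_x}{\beta\hbar}\right) \qquad a_y = \frac{1}{\sqrt{2}}\left(\beta Y + i\frac{P_y}{\beta\hbar}\right) \tag{23} $$

with:

$$ \beta = \sqrt{\frac{\mu\omega}{\hbar}} \tag{24} $$

Since $a_x$ and $a_y$ act in different spaces, $\mathcal{E}_x$ and $\mathcal{E}_y$, the only non-zero commutators
between the four operators $a_x$, $a_y$, $a_x^\dagger$, $a_y^\dagger$ are:

$$ [a_x, a_x^\dagger] = [a_y, a_y^\dagger] = 1 \tag{25} $$

The operators $N_x$ (the number of quanta relative to the $Ox$ axis) and $N_y$ (the number
of quanta relative to the $Oy$ axis) are given by:

$$ N_x = a_x^\dagger a_x \qquad N_y = a_y^\dagger a_y \tag{26} $$

which enables us to write $H_{xy}$ in the form:

$$ H_{xy} = H_x + H_y = (N_x + N_y + 1)\,\hbar\omega \tag{27} $$

We have, obviously:

$$ N_x |\varphi_{n_x,n_y}\rangle = n_x |\varphi_{n_x,n_y}\rangle \qquad N_y |\varphi_{n_x,n_y}\rangle = n_y |\varphi_{n_x,n_y}\rangle \tag{28} $$

The ground state $|\varphi_{0,0}\rangle$ is given by:

$$ |\varphi_{0,0}\rangle = |\varphi_{n_x=0}\rangle \otimes |\varphi_{n_y=0}\rangle \tag{29} $$

The state $|\varphi_{n_x,n_y}\rangle$ defined by (21) can be obtained from $|\varphi_{0,0}\rangle$ by the successive
application of the operators $a_x^\dagger$ and $a_y^\dagger$:

$$ |\varphi_{n_x,n_y}\rangle = \frac{1}{\sqrt{(n_x)!\,(n_y)!}}\,(a_x^\dagger)^{n_x} (a_y^\dagger)^{n_y}\, |\varphi_{0,0}\rangle \tag{30} $$

The corresponding wave function is the product of $\varphi_{n_x}(x)$ and $\varphi_{n_y}(y)$ [cf. complement
$B_V$, formula (35)]:

$$ \varphi_{n_x,n_y}(x, y) = \frac{\beta}{\sqrt{\pi\, 2^{\,n_x+n_y}\,(n_x)!\,(n_y)!}}\; e^{-\beta^2(x^2+y^2)/2}\; H_{n_x}(\beta x)\, H_{n_y}(\beta y) \tag{31} $$

(where $H_{n_x}$ and $H_{n_y}$ here denote Hermite polynomials).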


b. $H_{xy}$ DOES NOT CONSTITUTE A C.S.C.O. IN $\mathcal{E}_{xy}$

We see from (22) that the eigenvalues of $H_{xy}$ are of the form:

$$ E_{xy} = E_n = (n + 1)\,\hbar\omega \tag{32} $$

where:

$$ n = n_x + n_y \tag{33} $$

is a positive integer or zero. To each value of the energy correspond the various
orthogonal eigenvectors:

$$ |\varphi_{n_x=n,\,n_y=0}\rangle,\ |\varphi_{n_x=n-1,\,n_y=1}\rangle,\ \ldots,\ |\varphi_{n_x=0,\,n_y=n}\rangle \tag{34} $$

Since there are $(n + 1)$ of these vectors, the eigenvalue $E_n$ is $(n + 1)$-fold degenerate
in $\mathcal{E}_{xy}$. $H_{xy}$ alone does not, therefore, constitute a C.S.C.O. On the other hand,
we have seen that $\{H_x, H_y\}$ is a C.S.C.O.; this is also, obviously, true of $\{H_{xy}, H_x\}$
and $\{H_{xy}, H_y\}$.
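The counting behind (32) to (34) can be made concrete with a few lines of Python. This is only an illustrative sketch: the function names are hypothetical, and energies are expressed in units of $\hbar\omega$.

```python
def degenerate_states(n):
    """Pairs (n_x, n_y) with n_x + n_y = n, listed in the order of formula (34)."""
    return [(n - k, k) for k in range(n + 1)]


def energy_in_units_of_hbar_omega(n_x, n_y):
    """E_xy / (hbar * omega) according to formula (22)."""
    return n_x + n_y + 1


if __name__ == "__main__":
    for n in range(4):
        states = degenerate_states(n)
        # Each level n contains n + 1 orthogonal states with the same energy.
        print(n, len(states), states)
```

Specifying $n_x$ in addition to $n$ selects a single pair, which is the statement that $\{H_{xy}, H_x\}$ is a C.S.C.O.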


3. Classification of the stationary states 
in terms of their angular momenta 

a. SIGNIFICANCE AND PROPERTIES OF THE OPERATOR $L_z$

In the preceding section, we identified the stationary states by the quantum 
numbers n x and n y . But the Ox and Oy axes do not enjoy a privileged position in 
this problem. Since the potential energy is invariant under rotation about Oz, 
we could just as well have chosen another system of orthogonal axes Ox' and Oy' 
in the xOy plane; we would have then obtained stationary states different from the 
preceding ones. 

Therefore, in order to take better advantage of the symmetry of the problem,
we shall now consider the component $L_z$ of the angular momentum, defined by:

$$ L_z = X P_y - Y P_x \tag{35} $$

Expressing $X$ and $P_x$ in terms of $a_x$ and $a_x^\dagger$, and $Y$ and $P_y$ in terms of $a_y$ and $a_y^\dagger$, we get:

$$ L_z = i\hbar\,(a_x a_y^\dagger - a_x^\dagger a_y) \tag{36} $$

Now, the expression for $H_{xy}$ in terms of the same operators is:

$$ H_{xy} = (a_x^\dagger a_x + a_y^\dagger a_y + 1)\,\hbar\omega \tag{37} $$

Since:

$$ [a_x a_y^\dagger,\ a_x^\dagger a_x + a_y^\dagger a_y] = a_x a_y^\dagger - a_x a_y^\dagger = 0 $$
$$ [a_x^\dagger a_y,\ a_x^\dagger a_x + a_y^\dagger a_y] = -\,a_x^\dagger a_y + a_x^\dagger a_y = 0 \tag{38} $$

we find that:

$$ [H_{xy}, L_z] = 0 \tag{39} $$

We shall therefore look for a basis of eigenvectors common to $H_{xy}$ and $L_z$.


b. RIGHT AND LEFT CIRCULAR QUANTA 

We introduce the operators $a_d$ and $a_g$ defined by:

$$ a_d = \frac{1}{\sqrt{2}}\,(a_x - i a_y) \qquad a_g = \frac{1}{\sqrt{2}}\,(a_x + i a_y) \tag{40} $$

We see from this definition that the action of $a_d$ (or $a_g$) on $|\varphi_{n_x,n_y}\rangle$ yields a state
which is a linear combination of $|\varphi_{n_x-1,n_y}\rangle$ and $|\varphi_{n_x,n_y-1}\rangle$, that is, a stationary
state which has one less energy quantum $\hbar\omega$. Similarly, the action of $a_d^\dagger$ (or $a_g^\dagger$)
on $|\varphi_{n_x,n_y}\rangle$ yields another stationary state which has one more energy quantum.
In fact, we shall see that $a_d$ (or $a_g$) is analogous to $a_x$ (or $a_y$), and that $a_d$ and $a_g$ can
be interpreted as being destruction operators of a right and left "circular quantum"
respectively.

First of all, using (40) and (25), it is simple to verify that the only non-zero
commutators between the four operators $a_d$, $a_g$, $a_d^\dagger$, $a_g^\dagger$ are:

$$ [a_d, a_d^\dagger] = [a_g, a_g^\dagger] = 1 \tag{41} $$

These relations are indeed analogous to (25). Moreover, $H_{xy}$ can be written,
in terms of these operators, in a way that is similar to (37); since:

$$ a_d^\dagger a_d = \frac{1}{2}\,(a_x^\dagger a_x + a_y^\dagger a_y - i a_x^\dagger a_y + i a_x a_y^\dagger) $$
$$ a_g^\dagger a_g = \frac{1}{2}\,(a_x^\dagger a_x + a_y^\dagger a_y + i a_x^\dagger a_y - i a_x a_y^\dagger) \tag{42} $$

we have:

$$ H_{xy} = (a_d^\dagger a_d + a_g^\dagger a_g + 1)\,\hbar\omega \tag{43} $$

In addition, using (36), we see that:

$$ L_z = \hbar\,(a_d^\dagger a_d - a_g^\dagger a_g) \tag{44} $$

If we introduce the operators $N_d$ and $N_g$ (the number of right and left "circular
quanta"):

$$ N_d = a_d^\dagger a_d \qquad N_g = a_g^\dagger a_g \tag{45} $$

formulas (43) and (44) become:

$$ H_{xy} = (N_d + N_g + 1)\,\hbar\omega \qquad L_z = \hbar\,(N_d - N_g) \tag{46} $$

Thus, while maintaining $H_{xy}$ in a form as simple as (27), we have simplified that
of $L_z$.
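Relations (42) to (46) can be checked numerically with small matrices. The sketch below is an illustration only, written with the standard library: it represents $a_x$ and $a_y$ on a Fock space truncated to `DIM` levels per axis (the truncation and all helper names are assumptions of this example, not part of the text). The two identities tested use only the fact that operators acting on different axes commute, so they hold exactly even in the truncated space, whereas the commutators (25) and (41) would fail on the highest retained level.

```python
import math


def destroy(dim):
    """Truncated destruction operator: matrix element <r|a|c> = sqrt(c) if c == r + 1."""
    return [[math.sqrt(c) if c == r + 1 else 0.0 for c in range(dim)] for r in range(dim)]


def identity(dim):
    return [[1.0 if r == c else 0.0 for c in range(dim)] for r in range(dim)]


def kron(a, b):
    """Kronecker (tensor) product of two matrices given as nested lists."""
    return [[a[i][j] * b[k][l] for j in range(len(a[0])) for l in range(len(b[0]))]
            for i in range(len(a)) for k in range(len(b))]


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
            for i in range(len(a))]


def dagger(a):
    """Hermitian conjugate."""
    return [[complex(a[j][i]).conjugate() for j in range(len(a))] for i in range(len(a[0]))]


def combine(ca, a, cb, b):
    """Linear combination ca * a + cb * b."""
    return [[ca * x + cb * y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def max_abs_diff(a, b):
    return max(abs(x - y) for ra, rb in zip(a, b) for x, y in zip(ra, rb))


DIM = 5  # number of levels kept per axis (assumption of this sketch)
a_x = kron(destroy(DIM), identity(DIM))
a_y = kron(identity(DIM), destroy(DIM))
s = 1.0 / math.sqrt(2.0)
a_d = combine(s, a_x, -1j * s, a_y)   # definition (40)
a_g = combine(s, a_x, 1j * s, a_y)

n_x, n_y = matmul(dagger(a_x), a_x), matmul(dagger(a_y), a_y)
n_d, n_g = matmul(dagger(a_d), a_d), matmul(dagger(a_g), a_g)

# N_d + N_g = N_x + N_y, so (43) and (37) describe the same Hamiltonian.
total_check = max_abs_diff(combine(1, n_d, 1, n_g), combine(1, n_x, 1, n_y))

# N_d - N_g = i (a_x a_y^dagger - a_x^dagger a_y), i.e. L_z / hbar from (36) and (44).
l_z_over_hbar = combine(1j, matmul(a_x, dagger(a_y)), -1j, matmul(dagger(a_x), a_y))
l_z_check = max_abs_diff(combine(1, n_d, -1, n_g), l_z_over_hbar)
```

Both `total_check` and `l_z_check` should vanish up to rounding errors.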


c. STATIONARY STATES OF WELL-DEFINED ANGULAR MOMENTUM 

Using the operators $a_d$ and $a_g$, we can now go through the same arguments
we used for $a_x$ and $a_y$. It follows that the spectra of $N_d$ and $N_g$ are composed of all
positive integers and zero. In addition, specifying a pair $\{n_d, n_g\}$ of such integers
determines uniquely (to within a constant factor) the eigenvector common to $N_d$
and $N_g$, associated with these eigenvalues, which is written:

$$ |\chi_{n_d,n_g}\rangle = \frac{1}{\sqrt{(n_d)!\,(n_g)!}}\,(a_d^\dagger)^{n_d} (a_g^\dagger)^{n_g}\, |\varphi_{0,0}\rangle \tag{47} $$

$N_d$ and $N_g$ therefore form a C.S.C.O. in $\mathcal{E}_{xy}$. Thus we see, by using (46), that $|\chi_{n_d,n_g}\rangle$
is also an eigenvector of $H_{xy}$ and of $L_z$, with the eigenvalues $(n + 1)\hbar\omega$ and $m\hbar$,
where $n$ and $m$ are given by:

$$ n = n_d + n_g \qquad m = n_d - n_g \tag{48} $$

Equations (48) enable us to understand the origin of the name of right or left
"circular quanta". The action of the operator $a_d^\dagger$ on $|\chi_{n_d,n_g}\rangle$ yields a state with
one more quantum, to which, since $m$ has increased by one, an additional angular
momentum $+\hbar$ must be attributed (this corresponds to a counterclockwise
rotation about $Oz$). Similarly, $a_g^\dagger$ yields a state with one more quantum, of angular
momentum $-\hbar$ (clockwise rotation).

Since $n_d$ and $n_g$ are positive integers (or zero), our results are in agreement
with those of the preceding section: the eigenvalues of $H_{xy}$ are of the form $(n + 1)\hbar\omega$,
where $n$ is a positive integer or zero; their degree of degeneracy is $(n + 1)$ since,
for fixed $n$, we can have:

$$ \begin{array}{ll} n_d = n\,; & n_g = 0 \\ n_d = n - 1\,; & n_g = 1 \\ \quad\vdots & \\ n_d = 0\,; & n_g = n \end{array} \tag{49} $$

Furthermore, we see that the eigenvalues of $L_z$ are of the form $m\hbar$, where $m$
is a positive or negative integer or zero, which is the result that was established
for the general case in chapter VI. In addition, table (49) tells us which values of $m$
are associated with a given value of $n$. For example, for the ground state, we
have $n_d = n_g = 0$, and therefore, necessarily, $m = 0$; for the first excited state, we
can have $n_d = 1$ and $n_g = 0$, or $n_d = 0$ and $n_g = 1$, which yields either $m = +1$
or $m = -1$. In general, formulas (48) and (49) show that, for a given energy
level $(n + 1)\hbar\omega$, the possible values of $m$ are:

$$ m = n,\ n - 2,\ n - 4,\ \ldots,\ -n + 2,\ -n \tag{50} $$

It follows that, to a pair of values of $n$ and $m$, there corresponds a single vector (to
within a constant factor):

$$ |\chi_{\,n_d = \frac{n+m}{2},\ n_g = \frac{n-m}{2}}\rangle $$

$H_{xy}$ and $L_z$ therefore form a C.S.C.O. in $\mathcal{E}_{xy}$.
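The correspondence between the labels $(n, m)$ and $(n_d, n_g)$ described by (48) to (50) is simple enough to write as a short Python sketch. The function names are hypothetical and serve only to illustrate the bookkeeping.

```python
def allowed_m(n):
    """Values of m compatible with the level (n + 1) * hbar * omega, formula (50)."""
    return list(range(n, -n - 1, -2))


def circular_quanta(n, m):
    """Invert (48): return (n_d, n_g) for a given pair (n, m)."""
    if abs(m) > n or (n - m) % 2 != 0:
        raise ValueError("m must be one of n, n - 2, ..., -n")
    return (n + m) // 2, (n - m) // 2


if __name__ == "__main__":
    for n in range(4):
        print(n, [(m, circular_quanta(n, m)) for m in allowed_m(n)])
```

Each allowed pair $(n, m)$ maps to exactly one pair $(n_d, n_g)$, which is the content of the statement that $H_{xy}$ and $L_z$ form a C.S.C.O.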


comment:

For a given value of the total energy (labeled by $n$), the states $|\chi_{n_d=n,\,n_g=0}\rangle$
and $|\chi_{n_d=0,\,n_g=n}\rangle$ therefore correspond to the maximal ($n\hbar$) and minimal
($-n\hbar$) values of $L_z$. These states therefore recall the classical right and
left circular motions associated with a given value of the total energy, for
which $\mathcal{L}_z$ takes on its maximal and minimal values (see §1-a).
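
d. WAVE FUNCTIONS ASSOCIATED WITH THE EIGENSTATES
COMMON TO $H_{xy}$ AND $L_z$

To conserve the symmetry of the problem with respect to rotation about $Oz$, we shall
use polar coordinates, setting:

$$ x = \rho\cos\varphi \qquad \rho \geq 0 $$
$$ y = \rho\sin\varphi \qquad 0 \leq \varphi < 2\pi \tag{51} $$

Now, what is the action of the operators $a_d$ and $a_g$ on a function of $\rho$ and $\varphi$? We shall begin by
determining their action on a function of $x$ and $y$. Knowing that of $X$ and $P_x$ and therefore that
of $a_x$ (and, by analogy, that of $a_y$), we can use (40), which yields:

$$ a_d = \frac{1}{2}\left[\beta(x - iy) + \frac{1}{\beta}\left(\frac{\partial}{\partial x} - i\frac{\partial}{\partial y}\right)\right] \qquad a_g = \frac{1}{2}\left[\beta(x + iy) + \frac{1}{\beta}\left(\frac{\partial}{\partial x} + i\frac{\partial}{\partial y}\right)\right] \tag{52} $$

According to the rules for differentiating functions of several variables, we then obtain:

$$ a_d = \frac{e^{-i\varphi}}{2}\left[\beta\rho + \frac{1}{\beta}\frac{\partial}{\partial\rho} - \frac{i}{\beta\rho}\frac{\partial}{\partial\varphi}\right] \qquad a_g = \frac{e^{i\varphi}}{2}\left[\beta\rho + \frac{1}{\beta}\frac{\partial}{\partial\rho} + \frac{i}{\beta\rho}\frac{\partial}{\partial\varphi}\right] \tag{53} $$

Similarly:

$$ a_d^\dagger = \frac{e^{i\varphi}}{2}\left[\beta\rho - \frac{1}{\beta}\frac{\partial}{\partial\rho} - \frac{i}{\beta\rho}\frac{\partial}{\partial\varphi}\right] \tag{54} $$

$$ a_g^\dagger = \frac{e^{-i\varphi}}{2}\left[\beta\rho - \frac{1}{\beta}\frac{\partial}{\partial\rho} + \frac{i}{\beta\rho}\frac{\partial}{\partial\varphi}\right] \tag{55} $$

To calculate the wave functions $\chi_{n_d,n_g}(\rho, \varphi)$, simply apply the differential operators
which represent $a_d^\dagger$ and $a_g^\dagger$ to the function $\chi_{0,0}(\rho, \varphi)$, which is, according to (31):

$$ \chi_{0,0}(\rho, \varphi) = \frac{\beta}{\sqrt{\pi}}\, e^{-\beta^2\rho^2/2} \tag{56} $$

Now it can be seen from (54) and (55) that the action of $a_d^\dagger$ (or of $a_g^\dagger$) on a function of the
form $e^{im\varphi}F(\rho)$ is given by:

$$ a_d^\dagger\left[e^{im\varphi}F(\rho)\right] = \frac{e^{i(m+1)\varphi}}{2}\left[\left(\beta\rho + \frac{m}{\beta\rho}\right)F(\rho) - \frac{1}{\beta}\frac{dF}{d\rho}\right] $$
$$ a_g^\dagger\left[e^{im\varphi}F(\rho)\right] = \frac{e^{i(m-1)\varphi}}{2}\left[\left(\beta\rho - \frac{m}{\beta\rho}\right)F(\rho) - \frac{1}{\beta}\frac{dF}{d\rho}\right] \tag{57} $$

Through the repeated application of these relations to (56), we see that the $\varphi$-dependence
of $\chi_{n_d,n_g}(\rho, \varphi)$ is simply given by $e^{i(n_d - n_g)\varphi}$. This is a general result, established in chapter VI:
the $\varphi$-dependence of an eigenfunction of $L_z$ of eigenvalue $m\hbar$ is $e^{im\varphi}$.

If, in (57), we choose $F(\rho) = \rho^m e^{-\beta^2\rho^2/2}$, then:

$$ a_d^\dagger\left[e^{im\varphi}\rho^m e^{-\beta^2\rho^2/2}\right] = \beta\, e^{i(m+1)\varphi}\rho^{m+1} e^{-\beta^2\rho^2/2} \tag{58} $$

Applying the operator $a_d^\dagger$ to the function $\chi_{0,0}(\rho)$ $n_d$ times, we obtain:

$$ \chi_{n_d,0}(\rho, \varphi) = \frac{\beta}{\sqrt{\pi\,(n_d)!}}\; e^{i n_d\varphi}\,(\beta\rho)^{n_d}\, e^{-\beta^2\rho^2/2} \tag{59} $$

An analogous calculation yields:

$$ \chi_{0,n_g}(\rho, \varphi) = \frac{\beta}{\sqrt{\pi\,(n_g)!}}\; e^{-i n_g\varphi}\,(\beta\rho)^{n_g}\, e^{-\beta^2\rho^2/2} \tag{60} $$

These wave functions are normalized. For a given energy level $(n + 1)\hbar\omega$, the wave
functions (59) and (60) correspond to the limiting values $+n$ and $-n$ of the quantum
number $m$. Their $\rho$-dependence is particularly simple: their modulus reaches a maximum
for $\rho = \sqrt{n}/\beta$. Therefore (as in the case of a one-dimensional harmonic oscillator), the
spatial spread of these wave functions increases with the energy $(n + 1)\hbar\omega$ with which they
are associated.

In the same way, application of the operators $a_g^\dagger$ (or $a_d^\dagger$) to (59) and (60) permits the
construction of the functions $\chi_{n_d,n_g}(\rho, \varphi)$ for any $n_d$ and $n_g$. The results obtained for the first
excited levels are given in table I.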

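n = 0    m = 0     $\chi_{0,0}(\rho) = \dfrac{\beta}{\sqrt{\pi}}\, e^{-\beta^2\rho^2/2}$

n = 1    m = 1     $\chi_{1,0}(\rho, \varphi) = \dfrac{\beta}{\sqrt{\pi}}\, \beta\rho\, e^{-\beta^2\rho^2/2}\, e^{i\varphi}$

         m = -1    $\chi_{0,1}(\rho, \varphi) = \dfrac{\beta}{\sqrt{\pi}}\, \beta\rho\, e^{-\beta^2\rho^2/2}\, e^{-i\varphi}$

n = 2    m = 2     $\chi_{2,0}(\rho, \varphi) = \dfrac{\beta}{\sqrt{2\pi}}\, (\beta\rho)^2\, e^{-\beta^2\rho^2/2}\, e^{2i\varphi}$

         m = 0     $\chi_{1,1}(\rho, \varphi) = \dfrac{\beta}{\sqrt{\pi}}\, \left[(\beta\rho)^2 - 1\right] e^{-\beta^2\rho^2/2}$

         m = -2    $\chi_{0,2}(\rho, \varphi) = \dfrac{\beta}{\sqrt{2\pi}}\, (\beta\rho)^2\, e^{-\beta^2\rho^2/2}\, e^{-2i\varphi}$


TABLE I


Eigenfunctions common to the Hamiltonian and the observable $L_z$, for the first levels of the
two-dimensional harmonic oscillator.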
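One entry of table I can be cross-checked against the Cartesian wave functions (31). From (40) and (47), $a_d^\dagger a_g^\dagger = \frac{1}{2}\left[(a_x^\dagger)^2 + (a_y^\dagger)^2\right]$, so that $|\chi_{1,1}\rangle = (|\varphi_{2,0}\rangle + |\varphi_{0,2}\rangle)/\sqrt{2}$. The Python sketch below compares the two expressions at a few points; the function names and the value chosen for $\beta$ are arbitrary choices of this illustration.

```python
import math


def hermite(n, u):
    """Hermite polynomial H_n(u) from the recurrence H_{k+1} = 2u H_k - 2k H_{k-1}."""
    h_prev, h = 1.0, 2.0 * u
    if n == 0:
        return h_prev
    for k in range(1, n):
        h_prev, h = h, 2.0 * u * h - 2.0 * k * h_prev
    return h


def phi(n_x, n_y, x, y, beta=1.0):
    """Cartesian stationary wave function, formula (31)."""
    norm = beta / math.sqrt(
        math.pi * 2 ** (n_x + n_y) * math.factorial(n_x) * math.factorial(n_y)
    )
    gauss = math.exp(-beta ** 2 * (x * x + y * y) / 2.0)
    return norm * gauss * hermite(n_x, beta * x) * hermite(n_y, beta * y)


def chi_11(rho, beta=1.0):
    """Entry n = 2, m = 0 of table I (no dependence on the polar angle)."""
    return beta / math.sqrt(math.pi) * ((beta * rho) ** 2 - 1.0) * math.exp(-(beta * rho) ** 2 / 2.0)


def chi_11_from_cartesian(x, y, beta=1.0):
    """(phi_{2,0} + phi_{0,2}) / sqrt(2), evaluated in Cartesian coordinates."""
    return (phi(2, 0, x, y, beta) + phi(0, 2, x, y, beta)) / math.sqrt(2.0)


if __name__ == "__main__":
    for x, y in [(0.3, -1.1), (1.7, 0.4), (0.0, 0.0)]:
        rho = math.hypot(x, y)
        print(chi_11(rho, 1.3), chi_11_from_cartesian(x, y, 1.3))
```

The two columns printed should agree to rounding accuracy, which illustrates how a state of well-defined $m$ is a superposition of several states $|\varphi_{n_x,n_y}\rangle$ of the same level.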

comment:

The functions $\chi_{n_d,0}(\rho, \varphi)$ given in (59) are proportional to $e^{-\beta^2\rho^2/2}\,(\beta\rho\, e^{i\varphi})^{n_d}$.
More generally, all their linear combinations are of the form:

$$ F(\rho, \varphi) = e^{-\beta^2\rho^2/2}\, f(\beta\rho\, e^{i\varphi}) \tag{61} $$

(where $f$ is an arbitrary function of one variable) and are eigenfunctions of $N_g$ with the
eigenvalue zero. It can easily be shown from (55) that:

$$ a_g F(\rho, \varphi) = 0 \tag{62} $$

Similarly, the subspace of eigenfunctions of $N_d$ of eigenvalue zero is composed of
functions of the form:

$$ G(\rho, \varphi) = e^{-\beta^2\rho^2/2}\, g(\beta\rho\, e^{-i\varphi}) \tag{63} $$


4. Quasi-classical states 

Using the properties of the one-dimensional harmonic oscillator, we can easily
calculate the time evolution of the state vector and the mean values of the various
observables of the two-dimensional oscillator. For example, it is not difficult to
show that in the mean values $\langle X \rangle(t)$ and $\langle Y \rangle(t)$, as well as $\langle P_x \rangle(t)$ and $\langle P_y \rangle(t)$,
only the Bohr frequency $\omega$ appears. Moreover, it can be shown that these mean
values exactly obey the classical equations of motion. In this section, we shall
be concerned with the properties and evolution of the quasi-classical states of the
two-dimensional harmonic oscillator.

a. DEFINITION OF THE STATES $|\alpha_x, \alpha_y\rangle$ AND $|\alpha_d, \alpha_g\rangle$

To construct a quasi-classical state of the two-dimensional harmonic oscillator,
we can base our reasoning on the one-dimensional oscillator (cf. complement $G_V$).
Recall that, in a quasi-classical state associated with a given classical
motion, the mean values $\langle X \rangle(t)$ and $\langle P \rangle(t)$ coincide at each instant with $x(t)$
and $p(t)$. Similarly, the mean value of the Hamiltonian $H$ is equal (to within a
half-quantum $\hbar\omega/2$) to the classical energy. We showed in complement $G_V$ that, at any
time, the quasi-classical states are eigenstates of the destruction operator $a$ and can
be written:

$$ |\alpha\rangle = \sum_n c_n(\alpha)\, |\varphi_n\rangle \tag{64} $$

where $\alpha$ is the eigenvalue of $a$, and:

$$ c_n(\alpha) = \frac{\alpha^n}{\sqrt{n!}}\, e^{-|\alpha|^2/2} \tag{65} $$
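The coefficients (65) define a Poisson distribution for the number of quanta. The short Python sketch below sums the series numerically to illustrate that the state (64) is normalized and that the mean number of quanta and its root-mean-square deviation are $|\alpha|^2$ and $|\alpha|$. The cutoff `n_max` and the function names are assumptions of this example; the cutoff must be large compared with $|\alpha|^2$.

```python
import math


def c_n(n, alpha):
    """Expansion coefficient of the quasi-classical state, formula (65)."""
    return alpha ** n / math.sqrt(math.factorial(n)) * math.exp(-abs(alpha) ** 2 / 2.0)


def number_statistics(alpha, n_max=80):
    """Return (norm, mean of N, root-mean-square deviation of N) from a truncated sum."""
    probs = [abs(c_n(n, alpha)) ** 2 for n in range(n_max)]
    total = sum(probs)
    mean = sum(n * p for n, p in enumerate(probs))
    mean_sq = sum(n * n * p for n, p in enumerate(probs))
    return total, mean, math.sqrt(mean_sq - mean ** 2)


if __name__ == "__main__":
    alpha = 1.2 + 0.9j          # |alpha|^2 = 2.25
    print(number_statistics(alpha))
```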

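In the case which concerns us here, we can use the rules of the tensor product
to obtain the quasi-classical states in the form:

$$ |\alpha_x, \alpha_y\rangle = |\alpha_x\rangle \otimes |\alpha_y\rangle = \sum_{n_x=0}^{\infty} \sum_{n_y=0}^{\infty} c_{n_x}(\alpha_x)\, c_{n_y}(\alpha_y)\, |\varphi_{n_x,n_y}\rangle \tag{66} $$

with:

$$ a_x\, |\alpha_x, \alpha_y\rangle = \alpha_x\, |\alpha_x, \alpha_y\rangle $$
$$ a_y\, |\alpha_x, \alpha_y\rangle = \alpha_y\, |\alpha_x, \alpha_y\rangle \tag{67} $$

We are then sure that $\langle X \rangle$, $\langle P_x \rangle$, $\langle H_x \rangle$, $\langle Y \rangle$, $\langle P_y \rangle$, $\langle H_y \rangle$ are the same as the
corresponding classical quantities. Now, returning to definition (40) and using (67),
we see that:

$$ a_d\, |\alpha_x, \alpha_y\rangle = \alpha_d\, |\alpha_x, \alpha_y\rangle $$
$$ a_g\, |\alpha_x, \alpha_y\rangle = \alpha_g\, |\alpha_x, \alpha_y\rangle \tag{68} $$

with:

$$ \alpha_d = \frac{1}{\sqrt{2}}\,(\alpha_x - i\alpha_y) \qquad \alpha_g = \frac{1}{\sqrt{2}}\,(\alpha_x + i\alpha_y) \tag{69} $$

Therefore, the state $|\alpha_x, \alpha_y\rangle$ is also an eigenvector of $a_d$ and $a_g$ with the eigenvalues
given in (69). We shall denote by $|\alpha_d, \alpha_g\rangle$ the eigenvector common to $a_d$ and $a_g$
associated with the eigenvalues $\alpha_d$ and $\alpha_g$. It is easy to show that the expansion
of $|\alpha_d, \alpha_g\rangle$ on the $\{|\chi_{n_d,n_g}\rangle\}$ basis has the same form as that of $|\alpha_x, \alpha_y\rangle$ on the
$\{|\varphi_{n_x,n_y}\rangle\}$ basis:

$$ |\alpha_d, \alpha_g\rangle = \sum_{n_d=0}^{\infty} \sum_{n_g=0}^{\infty} c_{n_d}(\alpha_d)\, c_{n_g}(\alpha_g)\, |\chi_{n_d,n_g}\rangle \tag{70} $$

where the coefficients $c_n$ are given by (65). It follows from (68) and (69) that:

$$ |\alpha_x, \alpha_y\rangle = \left|\,\alpha_d = \frac{\alpha_x - i\alpha_y}{\sqrt{2}},\ \alpha_g = \frac{\alpha_x + i\alpha_y}{\sqrt{2}}\right\rangle \tag{71} $$

Because of the properties of the states $|\alpha\rangle$ (cf. complement $G_V$, §3-a), we see
that if:

$$ |\psi(0)\rangle = |\alpha_x, \alpha_y\rangle = |\alpha_d, \alpha_g\rangle \tag{72} $$

the state vector at the instant $t$ will be:

$$ |\psi(t)\rangle = e^{-i\omega t}\, |e^{-i\omega t}\alpha_x,\ e^{-i\omega t}\alpha_y\rangle = e^{-i\omega t}\, |e^{-i\omega t}\alpha_d,\ e^{-i\omega t}\alpha_g\rangle \tag{73} $$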

b. MEAN VALUES AND ROOT-MEAN-SQUARE DEVIATIONS 
OF THE VARIOUS OBSERVABLES 

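We set:

$$ \alpha_x = |\alpha_x|\, e^{i\varphi_x} \qquad \alpha_y = |\alpha_y|\, e^{i\varphi_y} \tag{74} $$

Using formulas (93) of complement $G_V$, we obtain:

$$ \langle X \rangle(t) = \frac{\sqrt{2}}{\beta}\, |\alpha_x| \cos(\omega t - \varphi_x) \qquad \langle Y \rangle(t) = \frac{\sqrt{2}}{\beta}\, |\alpha_y| \cos(\omega t - \varphi_y) \tag{75} $$

$$ \langle P_x \rangle(t) = -\mu\omega\, \frac{\sqrt{2}}{\beta}\, |\alpha_x| \sin(\omega t - \varphi_x) \qquad \langle P_y \rangle(t) = -\mu\omega\, \frac{\sqrt{2}}{\beta}\, |\alpha_y| \sin(\omega t - \varphi_y) \tag{76} $$

Comparing (75) and (76) with (5) and (6), we see that:

$$ \alpha_x = \frac{\beta\, x_M\, e^{i\varphi_x}}{\sqrt{2}} \qquad \alpha_y = \frac{\beta\, y_M\, e^{i\varphi_y}}{\sqrt{2}} \tag{77} $$

where $x_M$, $y_M$, $\varphi_x$, $\varphi_y$ are the parameters defining the classical motion which the
state $|\alpha_x, \alpha_y\rangle$ best reproduces.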



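Also:

$$ \langle N_x \rangle = |\alpha_x|^2 \qquad \langle N_y \rangle = |\alpha_y|^2 \tag{78} $$

and:

$$ \langle N_d \rangle = |\alpha_d|^2 = \frac{1}{2}\left[\,|\alpha_x|^2 + |\alpha_y|^2 + i(\alpha_x\alpha_y^* - \alpha_x^*\alpha_y)\right] $$
$$ \langle N_g \rangle = |\alpha_g|^2 = \frac{1}{2}\left[\,|\alpha_x|^2 + |\alpha_y|^2 - i(\alpha_x\alpha_y^* - \alpha_x^*\alpha_y)\right] \tag{79} $$

that is, according to (46):

$$ \langle H_{xy} \rangle = \hbar\omega\left(|\alpha_x|^2 + |\alpha_y|^2 + 1\right) = \hbar\omega\left(|\alpha_d|^2 + |\alpha_g|^2 + 1\right) \tag{80} $$

and:

$$ \langle L_z \rangle = 2\hbar\, |\alpha_x|\, |\alpha_y| \sin(\varphi_y - \varphi_x) = \hbar\left(|\alpha_d|^2 - |\alpha_g|^2\right) \tag{81} $$

According to (77), $\langle L_z \rangle$ is the same as the classical value of $\mathcal{L}_z$ [formula (10)].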
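Formulas (69) and (79) to (81) lend themselves to a quick numerical check. In the Python sketch below (function names are hypothetical; energies are in units of $\hbar\omega$ and angular momenta in units of $\hbar$), the circular amplitudes are computed from $\alpha_x$ and $\alpha_y$ and the two forms of $\langle H_{xy} \rangle$ and $\langle L_z \rangle$ are compared.

```python
import cmath
import math


def circular_amplitudes(alpha_x, alpha_y):
    """Eigenvalues alpha_d and alpha_g of a_d and a_g, formula (69)."""
    alpha_d = (alpha_x - 1j * alpha_y) / math.sqrt(2.0)
    alpha_g = (alpha_x + 1j * alpha_y) / math.sqrt(2.0)
    return alpha_d, alpha_g


def mean_values(alpha_x, alpha_y):
    """Return (<H_xy> / hbar omega, <L_z> / hbar) from the right-hand sides of (80), (81)."""
    alpha_d, alpha_g = circular_amplitudes(alpha_x, alpha_y)
    energy = abs(alpha_d) ** 2 + abs(alpha_g) ** 2 + 1.0
    l_z = abs(alpha_d) ** 2 - abs(alpha_g) ** 2
    return energy, l_z


if __name__ == "__main__":
    a_x = 2.0 * cmath.exp(0.3j)
    a_y = 1.5 * cmath.exp(1.1j)
    energy, l_z = mean_values(a_x, a_y)
    # Left-hand forms of (80) and (81), written with moduli and phases.
    print(energy, abs(a_x) ** 2 + abs(a_y) ** 2 + 1.0)
    print(l_z, 2.0 * abs(a_x) * abs(a_y) * math.sin(1.1 - 0.3))
```

For $\alpha_y = i\alpha_x$ (equal amplitudes, phase difference $\pi/2$) one finds $\alpha_g = 0$: the state then contains only right circular quanta, and $\langle L_z \rangle$ takes the largest value compatible with the given energy.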

Now let us consider the root-mean-square deviations of the position and
momentum and then of the energy and angular momentum in a state $|\alpha_x, \alpha_y\rangle$.
Directly applying the results of complement $G_V$, we obtain:

$$ \Delta X = \Delta Y = \frac{1}{\beta\sqrt{2}} \qquad \Delta P_x = \Delta P_y = \frac{\beta\hbar}{\sqrt{2}} \tag{82} $$

The root-mean-square deviations of the position and momentum are independent
of $\alpha_x$ and $\alpha_y$; if $|\alpha_x|$ and $|\alpha_y|$ are much greater than 1, the position and momentum
of the oscillator have a very small spread about $\langle X \rangle$, $\langle Y \rangle$ and $\langle P_x \rangle$, $\langle P_y \rangle$.
Finally, let us calculate the root-mean-square deviations $\Delta H_{xy}$ for the energy
and $\Delta L_z$ for the angular momentum. As in complement $G_V$:

$$ \Delta N_x = |\alpha_x| \qquad \Delta N_y = |\alpha_y| \qquad \Delta N_d = |\alpha_d| \qquad \Delta N_g = |\alpha_g| \tag{83} $$

But the Hamiltonian $H_{xy}$ involves $N = N_x + N_y$, and $L_z$ is proportional to $N_d - N_g$.
We must now calculate, for example:

$$ (\Delta N)^2 = \langle (N_x + N_y)^2 \rangle - \langle N_x + N_y \rangle^2 = (\Delta N_x)^2 + (\Delta N_y)^2 + 2\left[\langle N_x N_y \rangle - \langle N_x \rangle \langle N_y \rangle\right] \tag{84} $$

According to (66), the state of the system is a tensor product, which means that
the observables $N_x$ and $N_y$ are not correlated:

$$ \langle N_x N_y \rangle = \langle N_x \rangle \langle N_y \rangle \tag{85} $$

It follows that:

$$ (\Delta N)^2 = (\Delta N_x)^2 + (\Delta N_y)^2 \tag{86} $$

that is:

$$ \Delta H_{xy} = \hbar\omega \sqrt{|\alpha_x|^2 + |\alpha_y|^2} = \hbar\omega \sqrt{|\alpha_d|^2 + |\alpha_g|^2} \tag{87} $$

Similarly:

$$ \Delta L_z = \hbar \sqrt{|\alpha_d|^2 + |\alpha_g|^2} = \hbar \sqrt{|\alpha_x|^2 + |\alpha_y|^2} \tag{88} $$

A CHARGED PARTICLE IN A MAGNETIC FIELD:
LANDAU LEVELS


1. Review of the classical problem 

a. Motion of the particle 

b. The vector potential. The classical Lagrangian and Hamiltonian 

c. Constants of the motion in a uniform field 

2. General quantum mechanical properties of a particle in a magnetic field 

a. Quantization. Hamiltonian 

b. Commutation relations 

c. Physical consequences 

3. Case of a uniform magnetic field 

a. Eigenvalues of the Hamiltonian 

b. Study of the observables in a particular gauge 

c. Study of the stationary states 

d. Time evolution 

Thus far, we have been considering, for various special cases, the properties of
a particle subjected to a scalar potential $V(\mathbf{r})$ (representing, for example, the effect
of an electric field on a charged particle). Chapter V (the harmonic oscillator)
and chapter VII (particle subjected to a central potential) treat other examples of
scalar potentials. Here we shall be concerned with a complementary problem,
that of the properties of a particle subjected to a vector potential $\mathbf{A}(\mathbf{r})$ (a charged
particle placed in a magnetic field). We shall encounter a number of purely quantum
mechanical effects, such as equally spaced energy levels in a uniform magnetic
field (Landau levels)*. Before studying the problem from a quantum mechanical
point of view, we shall rapidly review some classical results.

* This equal spacing is, as we shall show, a consequence of the properties of the harmonic
oscillator, and it could have been treated in chapter V. However, we shall also see that the properties of angular
momentum are useful in the study and classification of the stationary states of the particle. This is why
this complement follows chapter VI.


1. Review of the classical problem

a. MOTION OF THE PARTICLE

When a particle of position $\mathbf{r}$ and charge $q$ is subject to a magnetic field $\mathbf{B}(\mathbf{r})$,
the force $\mathbf{f}$ exerted on it is given by the Lorentz force:

$$ \mathbf{f} = q\, \mathbf{v} \times \mathbf{B}(\mathbf{r}) \tag{1} $$

where:

$$ \mathbf{v} = \frac{d\mathbf{r}}{dt} \tag{2} $$

is the velocity of the particle. Its motion obeys the fundamental law of dynamics:

$$ \mu\, \frac{d\mathbf{v}}{dt} = \mathbf{f} \tag{3} $$

(where $\mu$ is the mass of the particle).

In the rest of this complement, we shall often be considering the case in which
the magnetic field is uniform; we shall choose its direction along the $Oz$ axis.
By solving the equation of motion (3), one can show that in this case the three
coordinates $x(t)$, $y(t)$ and $z(t)$ of the particle are given by:

$$ x(t) = x_0 + \sigma \cos(\omega_c t - \varphi_0) $$
$$ y(t) = y_0 + \sigma \sin(\omega_c t - \varphi_0) \tag{4} $$
$$ z(t) = v_{0z}\, t + z_0 $$

where $x_0$, $y_0$, $z_0$, $\sigma$, $\varphi_0$ and $v_{0z}$ are six constant parameters which depend on the
initial conditions; the "cyclotron frequency" $\omega_c$ is given by:

$$ \omega_c = -\frac{qB}{\mu} \tag{5} $$
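As an order-of-magnitude illustration of (5), the following Python sketch evaluates the cyclotron frequency of an electron in a field of 1 gauss ($10^{-4}$ tesla), using the values of the electron charge and mass listed in the table of constants of this book. The function and constant names are choices of this example.

```python
import math

ELECTRON_CHARGE = -1.602189e-19   # coulomb (negative for the electron)
ELECTRON_MASS = 9.10953e-31       # kilogram


def cyclotron_angular_frequency(q, b, mu):
    """omega_c = -q B / mu, formula (5); positive for a negative charge."""
    return -q * b / mu


omega_c = cyclotron_angular_frequency(ELECTRON_CHARGE, 1.0e-4, ELECTRON_MASS)
nu_c = omega_c / (2.0 * math.pi)

if __name__ == "__main__":
    print(f"omega_c = {omega_c:.4e} rad/s, nu_c = {nu_c / 1e6:.3f} MHz")
```

The result is close to the 2.8 MHz quoted for a 1 gauss field in the list of useful orders of magnitude.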


Equations (4) show that the projection of the position $M$ of the particle onto
the $xOy$ plane performs a uniform circular motion, of angular velocity $\omega_c$ and
initial phase $\varphi_0$, on a circle of radius $\sigma$ whose center is the point $C_0$, with coordinates
$x_0$ and $y_0$. The motion of the projection of $M$ onto $Oz$ is simply rectilinear
and uniform. It follows that the particle moves in space along a circular
helix (cf. fig. 1), whose axis is parallel to $Oz$ and passes through $C_0$.



FIGURE 1

Classical trajectory of a charged particle in a uniform magnetic field parallel
to $Oz$. The particle moves at constant velocity along a circular helix
whose axis, parallel to $Oz$, passes through the point $C_0$. The figure is
drawn for $q < 0$ (the case of the electron), that is, $\omega_c > 0$.

If we are concerned only with the motion of the point $Q$, the projection of $M$
onto the $xOy$ plane, we study the behavior of the vector:

$$ \boldsymbol{\rho} = x\, \mathbf{e}_x + y\, \mathbf{e}_y \tag{6} $$

(where $\mathbf{e}_x$ and $\mathbf{e}_y$ are the unit vectors of the $Ox$ and $Oy$ axes). The velocity of $Q$ is:

$$ \mathbf{v}_\perp = \frac{d\boldsymbol{\rho}}{dt} \tag{7} $$

It is therefore convenient to introduce the components $x'$ and $y'$ of the vector $\mathbf{C_0Q}$:

$$ x' = x - x_0 \qquad y' = y - y_0 \tag{8} $$

Since $Q$ performs a uniform circular motion about $C_0$, we have:

$$ \mathbf{v}_\perp = \omega_c\, \mathbf{e}_z \times \mathbf{C_0Q} \tag{9} $$

(where $\mathbf{e}_z$ is the unit vector of $Oz$). This implies that the coordinates $x_0$ and $y_0$
of $C_0$ are related to the coordinates of $Q$ and to the components of $\mathbf{v}_\perp$ by:

$$ x_0 = x - \frac{1}{\omega_c}\, v_y \qquad y_0 = y + \frac{1}{\omega_c}\, v_x \tag{10} $$
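Relation (10) says that the center of the circle can be recovered at any instant from the position and the velocity alone. A minimal Python sketch (the function names and the numerical values are arbitrary choices of this illustration) evaluates the trajectory (4), differentiates it analytically, and applies (10):

```python
import math


def trajectory(t, x0, y0, sigma, omega_c, phi0):
    """Position and velocity in the xOy plane from formulas (4)."""
    arg = omega_c * t - phi0
    x = x0 + sigma * math.cos(arg)
    y = y0 + sigma * math.sin(arg)
    v_x = -sigma * omega_c * math.sin(arg)
    v_y = sigma * omega_c * math.cos(arg)
    return x, y, v_x, v_y


def circle_center(x, y, v_x, v_y, omega_c):
    """Coordinates (x_0, y_0) of C_0 from formula (10)."""
    return x - v_y / omega_c, y + v_x / omega_c


if __name__ == "__main__":
    for t in (0.0, 0.4, 1.3, 7.9):
        state = trajectory(t, x0=0.7, y0=-0.4, sigma=1.3, omega_c=4.0, phi0=0.2)
        print(t, circle_center(*state, omega_c=4.0))
```

Every line printed should show the same pair, here (0.7, -0.4), up to rounding: $x_0$ and $y_0$ are constants of the motion.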


b. THE VECTOR POTENTIAL. THE CLASSICAL LAGRANGIAN AND HAMILTONIAN 

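To describe the magnetic field $\mathbf{B}(\mathbf{r})$, one can use a vector potential $\mathbf{A}(\mathbf{r})$ which
is, by definition, related to $\mathbf{B}(\mathbf{r})$ by:

$$ \mathbf{B}(\mathbf{r}) = \nabla \times \mathbf{A}(\mathbf{r}) \tag{11} $$

For example, if the field $\mathbf{B}$ is uniform, one can choose:

$$ \mathbf{A}(\mathbf{r}) = -\frac{1}{2}\, \mathbf{r} \times \mathbf{B} \tag{12} $$

We know, furthermore, that when $\mathbf{B}(\mathbf{r})$ is given, condition (11) does not determine
$\mathbf{A}(\mathbf{r})$ uniquely: a gradient of an arbitrary function of $\mathbf{r}$ can be added to $\mathbf{A}(\mathbf{r})$
without changing $\mathbf{B}(\mathbf{r})$*.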
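The gauge freedom can be illustrated numerically for a uniform field along $Oz$. The Python sketch below compares the potential (12) with the alternative choice $A_x = 0$, $A_y = xB$, $A_z = 0$ by computing their curls with centered finite differences. The function names, the step `h` and the sample point are assumptions of this example.

```python
def symmetric_gauge(x, y, z, b):
    """A = -(1/2) r x B for B = b e_z, formula (12)."""
    return (-0.5 * y * b, 0.5 * x * b, 0.0)


def second_gauge(x, y, z, b):
    """Alternative potential with components A_x = 0, A_y = x B, A_z = 0."""
    return (0.0, x * b, 0.0)


def curl(field, x, y, z, b, h=1e-5):
    """Curl of a vector field at (x, y, z), from centered finite differences."""
    def d(component, axis):
        shift = [0.0, 0.0, 0.0]
        shift[axis] = h
        plus = field(x + shift[0], y + shift[1], z + shift[2], b)[component]
        minus = field(x - shift[0], y - shift[1], z - shift[2], b)[component]
        return (plus - minus) / (2.0 * h)

    return (d(2, 1) - d(1, 2), d(0, 2) - d(2, 0), d(1, 0) - d(0, 1))


if __name__ == "__main__":
    point = (0.3, -1.2, 2.0)
    print(curl(symmetric_gauge, *point, b=1.5))
    print(curl(second_gauge, *point, b=1.5))
```

Both potentials give the same field $(0, 0, B)$. Their difference, $(yB/2,\ xB/2,\ 0)$, is the gradient of the function $xyB/2$.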

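It can be shown (cf. appendix III, §4-b) that the Lagrange function $\mathcal{L}(\mathbf{r}, \mathbf{v})$
of the particle is given by:

$$ \mathcal{L}(\mathbf{r}, \mathbf{v}) = \frac{1}{2}\mu \mathbf{v}^2 + q\, \mathbf{v} \cdot \mathbf{A}(\mathbf{r}) \tag{13} $$

It follows that $\mathbf{p}$, the conjugate momentum of the position $\mathbf{r}$, is related to $\mathbf{v}$ and $\mathbf{A}(\mathbf{r})$
by:

$$ \mathbf{p} = \nabla_{\mathbf{v}}\, \mathcal{L}(\mathbf{r}, \mathbf{v}) = \mu \mathbf{v} + q \mathbf{A}(\mathbf{r}) \tag{14} $$


* For example, for a uniform field parallel to $Oz$, one could choose, instead of the vector $\mathbf{A}(\mathbf{r})$
given by (12), the vector whose components are $A_x = 0$, $A_y = xB$, $A_z = 0$.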
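
The classical Hamiltonian $\mathcal{H}(\mathbf{r}, \mathbf{p})$ is then:

$$ \mathcal{H}(\mathbf{r}, \mathbf{p}) = \frac{1}{2\mu}\left[\mathbf{p} - q\mathbf{A}(\mathbf{r})\right]^2 \tag{15} $$

It will prove convenient to set:

$$ \mathcal{H}(\mathbf{r}, \mathbf{p}) = \mathcal{H}_\perp(\mathbf{r}, \mathbf{p}) + \mathcal{H}_\parallel(\mathbf{r}, \mathbf{p}) \tag{16} $$

with:

$$ \mathcal{H}_\perp(\mathbf{r}, \mathbf{p}) = \frac{1}{2\mu}\left\{\left[p_x - qA_x(\mathbf{r})\right]^2 + \left[p_y - qA_y(\mathbf{r})\right]^2\right\} $$
$$ \mathcal{H}_\parallel(\mathbf{r}, \mathbf{p}) = \frac{1}{2\mu}\left[p_z - qA_z(\mathbf{r})\right]^2 \tag{17} $$


comment:

In this case, unlike that of a scalar potential $V(\mathbf{r})$, relation (14) shows
that the momentum $\mathbf{p}$ is not equal to the mechanical momentum $\mu\mathbf{v}$. Also,
comparing (14) with (15), we see that $\mathcal{H}$ is equal to the kinetic energy $\mu\mathbf{v}^2/2$ of
the particle; this results from the fact that since the Lorentz force written in (1)
is always perpendicular to $\mathbf{v}$, it does no work during the motion. Similarly, it
must be noted that the angular momentum:

$$ \boldsymbol{\mathcal{L}} = \mathbf{r} \times \mathbf{p} \tag{18} $$

is different from the moment of the mechanical momentum $\mu\mathbf{v}$:

$$ \boldsymbol{\lambda} = \mathbf{r} \times \mu\mathbf{v} \tag{19} $$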

c. CONSTANTS OF THE MOTION IN A UNIFORM FIELD

Consider the special case in which the field $\mathbf{B}$ is uniform. The motion of the particle (§1-a)
is such that $\mathcal{H}_\perp$ and $\mathcal{H}_\parallel$, defined in (17), are constants of the motion*.

If we substitute (14) into (10), we obtain:

$$ x_0 = x - \frac{1}{\mu\omega_c}\left[p_y - qA_y(\mathbf{r})\right] $$
$$ y_0 = y + \frac{1}{\mu\omega_c}\left[p_x - qA_x(\mathbf{r})\right] \tag{20} $$

It follows that the radius $\sigma$ of the helical trajectory satisfies:

$$ \sigma^2 = (x - x_0)^2 + (y - y_0)^2 = \frac{1}{(\mu\omega_c)^2}\left\{\left[p_y - qA_y(\mathbf{r})\right]^2 + \left[p_x - qA_x(\mathbf{r})\right]^2\right\} \tag{21} $$

$\sigma^2$ is therefore proportional to the Hamiltonian $\mathcal{H}_\perp$.

* This follows from the fact that, according to (14) and (17), $\mathcal{H}_\perp$ and $\mathcal{H}_\parallel$ are equal, respectively, to
the kinetic energies $\mu v_\perp^2/2$ and $\mu v_\parallel^2/2$ associated with the motions which are perpendicular and parallel to $Oz$.

Similarly, let $\boldsymbol{\theta}$ be the moment of the mechanical momentum $\mu\mathbf{v}$ with respect to the
center $C_0$ of the circle:

$$ \boldsymbol{\theta} = \mathbf{C_0M} \times \mu\mathbf{v} \tag{22} $$

The component $\theta_z$ of this moment can then be written, with (20) taken into account:

$$ \theta_z = \mu\left[(x - x_0)\, v_y - (y - y_0)\, v_x\right] = \frac{1}{\mu\omega_c}\left\{\left[p_y - qA_y(\mathbf{r})\right]^2 + \left[p_x - qA_x(\mathbf{r})\right]^2\right\} = \frac{2}{\omega_c}\, \mathcal{H}_\perp \tag{23} $$

$\theta_z$ is therefore a constant of the motion, as might have been expected. On the other hand,
the component $\lambda_z$ of the moment of the mechanical momentum $\mu\mathbf{v}$ with respect to $O$ is not, in
general, constant, since:

$$ \lambda_z = \theta_z + \mu\left[x_0\, v_y(t) - y_0\, v_x(t)\right] \tag{24} $$

Therefore, according to (4), $\lambda_z$ varies sinusoidally in time.

Finally, consider the projection $\mathcal{L}_z$ onto $Oz$ of the angular momentum $\boldsymbol{\mathcal{L}}$:

$$ \mathcal{L}_z = x\, p_y - y\, p_x \tag{25} $$

According to (14), it can be written:

$$ \mathcal{L}_z = x\left[\mu v_y + qA_y(\mathbf{r})\right] - y\left[\mu v_x + qA_x(\mathbf{r})\right] \tag{26} $$

It therefore depends explicitly on the gauge chosen, that is, on the vector potential $\mathbf{A}$ picked
to describe the magnetic field. In most cases, $\mathcal{L}_z$ is not a constant of the motion. Nevertheless,
if one chooses the gauge given in (12), one obtains from (4):

$$ \mathcal{L}_z = \frac{qB}{2}\left(x_0^2 + y_0^2 - \sigma^2\right) \tag{27} $$

$\mathcal{L}_z$ is then a constant of the motion.

Relation (27) does not have a simple physical interpretation, since it is valid only in a
particular gauge. However, it will prove useful to us in the following sections for the
quantum mechanical study of the problem.
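Relation (27) can be verified numerically along a classical trajectory. The Python sketch below is an illustration under stated assumptions: arbitrary units, arbitrary parameter values, hypothetical function names. It builds $\mathbf{p} = \mu\mathbf{v} + q\mathbf{A}$ in the gauge (12) from the motion (4) and compares $x p_y - y p_x$ with the right-hand side of (27).

```python
import math


def canonical_l_z(t, q, b, mu, x0, y0, sigma, phi0):
    """x p_y - y p_x along the trajectory (4), with A taken in the gauge (12)."""
    omega_c = -q * b / mu                      # formula (5)
    arg = omega_c * t - phi0
    x = x0 + sigma * math.cos(arg)
    y = y0 + sigma * math.sin(arg)
    v_x = -sigma * omega_c * math.sin(arg)
    v_y = sigma * omega_c * math.cos(arg)
    a_x, a_y = -0.5 * y * b, 0.5 * x * b       # A = -(1/2) r x B with B along Oz
    p_x = mu * v_x + q * a_x                   # formula (14)
    p_y = mu * v_y + q * a_y
    return x * p_y - y * p_x


def predicted_l_z(q, b, x0, y0, sigma):
    """Right-hand side of formula (27)."""
    return 0.5 * q * b * (x0 ** 2 + y0 ** 2 - sigma ** 2)


if __name__ == "__main__":
    params = dict(q=-1.0, b=2.0, mu=0.5, x0=0.7, y0=-0.4, sigma=1.3, phi0=0.2)
    for t in (0.0, 0.3, 1.1, 5.0):
        print(t, canonical_l_z(t, **params), predicted_l_z(-1.0, 2.0, 0.7, -0.4, 1.3))
```

The value obtained does not depend on $t$, in agreement with the statement that $\mathcal{L}_z$ is a constant of the motion in this gauge.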


2. General quantum mechanical properties 
of a particle in a magnetic field 

a. QUANTIZATION. HAMILTONIAN 

Consider a particle placed in an arbitrary magnetic field described by the
vector potential $\mathbf{A}(x, y, z)$. In quantum mechanics, the vector potential becomes
an operator, a function of three observables, $X$, $Y$ and $Z$. The operator $H$, the
Hamiltonian of the particle, can be obtained from (15):

$$ H = \frac{1}{2\mu}\left[\mathbf{P} - q\mathbf{A}(X, Y, Z)\right]^2 \tag{28} $$

According to (14), the operator $\mathbf{V}$ associated with the velocity of the particle is given
by:

$$ \mathbf{V} = \frac{1}{\mu}\left[\mathbf{P} - q\mathbf{A}(X, Y, Z)\right] \tag{29} $$

which enables us to write $H$ in the form:

$$ H = \frac{\mu}{2}\, \mathbf{V}^2 \tag{30} $$


b. COMMUTATION RELATIONS 


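The observables $\mathbf{R}$ and $\mathbf{P}$ satisfy the canonical commutation relations:

$$ [X, P_x] = [Y, P_y] = [Z, P_z] = i\hbar \tag{31} $$

The other commutators between components of $\mathbf{R}$ and $\mathbf{P}$ are zero. Two components
of $\mathbf{P}$ therefore commute. However, we see from (29) that the same is not true
for $\mathbf{V}$; for example:

$$ [V_x, V_y] = -\frac{q}{\mu^2}\left\{[P_x, A_y(\mathbf{R})] + [A_x(\mathbf{R}), P_y]\right\} \tag{32} $$

This expression is easy to calculate, using the rule given in complement $B_{II}$ [cf. formula
(48)]:

$$ [V_x, V_y] = \frac{iq\hbar}{\mu^2}\left(\frac{\partial A_y}{\partial X} - \frac{\partial A_x}{\partial Y}\right) = \frac{iq\hbar}{\mu^2}\, B_z(\mathbf{R}) \tag{33-a} $$

Similarly, it can be shown that:

$$ [V_y, V_z] = \frac{iq\hbar}{\mu^2}\, B_x(\mathbf{R}) \tag{33-b} $$

$$ [V_z, V_x] = \frac{iq\hbar}{\mu^2}\, B_y(\mathbf{R}) \tag{33-c} $$

The magnetic field therefore enters explicitly into the commutation relations for the
velocity.

However, since $\mathbf{A}(\mathbf{R})$ commutes with $X$, $Y$ and $Z$, relation (29) implies that:

$$ [X, V_x] = \frac{1}{\mu}\, [X, P_x] = \frac{i\hbar}{\mu} \tag{34-a} $$

and, similarly:

$$ [Y, V_y] = [Z, V_z] = \frac{i\hbar}{\mu} \tag{34-b} $$

(the other commutators between a component of $\mathbf{R}$ and a component of $\mathbf{V}$ are zero).
From these relations, it can be deduced (cf. complement $C_{III}$) that:

$$ \Delta X \cdot \Delta V_x \geq \frac{\hbar}{2\mu} \tag{35} $$

(with analogous inequalities for the components along $Oy$ and $Oz$). The physical
consequences of the Heisenberg uncertainty relations are therefore not modified
by the presence of a magnetic field.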
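
Finally, let us calculate the commutation relations between the components
of the operator:

$$ \boldsymbol{\Lambda} = \mu\, \mathbf{R} \times \mathbf{V} \tag{36} $$

associated with the moment with respect to $O$ of the mechanical momentum*.
We obtain:

$$ [\Lambda_x, \Lambda_y] = \mu^2\, [YV_z - ZV_y,\ ZV_x - XV_z] $$
$$ = \mu^2\, Y\left\{[V_z, Z]\, V_x + Z\, [V_z, V_x]\right\} - \mu^2 Z^2\, [V_y, V_x] + \mu^2\, X\left\{Z\, [V_y, V_z] + [Z, V_z]\, V_y\right\} \tag{37} $$

that is, with (33) and (34) taken into account:

$$ [\Lambda_x, \Lambda_y] = i\hbar\left\{-\mu Y V_x + qYZB_y + qZ^2 B_z + qXZB_x + \mu X V_y\right\} \tag{38} $$

It follows that:

$$ [\Lambda_x, \Lambda_y] = i\hbar\left\{\Lambda_z + qZ\, \mathbf{R} \cdot \mathbf{B}(\mathbf{R})\right\} \tag{39} $$

(the other commutators can be obtained by cyclic permutation of the indices $x$, $y$
and $z$). When the field $\mathbf{B}$ is not zero, the commutation relations of $\boldsymbol{\Lambda}$ are completely
different from those of $\mathbf{L}$. The operator $\boldsymbol{\Lambda}$ therefore does not, a priori, possess the
properties of angular momenta proved in chapter VI.

* Of course, the components of the angular momentum $\mathbf{L} = \mathbf{R} \times \mathbf{P}$ always satisfy the usual
commutation relations.

c. PHYSICAL CONSEQUENCES

α. Evolution of $\langle \mathbf{R} \rangle$

The time variation of the mean position of the particle is given by Ehrenfest's
theorem:

$$ i\hbar\, \frac{d}{dt}\langle \mathbf{R} \rangle = \langle [\mathbf{R}, H] \rangle = \left\langle \left[\mathbf{R}, \frac{\mu}{2}\mathbf{V}^2\right] \right\rangle \tag{40} $$

[according to formula (30)]. Equations (34) are not difficult to interpret, since, substituted
into (40), they yield:

$$ \frac{d}{dt}\langle \mathbf{R} \rangle = \langle \mathbf{V} \rangle \tag{41} $$

As in the case in which the magnetic field is zero, the mean velocity is therefore equal
to the derivative of $\langle \mathbf{R} \rangle$. Equation (41) is the quantum mechanical analogue of (2).

β. Evolution of $\langle \mathbf{V} \rangle$. The Lorentz force

Let us calculate the time derivative of the mean value $\langle \mathbf{V} \rangle$ of the velocity:

$$ i\hbar\, \frac{d}{dt}\langle \mathbf{V} \rangle = \left\langle \left[\mathbf{V}, \frac{\mu}{2}\mathbf{V}^2\right] \right\rangle \tag{42} $$

Since, according to relations (33):

$$ [\mathbf{V}^2, V_x] = [V_x^2 + V_y^2 + V_z^2,\ V_x] $$
$$ = V_y\, [V_y, V_x] + [V_y, V_x]\, V_y + V_z\, [V_z, V_x] + [V_z, V_x]\, V_z $$
$$ = \frac{iq\hbar}{\mu^2}\left\{-V_y B_z(\mathbf{R}) - B_z(\mathbf{R}) V_y + V_z B_y(\mathbf{R}) + B_y(\mathbf{R}) V_z\right\} \tag{43} $$

it is easy to see that:

$$ \mu\, \frac{d}{dt}\langle \mathbf{V} \rangle = \langle \mathbf{F}(\mathbf{R}, \mathbf{V}) \rangle \tag{44} $$

where the operator $\mathbf{F}(\mathbf{R}, \mathbf{V})$ is defined by:

$$ \mathbf{F}(\mathbf{R}, \mathbf{V}) = \frac{q}{2}\left\{\mathbf{V} \times \mathbf{B}(\mathbf{R}) - \mathbf{B}(\mathbf{R}) \times \mathbf{V}\right\} \tag{45} $$

The last two relations are simply the analogues of the classical relations (1) and (3).
Here, we obtain a symmetrized expression for $\mathbf{F}(\mathbf{R}, \mathbf{V})$ (cf. chap. III, §B-5), since $\mathbf{R}$
and $\mathbf{V}$ do not commute.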


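γ. Evolution of $\langle \boldsymbol{\Lambda} \rangle$

Now let us evaluate:

$$ i\hbar\, \frac{d}{dt}\langle \boldsymbol{\Lambda} \rangle = \langle [\boldsymbol{\Lambda}, H] \rangle \tag{46} $$

To do so, let us calculate, for example, the commutator $[XV_y - YV_x, H]$:

$$ [XV_y - YV_x,\ H] = X\, [V_y, H] + [X, H]\, V_y - Y\, [V_x, H] - [Y, H]\, V_x $$
$$ = \frac{i\hbar}{\mu}\,(XF_y - YF_x) + i\hbar\,(V_x V_y - V_y V_x) \tag{47} $$

But $X$ and $V_y$ commute, as do $Y$ and $V_x$. The commutator we are calculating is
therefore equal to:

$$ [V_y X - V_x Y,\ H] = V_y\, [X, H] + [V_y, H]\, X - V_x\, [Y, H] - [V_x, H]\, Y $$
$$ = \frac{i\hbar}{\mu}\,(F_y X - F_x Y) + i\hbar\,(V_y V_x - V_x V_y) \tag{48} $$

Taking half the sum of these two expressions, we find $\frac{d}{dt}\langle \Lambda_z \rangle$ in the form:

$$ \frac{d}{dt}\langle \Lambda_z \rangle = \frac{1}{2}\, \langle XF_y - YF_x - F_x Y + F_y X \rangle \tag{49} $$

Analogous arguments give the derivative of $\langle \Lambda_x \rangle$ and $\langle \Lambda_y \rangle$; finally:

$$ \frac{d}{dt}\langle \boldsymbol{\Lambda} \rangle = \frac{1}{2}\, \langle \mathbf{R} \times \mathbf{F}(\mathbf{R}, \mathbf{V}) - \mathbf{F}(\mathbf{R}, \mathbf{V}) \times \mathbf{R} \rangle \tag{50} $$

The classical analogue of this relation is:

$$ \frac{d\boldsymbol{\lambda}}{dt} = \mathbf{r} \times \mathbf{f}(\mathbf{r}, \mathbf{v}) \tag{51} $$

which expresses a well-known theorem: the time derivative of the moment of
the mechanical momentum with respect to a fixed point $O$ is equal to the
moment with respect to $O$ of the force exerted on the particle.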


3. Case of a uniform magnetic field 


When the magnetic field is uniform, the preceding general study can easily
be pursued further. We choose the direction of the field $\mathbf{B}$ as the $Oz$ axis. The
commutation relations (33) then become, using definition (5):

$$ [V_x, V_y] = -\, i\, \frac{\hbar\omega_c}{\mu} \tag{52-a} $$

$$ [V_y, V_z] = [V_z, V_x] = 0 \tag{52-b} $$


comment:

Applying the results of complement $C_{III}$ to $V_x$ and $V_y$, we can see from
(52-a) that their root-mean-square deviations satisfy:

$$ \Delta V_x \cdot \Delta V_y \geq \frac{\hbar\, |\omega_c|}{2\mu} \tag{53} $$

The components of the velocity $\mathbf{V}_\perp$ are therefore incompatible physical
quantities.

a. EIGENVALUES OF THE HAMILTONIAN 

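By analogy with (16), $H$ can be written in the form:

$$ H = H_\perp + H_\parallel \tag{54} $$

with:

$$ H_\perp = \frac{\mu}{2}\left(V_x^2 + V_y^2\right) \tag{55-a} $$

$$ H_\parallel = \frac{\mu}{2}\, V_z^2 \tag{55-b} $$

According to (52-b):

$$ [H_\perp, H_\parallel] = 0 \tag{56} $$

We can now look for a basis of eigenvectors common to $H_\perp$ (eigenvalues $E_\perp$) and
$H_\parallel$ (eigenvalues $E_\parallel$); they will automatically be eigenvectors of $H$, with the
eigenvalues:

$$ E = E_\perp + E_\parallel \tag{57} $$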
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a. Eigenvalues of H /f 

The eigenvectors of the operator $V_z$ are also eigenvectors of $H_\parallel$. Now, $Z$
and $V_z$ are two Hermitian operators which satisfy the relation:

    $[Z, V_z] = \frac{i\hbar}{\mu}$    (58)

We can therefore apply to them the results of complement $E_{II}$; in particular,
the spectrum of $V_z$ includes all the real numbers.

Consequently, the eigenvalues of $H_\parallel$ are of the form:

    $E_\parallel = \frac{\mu}{2} v_z^2$    (59)

where $v_z$ is an arbitrary real constant. The spectrum of $H_\parallel$ is therefore continuous:
the energy $E_\parallel$ can take on any positive value or zero.

The interpretation of this result is obvious: $H_\parallel$ describes the kinetic energy
of a free particle moving along Oz (as in classical mechanics; § 1-a).


β. Eigenvalues of $H_\perp$

We shall assume, for example, that the particle under consideration has a
negative charge $q$; the cyclotron frequency $\omega_c$ is then positive [formula (5)]*.
We set:

    $\hat{Q} = \sqrt{\frac{\mu}{\hbar\omega_c}} \, V_y$

    $\hat{S} = \sqrt{\frac{\mu}{\hbar\omega_c}} \, V_x$    (60)

Relation (52-a) can then be written:

    $[\hat{Q}, \hat{S}] = i$    (61)

and $H_\perp$ becomes:

    $H_\perp = \frac{\hbar\omega_c}{2} (\hat{Q}^2 + \hat{S}^2)$    (62)

$H_\perp$ then takes on the form of the Hamiltonian of a one-dimensional harmonic
oscillator [cf. chap. V, relation (B-4)]. $\hat{Q}$ and $\hat{S}$, which satisfy (61), play the roles
of the position $\hat{X}$ and momentum $\hat{P}$ of this oscillator.

The arguments set forth in § B-2 of chapter V for the operators $\hat{X}$ and $\hat{P}$
can be repeated here for $\hat{Q}$ and $\hat{S}$. For example, it can easily be shown that
if $|\varphi_\perp\rangle$ is an eigenvector of $H_\perp$:

    $H_\perp |\varphi_\perp\rangle = E_\perp |\varphi_\perp\rangle$    (63)


* For a positive charge $q$, one can keep the convention of positive $\omega_c$ by choosing the direction
of the Oz axis opposite to the magnetic field.




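the kets:

    $|\varphi'_\perp\rangle = \frac{1}{\sqrt{2}} (\hat{Q} + i\hat{S}) |\varphi_\perp\rangle$    (64-a)

    $|\varphi''_\perp\rangle = \frac{1}{\sqrt{2}} (\hat{Q} - i\hat{S}) |\varphi_\perp\rangle$    (64-b)

are also eigenvectors of $H_\perp$:

    $H_\perp |\varphi'_\perp\rangle = (E_\perp - \hbar\omega_c) |\varphi'_\perp\rangle$    (65-a)

    $H_\perp |\varphi''_\perp\rangle = (E_\perp + \hbar\omega_c) |\varphi''_\perp\rangle$    (65-b)

From this we deduce that the possible values of $E_\perp$ are given by:

    $E_\perp = (n + \frac{1}{2}) \hbar\omega_c$    (66)

where $n$ is a positive integer or zero.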
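
Relations (61), (62), (64-a) and (66) can be checked numerically on a truncated number basis. The sketch below is an illustration only and is not part of the original text: the function name `ladder_check`, the truncation size `dim` and the choice of units ($\hbar\omega_c = 1$) are assumptions. The last basis state is discarded in the comparisons because the truncation spoils it.

```python
import numpy as np

def ladder_check(dim=12):
    """Truncated-matrix check of (61), (62), (64-a) and (66), in units of hbar*omega_c."""
    # Destruction operator in a truncated number basis: a|n> = sqrt(n)|n-1>.
    a = np.diag(np.sqrt(np.arange(1, dim)), k=1)
    a_dag = a.conj().T
    # Q and S play the roles of position and momentum, so that [Q, S] = i.
    Q = (a + a_dag) / np.sqrt(2)
    S = 1j * (a_dag - a) / np.sqrt(2)
    comm = Q @ S - S @ Q
    # Relation (62) with hbar*omega_c = 1.
    H_perp = 0.5 * (Q @ Q + S @ S)
    # The combination appearing in (64-a); it should lower n by one unit.
    lower = (Q + 1j * S) / np.sqrt(2)
    return comm, H_perp, lower, a

comm, H_perp, lower, a = ladder_check()
levels = np.real(np.diag(H_perp))[:-1]   # last entry is a truncation artifact
```

Away from the truncation edge, `levels` reproduces the ladder $n + 1/2$ of (66), and `lower` coincides with the destruction operator, which is why each application of $(\hat{Q} + i\hat{S})/\sqrt{2}$ removes one quantum $\hbar\omega_c$.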

γ. Eigenvalues of $H$

According to the preceding results, the eigenvalues of the total Hamiltonian
$H$ are of the form:

    $E(n, v_z) = (n + \frac{1}{2}) \hbar\omega_c + \frac{1}{2} \mu v_z^2$    (67)

The corresponding levels are called Landau levels.

For a given value of $v_z$, all possible values of $n$ (positive integers or zero) are
actually found. By the repeated action of the operators $\frac{1}{\sqrt{2}}(\hat{Q} \pm i\hat{S})$ on an
eigenvector of $H$ of eigenvalue $E(n, v_z)$, one can obtain, from (65), a state of energy
$E(n', v_z)$, where $n'$ is any non-negative integer but where $v_z$ has not changed (since $\hat{Q}$
and $\hat{S}$ commute with $H_\parallel$). Therefore, although the energy of the motion along Oz is
not quantized, that of the motion projected onto xOy is.
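
To get a feeling for the orders of magnitude in (67), the following sketch evaluates the Landau energies for an electron. It is only an illustration: the function name `landau_energy` and its parameters are hypothetical, the constants are the rounded values quoted in the table of constants, and the sign convention $\omega_c = -qB/\mu$ of formula (5) is assumed with the field along Oz.

```python
import math

HBAR = 1.054589e-34        # J s
E_CHARGE = 1.602189e-19    # C, magnitude of the electron charge
M_ELECTRON = 9.10953e-31   # kg

def landau_energy(n, v_z, b_field, mass=M_ELECTRON, charge=-E_CHARGE):
    """Energy E(n, v_z) of relation (67), in joules."""
    if n < 0 or n != int(n):
        raise ValueError("n must be a non-negative integer")
    # Formula (5): omega_c = -qB/mu, positive for a negative charge and B > 0.
    omega_c = -charge * b_field / mass
    if omega_c <= 0:
        raise ValueError("choose the Oz axis so that omega_c is positive")
    return (n + 0.5) * HBAR * omega_c + 0.5 * mass * v_z ** 2

# Spacing between adjacent Landau levels for an electron in a 1 tesla field, in eV.
spacing_ev = (landau_energy(1, 0.0, 1.0) - landau_energy(0, 0.0, 1.0)) / E_CHARGE
```

For a 1 T field the spacing is of the order of $10^{-4}$ eV, which is small compared with thermal energies at room temperature ($k_B T \simeq 1/40$ eV).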


COMMENT: 


We showed in chapter V (§ B-3) that the energy levels of the one-dimensional
harmonic oscillator are non-degenerate in $\mathcal{E}_x$. The situation is
different here, since the particle under study is moving in three-dimensional
space. Since $\frac{1}{\sqrt{2}}(\hat{Q} + i\hat{S}) = \sqrt{\frac{\mu}{2\hbar\omega_c}} \, (V_y + iV_x)$ is the destruction operator of
a quantum $\hbar\omega_c$, the eigenvectors of $H_\perp$ corresponding to $n = 0$ are solutions
of the equation:

    $(V_y + iV_x) |\varphi\rangle = 0$    (68)

On the one hand, vectors which are solutions of (68) can be eigenvectors of
$H_\parallel$ with an arbitrary (positive) eigenvalue. On the other, even for a fixed value
of $v_z$, equation (68) is a partial differential equation with respect to $x$ and $y$
which has an infinite number of solutions. The energies $E(n = 0, v_z)$ are
therefore infinitely degenerate. By using the creation operator for a quantum,
it can easily be shown that this is true for all the levels $E(n, v_z)$, for any $n$
(non-negative integer).


b. THE OBSERVABLES IN A PARTICULAR GAUGE 


In order to state the above results more precisely, we shall calculate the
stationary states of the system. This will enable us to study their physical
properties. It is now necessary to choose a gauge; we shall choose the one given
by (12). The components of the velocity are then:

    $V_x = \frac{P_x}{\mu} - \frac{\omega_c}{2} Y$

    $V_y = \frac{P_y}{\mu} + \frac{\omega_c}{2} X$

    $V_z = \frac{P_z}{\mu}$    (69)


α. The Hamiltonians $H_\perp$ and $H_\parallel$. Relation with the two-dimensional harmonic
oscillator

Substituting (69) into (55), we obtain:

    $H_\perp = \frac{P_x^2 + P_y^2}{2\mu} + \frac{\omega_c}{2} L_z + \frac{\mu\omega_c^2}{8} (X^2 + Y^2)$    (70-a)

    $H_\parallel = \frac{P_z^2}{2\mu}$    (70-b)

where $L_z$ is the component along Oz of the angular momentum $\mathbf{L} = \mathbf{R} \times \mathbf{P}$.

In the $\{ |\mathbf{r}\rangle \}$ representation, $H_\parallel$ is an operator which acts only on the
variable $z$, while $H_\perp$ acts only on the variables $x$ and $y$. We can therefore find
a basis of eigenvectors of $H$ by solving in $\mathcal{E}_z$ the eigenvalue equation of $H_\parallel$ and then,
in $\mathcal{E}_{xy}$, that of $H_\perp$. All we must then do is take the tensor products of the vectors
obtained.
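
The passage from (55-a) and (69) to (70-a) uses only the fact that $X$, $P_x$ commute with $Y$, $P_y$. This can be verified with finite matrices; the sketch below is an illustration under stated assumptions (the function name `h_perp_two_forms`, the truncation `dim` and the units $\mu = \omega_c = \hbar = 1$ are not from the original text). Each Cartesian mode is represented on a truncated oscillator basis and the two modes are combined by a Kronecker product, so that operators of different modes commute exactly.

```python
import numpy as np

def h_perp_two_forms(dim=8, mu=1.0, omega_c=1.0, hbar=1.0):
    """Return H_perp computed from (55-a)+(69) and from (70-a) on a truncated basis."""
    beta = np.sqrt(mu * omega_c / (2 * hbar))       # length scale, as in (78)
    a = np.diag(np.sqrt(np.arange(1, dim)), k=1)    # one-mode destruction operator
    one = np.eye(dim)
    x1 = (a + a.T) / (np.sqrt(2) * beta)            # one-mode position
    p1 = 1j * hbar * beta * (a.T - a) / np.sqrt(2)  # one-mode momentum
    X, Px = np.kron(x1, one), np.kron(p1, one)      # act on the x mode
    Y, Py = np.kron(one, x1), np.kron(one, p1)      # act on the y mode
    Vx = Px / mu - 0.5 * omega_c * Y                # relation (69)
    Vy = Py / mu + 0.5 * omega_c * X
    lhs = 0.5 * mu * (Vx @ Vx + Vy @ Vy)            # relation (55-a)
    Lz = X @ Py - Y @ Px
    rhs = ((Px @ Px + Py @ Py) / (2 * mu)
           + 0.5 * omega_c * Lz
           + mu * omega_c ** 2 / 8 * (X @ X + Y @ Y))   # relation (70-a)
    return lhs, rhs

lhs, rhs = h_perp_two_forms()
```

The two matrices agree to rounding error whatever the truncation, because the identity never uses the commutator $[X, P_x]$, which is the only relation a finite basis cannot reproduce.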

Actually, the eigenvalue equation of $H_\parallel$ simply leads to the wave functions:

    $\varphi(z) = \frac{1}{\sqrt{2\pi\hbar}} \, e^{i p_z z/\hbar}$    (71)

with:

    $E_\parallel = \frac{p_z^2}{2\mu}$    (72)

[we again find (59)]. Therefore, we shall concentrate on solving the eigenvalue
equation of $H_\perp$ in $\mathcal{E}_{xy}$; the wave functions we shall be considering now depend
on $x$ and $y$, and not on $z$.

Comparing (70-a) with expression (12-a) of complement $D_{VI}$, we see that $H_\perp$
can be expressed simply in terms of the Hamiltonian $H_{xy}$ of a two-dimensional
harmonic oscillator:

    $H_\perp = H_{xy} + \frac{\omega_c}{2} L_z$    (73)

if we choose for the value of the constant $\omega$ which enters into $H_{xy}$:

    $\omega = \frac{\omega_c}{2}$    (74)

Now, in complement $D_{VI}$, we saw that $H_{xy}$ and $L_z$ form a C.S.C.O. in $\mathcal{E}_{xy}$, and we
constructed a basis of eigenvectors $|\chi_{n_d, n_g}\rangle$ common to these two observables
[cf. formula (47) of $D_{VI}$]. The $|\chi_{n_d, n_g}\rangle$ are also eigenvectors of $H_\perp$; complement $D_{VI}$
therefore gives the solutions to the eigenvalue equation of $H_\perp$.


COMMENTS:

(i) In § 3-a, we saw that $H_\perp$ can be written in a form which is analogous to that
of the Hamiltonian of a one-dimensional harmonic oscillator. Here, we find
that, in a particular gauge, this same operator $H_\perp$ is also simply related to
the Hamiltonian $H_{xy}$ of a two-dimensional harmonic oscillator. These two
results are not contradictory; they simply correspond to two different
decompositions of the same Hamiltonian, which must obviously lead to the
same physical conclusions.

(ii) One must not lose sight of the fact that the Hamiltonian $H_\perp$ involves a
physical problem which is completely different from that of the two-dimensional
harmonic oscillator; the charged particle is subjected to a vector
potential (describing a uniform magnetic field) and not a harmonic scalar
potential (which would describe, for example, a non-uniform electric field).
It so happens that, in the gauge chosen, the effects of the magnetic field can be
likened to those of a fictitious harmonic scalar potential.

β. Expression for the observables in terms of the creation and destruction
operators of circular quanta

First of all, we shall express the observables describing the quantities
associated with the particle in terms of the operators $a_d$ and $a_g$ defined by
equations (40) of complement $D_{VI}$ and their adjoints $a_d^\dagger$ and $a_g^\dagger$ (we shall also
use the operators $N_d = a_d^\dagger a_d$ and $N_g = a_g^\dagger a_g$).

Substituting relations (46) of $D_{VI}$ into (73), we obtain*:

    $H_\perp = (N_d + \frac{1}{2}) \hbar\omega_c$    (75)


* Recall that we have assumed $\omega_c$ to be positive. If $\omega_c$ were negative, the indices d and g would have
to be inverted in a certain number of the following formulas; for example, (75) would become:

    $H_\perp = (N_g + \frac{1}{2}) \hbar|\omega_c|$.

The energy associated with the state $|\chi_{n_d, n_g}\rangle$ is therefore:

    $E_\perp = (n_d + \frac{1}{2}) \hbar\omega_c$    (76)

as we found in (66). Moreover, since $E_\perp$ is independent of $n_g$, we see that all the
eigenvalues of $H_\perp$ are infinitely degenerate.

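Using relations (23) and (40) of $D_{VI}$, we can see that:

    $X = \frac{1}{2\beta} (a_d + a_d^\dagger + a_g + a_g^\dagger)$

    $Y = \frac{i}{2\beta} (a_d - a_d^\dagger - a_g + a_g^\dagger)$    (77)

where, using (74), $\beta$ is defined by:

    $\beta = \sqrt{\frac{\mu\omega_c}{2\hbar}}$    (78)

Similarly:

    $P_x = \frac{i\hbar\beta}{2} (-a_d + a_d^\dagger - a_g + a_g^\dagger)$

    $P_y = \frac{\hbar\beta}{2} (a_d + a_d^\dagger - a_g - a_g^\dagger)$    (79)

These expressions, substituted into (69), yield:

    $V_x = -\frac{i\omega_c}{2\beta} (a_d - a_d^\dagger)$

    $V_y = \frac{\omega_c}{2\beta} (a_d + a_d^\dagger)$    (80)

Since $a_d$ and $a_d^\dagger$ do not commute with $N_d$, it can be seen by using (75) that, as in
classical mechanics, $V_x$ and $V_y$ are not constants of the motion; in addition, using
the commutation relations of $a_d$ and $a_d^\dagger$, we indeed obtain (52-a).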

It is also interesting to study the quantum mechanical operators associated
with the various variables introduced in the description of the classical motion
(§ 1): the coordinates $(x_0, y_0)$ of the center $C_0$ of the classical trajectory, the
components $(x', y')$ of the vector $\mathbf{C_0Q}$, etc. As above, we shall denote each of
these operators by the capital letter corresponding to the small letter which
designates the corresponding classical variable. By analogy with (10), we therefore
set:

    $X_0 = X - \frac{1}{\omega_c} V_y = \frac{1}{2\beta} (a_g + a_g^\dagger)$    (81-a)

    $Y_0 = Y + \frac{1}{\omega_c} V_x = \frac{i}{2\beta} (a_g^\dagger - a_g)$    (81-b)

The operators $a_g$ and $a_g^\dagger$ commute with $N_d$; it follows that $X_0$ and $Y_0$ are constants
of the motion. Formulas (81) also imply that:

    $[X_0, Y_0] = \frac{i}{2\beta^2} = \frac{i\hbar}{\mu\omega_c}$    (82)

Consequently, $X_0$ and $Y_0$ are incompatible physical quantities, their root-mean-square
deviations being related by:

    $\Delta X_0 \, \Delta Y_0 \geq \frac{\hbar}{2\mu\omega_c}$    (83)

We also define:

    $X' = X - X_0 = \frac{1}{2\beta} (a_d + a_d^\dagger)$

    $Y' = Y - Y_0 = \frac{i}{2\beta} (a_d - a_d^\dagger)$    (84)

We immediately see that $X'$ and $Y'$, as in classical mechanics, are not constants
of the motion; moreover, $X'$ and $Y'$ are simply proportional to $V_y$ and $V_x$ respectively:

    $V_x = -\omega_c Y'$

    $V_y = \omega_c X'$    (85)

like the corresponding classical variables [formula (9)]. According to (53),
equations (85) imply:

    $\Delta X' \, \Delta Y' \geq \frac{\hbar}{2\mu\omega_c}$    (86)
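
Relations (75) and (80) to (82) lend themselves to a direct matrix check. The sketch below is an illustration and not part of the original text: the function name `circular_quanta`, the truncation `dim` and the units $\mu = \omega_c = \hbar = 1$ are assumptions. The operators $a_d$ and $a_g$ are represented on a truncated basis $|n_d\rangle \otimes |n_g\rangle$; identities that rely on $[a, a^\dagger] = 1$ hold only away from the truncation edge.

```python
import numpy as np

def circular_quanta(dim=10, mu=1.0, omega_c=1.0, hbar=1.0):
    """Build X, Y, P_x, P_y from (77), (79) and derive V, X_0, Y_0 and H_perp."""
    beta = np.sqrt(mu * omega_c / (2 * hbar))                 # relation (78)
    a = np.diag(np.sqrt(np.arange(1, dim)), k=1)
    one = np.eye(dim)
    a_d, a_g = np.kron(a, one), np.kron(one, a)               # index = n_d * dim + n_g
    dag = lambda m: m.conj().T
    X = (a_d + dag(a_d) + a_g + dag(a_g)) / (2 * beta)        # relation (77)
    Y = 1j * (a_d - dag(a_d) - a_g + dag(a_g)) / (2 * beta)
    Px = 1j * hbar * beta / 2 * (-a_d + dag(a_d) - a_g + dag(a_g))   # relation (79)
    Py = hbar * beta / 2 * (a_d + dag(a_d) - a_g - dag(a_g))
    Vx = Px / mu - 0.5 * omega_c * Y                          # relation (69)
    Vy = Py / mu + 0.5 * omega_c * X
    X0 = X - Vy / omega_c                                     # relation (81-a)
    Y0 = Y + Vx / omega_c                                     # relation (81-b)
    H = 0.5 * mu * (Vx @ Vx + Vy @ Vy)                        # relation (55-a)
    return {"a_d": a_d, "a_g": a_g, "Vx": Vx, "Vy": Vy,
            "X0": X0, "Y0": Y0, "H": H, "beta": beta}

ops = circular_quanta()
# X_0 and Y_0 contain only a_g, so they commute with H_perp (constants of the motion).
commutator_x0_h = ops["X0"] @ ops["H"] - ops["H"] @ ops["X0"]
```

In this representation the velocity involves only $a_d$ and the center of the orbit only $a_g$, which makes the infinite degeneracy in $n_g$ and the conservation of $X_0$, $Y_0$ visible at a glance.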


Let $\Sigma^2$ be the operator corresponding to $\sigma^2$ (square of the radius of the classical
trajectory):

    $\Sigma^2 = (X - X_0)^2 + (Y - Y_0)^2$    (87)

According to (81), we have:

    $\Sigma^2 = \frac{1}{\omega_c^2} (V_x^2 + V_y^2) = \frac{2}{\mu\omega_c^2} H_\perp$    (88)

$\Sigma^2$ is therefore a constant of the motion, as is $\sigma^2$ in classical mechanics.

Finally, the operator associated with the moment of the mechanical momentum $\mu\mathbf{V}$ with
respect to the center $C_0$ of the trajectory is:

    $\Theta_z = \mu [(X - X_0) V_y - (Y - Y_0) V_x]$    (89)

and formulas (81) indicate that:

    $\Theta_z = \frac{2}{\omega_c} H_\perp$    (90)

as in (23). $\Theta_z$ is therefore a constant of the motion. On the other hand, the operator $\Lambda_z$, the
component along Oz of $\mu \mathbf{R} \times \mathbf{V}$, is:

    $\Lambda_z = \frac{2}{\omega_c} H_\perp + \hbar (a_d a_g + a_d^\dagger a_g^\dagger)$    (91)

and therefore does not commute with $H_\perp$.


c. THE STATIONARY STATES 

We indicated above that the eigenvalues of the Hamiltonian $H_\perp$ are all infinitely degenerate
in $\mathcal{E}_{xy}$. For each positive or zero integer $n$, there exists an infinite-dimensional subspace $\mathcal{E}_{xy}^{n}$
of $\mathcal{E}_{xy}$, all of whose kets are eigenvectors of $H_\perp$ with the same eigenvalue $(n + 1/2)\hbar\omega_c$. In this
section, we shall study different bases which can be chosen in each of these subspaces.
First, we shall indicate the general properties of the stationary states, valid for any basis of
eigenstates of $H_\perp$.


α. General properties

Relations (88) and (90) show that an arbitrary stationary state is necessarily an
eigenvector of $\Sigma^2$ and $\Theta_z$; the corresponding physical quantities are therefore always
well-defined in such a state and are equal to:

    $(2n + 1) \frac{\hbar}{\mu\omega_c}$ for $\Sigma^2$

    $(2n + 1) \hbar$ for $\Theta_z$    (92)

The values of $\Sigma^2$ and $\Theta_z$ are proportional to the energy; this corresponds to the classical
description of the motion (cf. § 1).

It follows from (80) and (84) that $X'$, $Y'$, $V_x$ and $V_y$ have no matrix elements inside a given
subspace $\mathcal{E}_{xy}^{n}$; it follows, for a stationary state, that:

    $\langle V_x \rangle = \langle V_y \rangle = 0$

    $\langle X' \rangle = \langle Y' \rangle = 0$    (93)

Nevertheless, since $V_x$ and $V_y$ (and therefore $X'$ and $Y'$) are not constants of the motion, the
corresponding physical quantities do not have perfectly well-defined values in a stationary
state. In fact, by using (80), (84) and the properties of the one-dimensional harmonic oscillator
[cf. chap. V, relation (D-5)], it can be shown that:

    $\Delta V_x = \omega_c \, \Delta Y' = \sqrt{(n + \frac{1}{2}) \frac{\hbar\omega_c}{\mu}}$

    $\Delta V_y = \omega_c \, \Delta X' = \sqrt{(n + \frac{1}{2}) \frac{\hbar\omega_c}{\mu}}$    (94)

in agreement with (53). Moreover, we see that the only stationary states in which the
product $\Delta V_x \, \Delta V_y$ (or $\Delta X' \, \Delta Y'$) takes on its minimal value are the ground states ($n = 0$).
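
The deviations (94) follow from the matrix elements of $a_d$ and $a_d^\dagger$ in a number state. As a hedged numerical illustration (the function name `velocity_spreads`, the default truncation and the units $\mu = \omega_c = \hbar = 1$ are assumptions, not taken from the text), one can evaluate them directly from (84) and (85):

```python
import numpy as np

def velocity_spreads(n, dim=None, mu=1.0, omega_c=1.0, hbar=1.0):
    """Root-mean-square deviations of V_x and V_y in a state with n_d = n."""
    dim = n + 3 if dim is None else dim          # large enough not to truncate a†|n>
    beta = np.sqrt(mu * omega_c / (2 * hbar))    # relation (78)
    a = np.diag(np.sqrt(np.arange(1, dim)), k=1) # destruction operator a_d
    x_prime = (a + a.T) / (2 * beta)             # relation (84)
    y_prime = 1j * (a - a.T) / (2 * beta)
    v_x, v_y = -omega_c * y_prime, omega_c * x_prime   # relation (85)
    ket = np.zeros(dim)
    ket[n] = 1.0

    def spread(op):
        mean = ket @ op @ ket
        mean_sq = ket @ op @ op @ ket
        return np.sqrt(np.real(mean_sq - mean ** 2))

    return spread(v_x), spread(v_y)

dv_x, dv_y = velocity_spreads(0)
```

Only the $a_d$ mode is needed here because, by (80), the velocity does not involve $a_g$; the result is therefore the same for every state of the degenerate level.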
COMMENT:

The various ground states are solutions of the equation:

    $a_d |\varphi\rangle = 0$    (95-a)

that is, using (80):

    $(V_y + iV_x) |\varphi\rangle = 0$    (95-b)

as we found in (68).

β. The states $|\chi_{n_d, n_g}\rangle$

As we saw in complement $D_{VI}$, the fact that $H_\perp$ and $L_z$ form a C.S.C.O. in $\mathcal{E}_{xy}$ can be
used to construct a basis of eigenvectors common to these two observables. This basis is
composed of the vectors $|\chi_{n_d, n_g}\rangle$ since, according to (75) and formula (46) of complement $D_{VI}$:

    $H_\perp |\chi_{n_d, n_g}\rangle = (n_d + \frac{1}{2}) \hbar\omega_c \, |\chi_{n_d, n_g}\rangle$    (96-a)

    $L_z |\chi_{n_d, n_g}\rangle = (n_d - n_g) \hbar \, |\chi_{n_d, n_g}\rangle$    (96-b)

The subspace $\mathcal{E}_{xy}^{n}$ defined by specifying the (non-negative) integer $n$ is therefore spanned
by the set of vectors $|\chi_{n_d, n_g}\rangle$ such that $n_d = n$. The eigenvalues of $L_z$ associated with these
different vectors are of the form $m\hbar$, and, for fixed $n$, $m$ is an integer which can vary between $-\infty$
and $n$ (for example, all the ground states correspond to negative or zero values of $m$; this is related to
the hypothesis $\omega_c > 0$ posed above).

The wave functions associated with the states $|\chi_{n_d, n_g}\rangle$ were calculated in complement $D_{VI}$
(§ 3-d).

Note that the states $|\chi_{n_d, n_g}\rangle$ are eigenstates of the operator $L_z$, but not of the operator $\Lambda_z$
associated with the moment of the mechanical momentum. This can be seen directly from
formula (91).

In a state $|\chi_{n_d, n_g}\rangle$, the mean values $\langle X_0 \rangle$ and $\langle Y_0 \rangle$ are zero, according to (81).
However, neither $X_0$ nor $Y_0$ corresponds to perfectly well-defined physical quantities, since,
by using the properties of the one-dimensional harmonic oscillator, it can easily be shown that,
in a state $|\chi_{n_d, n_g}\rangle$:

    $\Delta X_0 = \sqrt{(n_g + \frac{1}{2}) \frac{\hbar}{\mu\omega_c}}$

    $\Delta Y_0 = \sqrt{(n_g + \frac{1}{2}) \frac{\hbar}{\mu\omega_c}}$    (97)

The minimal value of the product $\Delta X_0 \, \Delta Y_0$ is therefore attained for the states $|\chi_{n_d, n_g = 0}\rangle$,
that is, the states of each energy level $E_\perp = (n + 1/2)\hbar\omega_c$ for which $L_z$ takes on its maximal
value $n\hbar$ [cf. (96)].

However, let us define the operator:

    $\Gamma^2 = X_0^2 + Y_0^2$    (98)

It corresponds to the square of the distance from the center $C_0$ of the trajectory to the origin.
Using (81), we easily find:

    $\Gamma^2 = \frac{\hbar}{\mu\omega_c} (a_g a_g^\dagger + a_g^\dagger a_g) = \frac{\hbar}{\mu\omega_c} (2N_g + 1)$

The state $|\chi_{n_d, n_g}\rangle$ is therefore an eigenstate of $\Gamma^2$ with the eigenvalue $\frac{\hbar}{\mu\omega_c}(2n_g + 1)$; the fact
that this value can never go to zero is related to the non-commutativity of the operators $X_0$
and $Y_0$.



γ. Other types of stationary states

Any linear combination of vectors $|\chi_{n_d, n_g}\rangle$ associated with the same value of $n_d$ is an
eigenstate of $H_\perp$ and therefore possesses the properties stated in § 3-c-α. By a suitable choice
of the coefficients of the linear combination, one can obtain stationary states which possess
other interesting properties as well.

We know, for example (§ 3-b-β), that $X_0$ and $Y_0$ are constants of the motion. However,
since $X_0$ and $Y_0$ do not commute, there are no eigenstates common to these two operators.
This means that, in quantum mechanics, it is not possible to obtain a state in which the two
coordinates of the point $C_0$ are known.

To construct the eigenstates common to $H_\perp$ and $X_0$, we can use the properties of the
one-dimensional harmonic oscillator; formula (81-a) shows that $X_0$ has the same expression,
to within a constant factor, as the position operator $\hat{X}_g$ of a one-dimensional oscillator whose
destruction operator is $a_g$:

    $X_0 = \frac{1}{\beta\sqrt{2}} \hat{X}_g$    (102)

Since we know the wave functions $\varphi_k(\hat{x})$ associated with the stationary states $|\varphi_k\rangle$ of a
one-dimensional harmonic oscillator (cf. complement $B_V$, § 2-b), we know how to write the
eigenvectors $|\hat{x}\rangle$ of the position operator as linear combinations of the states $|\varphi_k\rangle$:

    $|\hat{x}\rangle = \sum_{k=0}^{\infty} |\varphi_k\rangle \langle \varphi_k | \hat{x} \rangle = \sum_{k=0}^{\infty} \varphi_k^*(\hat{x}) \, |\varphi_k\rangle$    (103)

In order to obtain the eigenstates common to $H_\perp$ and $X_0$, it suffices to apply this result to the
states $|\chi_{n_d = n, n_g = k}\rangle$; the vector:

    $|\xi_{n, x_0}\rangle = \sum_{k=0}^{\infty} \varphi_k^*(\beta\sqrt{2}\, x_0) \, |\chi_{n_d = n, n_g = k}\rangle$    (104)

is a common eigenvector of $H_\perp$ and $X_0$ with the eigenvalues $(n + 1/2)\hbar\omega_c$ and $x_0$.
The eigenstates $|\eta_{n, y_0}\rangle$ common to $H_\perp$ and $Y_0$ can be found in an analogous fashion.
Relation (81-b) indicates that $Y_0$ is proportional to the momentum operator $\hat{P}_g$ of the fictitious
one-dimensional oscillator just used:

    $Y_0 = \frac{1}{\beta\sqrt{2}} \hat{P}_g$    (105)

Consequently [see formula (20) of complement $D_V$]:

    $|\eta_{n, y_0}\rangle = \sum_{k=0}^{\infty} i^k \, \varphi_k^*(\beta\sqrt{2}\, y_0) \, |\chi_{n_d = n, n_g = k}\rangle$    (106)


We have just constructed the states in which either $X_0$ or $Y_0$ is perfectly well-defined.
We can also determine the stationary states in which the product $\Delta X_0 \, \Delta Y_0$ reaches its minimal
value, given by (83). For a one-dimensional harmonic oscillator, we studied in complement $G_V$
the states in which the product $\Delta\hat{X} \, \Delta\hat{P}$ is minimal; these are the quasi-classical states, given by:

    $|\alpha\rangle = \sum_{k=0}^{\infty} c_k(\alpha) \, |\varphi_k\rangle$    (107)

with:

    $c_k(\alpha) = \frac{\alpha^k}{\sqrt{k!}} \, e^{-|\alpha|^2/2}$    (108)

In these states:

    $\Delta\hat{X} = \Delta\hat{P} = \frac{1}{\sqrt{2}}$    (109)

It follows that, in the case which interests us here, the state:

    $|\theta_{n, \alpha_0}\rangle = \sum_{k=0}^{\infty} c_k(\alpha_0) \, |\chi_{n_d = n, n_g = k}\rangle$    (110)

yields, for $X_0$ and $Y_0$, root-mean-square deviations:

    $\Delta X_0 = \Delta Y_0 = \frac{1}{2\beta}$    (111)

The product $\Delta X_0 \, \Delta Y_0$ is therefore minimal.
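
The coefficients (108) and the deviations (109) can be checked numerically. The following sketch is an illustration under stated assumptions: the function names `coherent_coefficients` and `spreads`, the cutoff `kmax` and the use of the dimensionless operators $\hat{X} = (a + a^\dagger)/\sqrt{2}$, $\hat{P} = i(a^\dagger - a)/\sqrt{2}$ are choices made here, and the sum over $k$ is truncated, which is harmless as long as $|\alpha|^2$ is much smaller than `kmax`.

```python
import math
import numpy as np

def coherent_coefficients(alpha, kmax=60):
    """Coefficients c_k(alpha) of relation (108) for k = 0 .. kmax-1."""
    k = np.arange(kmax)
    # sqrt(k!) computed through lgamma to avoid large intermediate factorials.
    sqrt_fact = np.exp(0.5 * np.array([math.lgamma(j + 1) for j in k]))
    return np.exp(-abs(alpha) ** 2 / 2) * alpha ** k / sqrt_fact

def spreads(alpha, kmax=60):
    """Root-mean-square deviations of the dimensionless X and P in the state |alpha>."""
    c = coherent_coefficients(alpha, kmax)
    a = np.diag(np.sqrt(np.arange(1, kmax)), k=1)
    x_hat = (a + a.T) / np.sqrt(2)
    p_hat = 1j * (a.T - a) / np.sqrt(2)

    def rms(op):
        mean = np.vdot(c, op @ c)
        mean_sq = np.vdot(c, op @ (op @ c))
        return math.sqrt((mean_sq - mean ** 2).real)

    return rms(x_hat), rms(p_hat)

dx, dp = spreads(1.5 + 0.5j)
```

Dividing these dimensionless deviations by $\beta\sqrt{2}$, as prescribed by (102) and (105), gives the value $1/2\beta$ of (111).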


COMMENT:

Since the magnetic field is uniform, the physical problem we are considering is
invariant with respect to translation. Thus far, this symmetry has been masked by the
choice of the particular gauge (12), which gives the origin O a privileged position with
respect to all other points in space. Consequently, neither the Hamiltonian $H$ nor its
eigenstates are invariant with respect to translation. We know, however (cf. complement
$H_{III}$), that the physical predictions of quantum mechanics are gauge-invariant.
These predictions must remain the same if, by a change of gauge, we give a point other
than O a privileged position. Consequently, the translation symmetry must reappear
when we study the physical properties of a given state.

To show this more precisely, let us assume that, at a given instant, the state of
the particle is characterized in the gauge (12) by the ket $|\psi\rangle$ with which the wave function
$\langle \mathbf{r} | \psi \rangle = \psi(\mathbf{r})$ is associated. We then perform a translation $\mathcal{T}$ defined by the vector $\mathbf{a}$,
and consider the ket $|\psi_T\rangle$ defined by:

    $|\psi_T\rangle = e^{-\frac{i}{\hbar} \mathbf{P} \cdot \mathbf{a}} \, |\psi\rangle$    (112)

with which, according to the results of complement $E_{II}$, is associated the wave function:

    $\psi_T(\mathbf{r}) = \langle \mathbf{r} | \psi_T \rangle = \psi(\mathbf{r} - \mathbf{a})$    (113)

The same translation can be applied to the vector potential, which becomes:

    $\mathbf{A}_T(\mathbf{r}) = \mathbf{A}(\mathbf{r} - \mathbf{a}) = -\frac{1}{2} (\mathbf{r} - \mathbf{a}) \times \mathbf{B}$    (114)

$\mathbf{A}_T(\mathbf{r})$ clearly describes the same magnetic field as $\mathbf{A}(\mathbf{r})$. Since the physical properties
attached to a given state vector depend only on this state vector and the potential $\mathbf{A}$
chosen, they must undergo the translation $\mathcal{T}$ when $\psi(\mathbf{r})$ and $\mathbf{A}(\mathbf{r})$ are replaced by
expressions (113) and (114). It is simple to use these relations to obtain the expression
for the probability density associated with $|\psi_T\rangle$:

    $\rho_T(\mathbf{r}) = |\psi_T(\mathbf{r})|^2 = |\psi(\mathbf{r} - \mathbf{a})|^2 = \rho(\mathbf{r} - \mathbf{a})$    (115)

and that for the current $\mathbf{J}_T(\mathbf{r})$, calculated with the vector potential $\mathbf{A}_T(\mathbf{r})$:

    $\mathbf{J}_T(\mathbf{r}) = \frac{1}{2\mu} \{ \psi_T^*(\mathbf{r}) [ \frac{\hbar}{i} \nabla + \frac{q}{2} (\mathbf{r} - \mathbf{a}) \times \mathbf{B} ] \psi_T(\mathbf{r}) + \text{c.c.} \}$
    $\qquad = \frac{1}{2\mu} \{ \psi^*(\mathbf{r} - \mathbf{a}) [ \frac{\hbar}{i} \nabla + \frac{q}{2} (\mathbf{r} - \mathbf{a}) \times \mathbf{B} ] \psi(\mathbf{r} - \mathbf{a}) + \text{c.c.} \}$
    $\qquad = \mathbf{J}(\mathbf{r} - \mathbf{a})$    (116)

[where $\mathbf{J}(\mathbf{r})$ is the probability current associated with $\psi(\mathbf{r})$ in the gauge (12)]. The
ket $|\psi_T\rangle$ therefore describes, in the new gauge $\mathbf{A}_T(\mathbf{r})$, a state whose physical properties
are related by the translation $\mathcal{T}$ to those corresponding to the ket $|\psi\rangle$ in the gauge $\mathbf{A}(\mathbf{r})$.

Let us show, moreover, that the translation of a possible motion yields another
possible motion; this will conclude the proof of the translation invariance of the problem.
To do so, consider the Schrödinger equation in the $\{ |\mathbf{r}\rangle \}$ representation, in the
gauge $\mathbf{A}(\mathbf{r})$:

    $i\hbar \frac{\partial}{\partial t} \psi(\mathbf{r}, t) = \frac{1}{2\mu} [ \frac{\hbar}{i} \nabla - q\mathbf{A}(\mathbf{r}) ]^2 \psi(\mathbf{r}, t)$    (117)

Changing $\mathbf{r}$ to $\mathbf{r} - \mathbf{a}$ in this equation, we obtain, using (113) and (114):

    $i\hbar \frac{\partial}{\partial t} \psi_T(\mathbf{r}, t) = \frac{1}{2\mu} [ \frac{\hbar}{i} \nabla - q\mathbf{A}_T(\mathbf{r}) ]^2 \psi_T(\mathbf{r}, t)$    (118)

The operator which appears on the right-hand side of (118) is none other than the
Hamiltonian in the gauge $\mathbf{A}_T(\mathbf{r})$. Consequently, if $\psi(\mathbf{r}, t)$ describes, in the gauge $\mathbf{A}(\mathbf{r})$,
a possible motion of the system, $\psi_T(\mathbf{r}, t)$ describes, in the equivalent gauge $\mathbf{A}_T(\mathbf{r})$, another
possible motion, which, according to what we have just shown, is nothing more than
the result of a translation of the first motion. In particular, if

    $\psi(\mathbf{r}, t) = \varphi(\mathbf{r}) \, e^{-iEt/\hbar}$

is a stationary state [in the gauge $\mathbf{A}(\mathbf{r})$],

    $\psi_T(\mathbf{r}, t) = \varphi_T(\mathbf{r}) \, e^{-iEt/\hbar}$

is another stationary state of the same energy [in the gauge $\mathbf{A}_T(\mathbf{r})$].

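If we want to continue to use the gauge (12) after having performed the translation
on the physical state of the particle, we must describe the translated state by
a mathematical ket $|\psi'_T\rangle$ which is different from $|\psi_T\rangle$. According to § 3-b-α of
complement $H_{III}$, the ket $|\psi'_T\rangle$ can be obtained from $|\psi_T\rangle$ by a unitary transformation:

    $|\psi'_T\rangle = T_\chi |\psi_T\rangle$    (119)

The operator $T_\chi$ is given by:

    $T_\chi = e^{\frac{i}{\hbar} q \chi(\mathbf{R})}$    (120)

where $\chi(\mathbf{r})$ is the function characterizing the gauge transformation performed. Here,
the potential after the gauge change is:

    $\mathbf{A}(\mathbf{r}) = -\frac{1}{2} \mathbf{r} \times \mathbf{B} = \mathbf{A}_T(\mathbf{r}) - \frac{1}{2} \mathbf{a} \times \mathbf{B}$    (121)

so that:

    $\chi(\mathbf{r}) = -\frac{1}{2} \mathbf{r} \cdot (\mathbf{a} \times \mathbf{B})$    (122)

Substituting (112), (120) and (122) into (119), we finally obtain:

    $|\psi'_T\rangle = U(\mathbf{a}) |\psi\rangle$    (123)

with:

    $U(\mathbf{a}) = e^{-\frac{i}{\hbar} \frac{q}{2} \mathbf{R} \cdot (\mathbf{a} \times \mathbf{B})} \, e^{-\frac{i}{\hbar} \mathbf{P} \cdot \mathbf{a}}$    (124)

Therefore, if we remain in the gauge $\mathbf{A}(\mathbf{r})$, the translation operator is $U(\mathbf{a})$, given by (124).

The components of $\mathbf{R}$ and $\mathbf{P}$ along two perpendicular axes enter into formula (124);
they therefore commute, so we can write:

    $U(\mathbf{a}) = e^{-\frac{i}{\hbar} [ \frac{q}{2} \mathbf{R} \cdot (\mathbf{a} \times \mathbf{B}) + \mathbf{P} \cdot \mathbf{a} ]}$    (125)

When $\mathbf{a}$ is a vector of the xOy plane, a simple calculation, using formulas (10) and (69),
yields:

    $U(\mathbf{a}) = e^{i \frac{q}{\hbar} (\mathbf{a} \times \mathbf{R}_0) \cdot \mathbf{B}}$    (126)

with:

    $\mathbf{R}_0 = X_0 \mathbf{e}_x + Y_0 \mathbf{e}_y$    (127)

The operators $X_0$ and $Y_0$ (coordinates of the center of the circle) are therefore associated
with infinitesimal translations along Oy and Ox respectively.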


d. TIME EVOLUTION 

α. Mean values of the observables

We have already encountered a certain number of physical quantities which
are constants of the motion: $X_0$, $Y_0$, $\Theta_z$, $\Gamma^2$. Whatever the state of the system,
their mean values are time-independent.

Let us examine the time evolution of the mean values $\langle X \rangle$, $\langle Y \rangle$, $\langle V_x \rangle$, $\langle V_y \rangle$
and $\langle X' \rangle$, $\langle Y' \rangle$. We immediately see from the expressions given in § 3-b-β that
the corresponding operators have matrix elements only between states $|\chi_{n_d, n_g}\rangle$
whose values of $n_d$ differ by $\pm 1$ (or 0). The evolution of these mean values therefore
involves only one Bohr frequency, which is none other than the cyclotron
frequency $\omega_c/2\pi$ defined in (5).

This result is completely analogous to the one given by classical mechanics.


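β. Quasi-classical states

Assume that at $t = 0$ the state of the particle is:

    $|\psi_\perp(0)\rangle = |\alpha_d, \alpha_g\rangle$    (128)

where the ket $|\alpha_d, \alpha_g\rangle$ is defined by expression (70) of complement $D_{VI}$. Since expression (75)
for $H_\perp$ involves $N_d$ but not $N_g$, the state vector $|\psi_\perp(t)\rangle$ at the instant $t$ is obtained by changing
$\alpha_d$ to $\alpha_d e^{-i\omega_c t}$:

    $|\psi_\perp(t)\rangle = e^{-i\omega_c t/2} \, |\alpha_d e^{-i\omega_c t}, \alpha_g\rangle$    (129)

[cf. expression (92) of complement $G_V$].

We set:

    $\alpha_d = |\alpha_d| \, e^{i\varphi_d}$

    $\alpha_g = |\alpha_g| \, e^{i\varphi_g}$    (130)

Relations (80), (81) and (84) then show that:

    $\langle X_0 \rangle = \frac{1}{2\beta} (\alpha_g + \alpha_g^*) = \frac{|\alpha_g|}{\beta} \cos\varphi_g$

    $\langle Y_0 \rangle = \frac{i}{2\beta} (\alpha_g^* - \alpha_g) = \frac{|\alpha_g|}{\beta} \sin\varphi_g$    (131)

    $\langle X' \rangle(t) = \frac{1}{2\beta} (\alpha_d e^{-i\omega_c t} + \alpha_d^* e^{i\omega_c t}) = \frac{|\alpha_d|}{\beta} \cos(\omega_c t - \varphi_d)$

    $\langle Y' \rangle(t) = \frac{i}{2\beta} (\alpha_d e^{-i\omega_c t} - \alpha_d^* e^{i\omega_c t}) = \frac{|\alpha_d|}{\beta} \sin(\omega_c t - \varphi_d)$    (132)

and:

    $\langle V_x \rangle(t) = -\omega_c \frac{|\alpha_d|}{\beta} \sin(\omega_c t - \varphi_d)$

    $\langle V_y \rangle(t) = \omega_c \frac{|\alpha_d|}{\beta} \cos(\omega_c t - \varphi_d)$    (133)

Moreover, the properties of the states $|\alpha\rangle$ imply that:

    $\langle H_\perp \rangle = \hbar\omega_c (|\alpha_d|^2 + \frac{1}{2})$

    $\langle \Theta_z \rangle = 2\hbar (|\alpha_d|^2 + \frac{1}{2})$

    $\langle \Sigma^2 \rangle = \frac{1}{\beta^2} (|\alpha_d|^2 + \frac{1}{2})$    (134)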
All these results are extremely close to those given by classical mechanics [cf. (4)]. We see
that $|\alpha_d|$ is related to the radius $\sigma$ of the classical trajectory and $\varphi_d$ to the initial phase $\varphi_0$,
while $|\alpha_g|$ is related to the distance $OC_0$, and $\varphi_g$ corresponds to the polar angle of the vector $\mathbf{OC_0}$.

Furthermore, the properties of the states $|\alpha\rangle$ can be used to show that:

    $\Delta X_0 = \Delta Y_0 = \Delta X' = \Delta Y' = \frac{1}{2\beta}$    (135-a)

    $\Delta V_x = \Delta V_y = \frac{\omega_c}{2\beta}$    (135-b)

(the products $\Delta X_0 \, \Delta Y_0$, $\Delta X' \, \Delta Y'$ and $\Delta V_x \, \Delta V_y$ therefore take on their minimal values), and:

    $\Delta H_\perp = \hbar\omega_c |\alpha_d|$

    $\Delta\Theta_z = 2\hbar |\alpha_d|$

    $\Delta\Sigma^2 = \frac{1}{\beta^2} |\alpha_d|$    (136)

As for the deviations $\Delta X$ and $\Delta Y$, they can be calculated by using the fact that:

    $|\psi_\perp(t)\rangle = e^{-i\omega_c t/2} \, | \alpha_x = \frac{\alpha_d e^{-i\omega_c t} + \alpha_g}{\sqrt{2}}, \ \alpha_y = \frac{i\alpha_d e^{-i\omega_c t} - i\alpha_g}{\sqrt{2}} \rangle$    (137)

[where $|\alpha_x, \alpha_y\rangle$ is defined by relation (66) of $D_{VI}$], which yields:

    $\Delta X = \Delta Y = \sqrt{\frac{\hbar}{\mu\omega_c}} = \frac{1}{\beta\sqrt{2}}$    (138)

($\Delta P_x$ and $\Delta P_y$ can easily be obtained in the same way).
If the conditions:

    $|\alpha_d| \gg 1, \quad |\alpha_g| \gg 1$    (139)

are satisfied, we see, therefore, that the various physical quantities (position, velocity, energy, ...)
are, in relative value, very well defined. The states (129) therefore represent "quasi-classical"
states of the charged particle placed in a uniform magnetic field.


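COMMENT:

If $\alpha_d = 0$, we obtain:

    $\langle H_\perp \rangle = \frac{\hbar\omega_c}{2}$

    $\Delta H_\perp = 0$    (140)

The states:

    $|\psi_\perp\rangle = |\alpha_d = 0, \alpha_g\rangle$    (141)

therefore correspond to the ground state.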


References and suggestions for further reading : 

Landau and Lifshitz (1.19), chap. XVI, §§124 and 125; Ter Haar (1.23), chap. 6. 
Application to solid state physics : Mott and Jones (13.7), chap. VI, §6; Kittel (13.2), 
chap. 8, p. 239 and chap. 9, p. 290. 
COMPLEMENT $F_{VI}$

EXERCISES


1. Consider a system of angular momentum $j = 1$, whose state space is spanned
by the basis $\{ |+1\rangle, |0\rangle, |-1\rangle \}$ of three eigenvectors common to $\mathbf{J}^2$ (eigenvalue
$2\hbar^2$) and $J_z$ (respective eigenvalues $+\hbar$, $0$ and $-\hbar$). The state of the system
is:

    $|\psi\rangle = \alpha |+1\rangle + \beta |0\rangle + \gamma |-1\rangle$

where $\alpha$, $\beta$, $\gamma$ are three given complex parameters.

a. Calculate the mean value $\langle \mathbf{J} \rangle$ of the angular momentum in terms of $\alpha$, $\beta$
and $\gamma$.

b. Give the expression for the three mean values $\langle J_x^2 \rangle$, $\langle J_y^2 \rangle$ and $\langle J_z^2 \rangle$
in terms of the same quantities.


2. Consider an arbitrary physical system whose four-dimensional state space is
spanned by a basis of four eigenvectors $|j, m_z\rangle$ common to $\mathbf{J}^2$ and $J_z$ ($j = 0$ or $1$;
$-j \leq m_z \leq +j$), of eigenvalues $j(j + 1)\hbar^2$ and $m_z\hbar$, such that:

    $J_\pm |j, m_z\rangle = \hbar \sqrt{j(j + 1) - m_z(m_z \pm 1)} \; |j, m_z \pm 1\rangle$

    $J_+ |j, j\rangle = J_- |j, -j\rangle = 0$

a. Express in terms of the kets $|j, m_z\rangle$ the eigenstates common to $\mathbf{J}^2$ and
$J_x$, to be denoted by $|j, m_x\rangle$.

b. Consider a system in the normalized state:

    $|\psi\rangle = \alpha |j = 1, m_z = 1\rangle + \beta |j = 1, m_z = 0\rangle$
    $\qquad + \gamma |j = 1, m_z = -1\rangle + \delta |j = 0, m_z = 0\rangle$

(i) What is the probability of finding $2\hbar^2$ and $\hbar$ if $\mathbf{J}^2$ and $J_x$ are
measured simultaneously?

(ii) Calculate the mean value of $J_z$ when the system is in the state $|\psi\rangle$,
and the probabilities of the various possible results of a measurement bearing
only on this observable.

(iii) Same questions for the observable $\mathbf{J}^2$ and for $J_x$.

(iv) $J_z^2$ is now measured; what are the possible results, their probabilities,
and their mean value?

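3. Let $\mathbf{L} = \mathbf{R} \times \mathbf{P}$ be the angular momentum of a system whose state space is $\mathcal{E}_r$.
Prove the commutation relations:

    $[L_i, R_j] = i\hbar \, \varepsilon_{ijk} R_k$

    $[L_i, P_j] = i\hbar \, \varepsilon_{ijk} P_k$

    $[L_i, \mathbf{P}^2] = [L_i, \mathbf{R}^2] = [L_i, \mathbf{R} \cdot \mathbf{P}] = 0$

where $L_i$, $R_j$, $P_j$ denote arbitrary components of $\mathbf{L}$, $\mathbf{R}$, $\mathbf{P}$ in an orthonormal
system, and $\varepsilon_{ijk}$ is defined by:

    $\varepsilon_{ijk} = 0$ if two (or three) of the indices $i$, $j$, $k$ are equal
    $\varepsilon_{ijk} = 1$ if these indices are an even permutation of $x$, $y$, $z$
    $\varepsilon_{ijk} = -1$ if the permutation is odd.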


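4. Rotation of a polyatomic molecule

Consider a system composed of $N$ different particles, of positions $\mathbf{R}_1$, ...,
$\mathbf{R}_m$, ..., $\mathbf{R}_N$, and momenta $\mathbf{P}_1$, ..., $\mathbf{P}_m$, ..., $\mathbf{P}_N$. We set:

    $\mathbf{J} = \sum_m \mathbf{L}_m$

with:

    $\mathbf{L}_m = \mathbf{R}_m \times \mathbf{P}_m$

a. Show that the operator $\mathbf{J}$ satisfies the commutation relations which define
an angular momentum, and deduce from this that if $\mathbf{V}$ and $\mathbf{V}'$ denote two ordinary
vectors of three-dimensional space, then:

    $[\mathbf{J} \cdot \mathbf{V}, \mathbf{J} \cdot \mathbf{V}'] = i\hbar (\mathbf{V} \times \mathbf{V}') \cdot \mathbf{J}$

b. Calculate the commutators of $\mathbf{J}$ with the three components of $\mathbf{R}_m$ and
with those of $\mathbf{P}_m$. Show that:

    $[\mathbf{J}, \mathbf{R}_m \cdot \mathbf{R}_p] = 0$

c. Prove that:

    $[\mathbf{J}, \mathbf{J} \cdot \mathbf{R}_m] = 0$

and deduce from this the relation:

    $[\mathbf{J} \cdot \mathbf{R}_m, \mathbf{J} \cdot \mathbf{R}_{m'}] = i\hbar (\mathbf{R}_{m'} \times \mathbf{R}_m) \cdot \mathbf{J} = i\hbar \, \mathbf{J} \cdot (\mathbf{R}_{m'} \times \mathbf{R}_m)$

We set:

    $\mathbf{W} = \sum_m a_m \mathbf{R}_m$

    $\mathbf{W}' = \sum_m a'_m \mathbf{R}_m$

where the coefficients $a_m$ and $a'_m$ are given. Show that:

    $[\mathbf{J} \cdot \mathbf{W}, \mathbf{J} \cdot \mathbf{W}'] = -i\hbar (\mathbf{W} \times \mathbf{W}') \cdot \mathbf{J}$

Conclusion: what is the difference between the commutation relations of the
components of $\mathbf{J}$ along fixed axes and those of the components of $\mathbf{J}$ along the
moving axes of the system being studied?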

d. Consider a molecule which is formed by $N$ unaligned atoms whose
relative distances are assumed to be invariant (a rigid rotator). $\mathbf{J}$ is the sum of the
angular momenta of the atoms with respect to the center of mass of the
molecule, situated at a fixed point O; the Oxyz axes constitute a fixed orthonormal
frame. The three principal inertial axes of the system are denoted by O$\alpha$, O$\beta$ and O$\gamma$,
with the ellipsoid of inertia assumed to be an ellipsoid of revolution about O$\gamma$
(a symmetrical rotator). The rotational energy of the molecule is then:

    $H = \frac{1}{2} [ \frac{J_\gamma^2}{I_\parallel} + \frac{J_\alpha^2 + J_\beta^2}{I_\perp} ]$

where $J_\alpha$, $J_\beta$ and $J_\gamma$ are the components of $\mathbf{J}$ along the unit vectors $\mathbf{w}_\alpha$, $\mathbf{w}_\beta$ and $\mathbf{w}_\gamma$ of
the moving axes O$\alpha$, O$\beta$, O$\gamma$ attached to the molecule, and $I_\parallel$ and $I_\perp$ are the
corresponding moments of inertia. We grant that:

    $J_\alpha^2 + J_\beta^2 + J_\gamma^2 = J_x^2 + J_y^2 + J_z^2 = \mathbf{J}^2$

(i) Derive the commutation relations of $J_\alpha$, $J_\beta$, $J_\gamma$ from the results of c.

(ii) We introduce the operators $N_\pm = J_\alpha \pm iJ_\beta$. Using the general
arguments of chapter VI, show that one can find eigenvectors common to $\mathbf{J}^2$ and $J_\gamma$,
of eigenvalues $J(J + 1)\hbar^2$ and $K\hbar$, with $K = -J, -J + 1, ..., J - 1, J$.

(iii) Express the Hamiltonian $H$ of the rotator in terms of $\mathbf{J}^2$ and $J_\gamma^2$.
Find its eigenvalues.

(iv) Show that one can find eigenstates common to $\mathbf{J}^2$, $J_z$ and $J_\gamma$, to be
denoted by $|J, M, K\rangle$ [the respective eigenvalues are $J(J + 1)\hbar^2$, $M\hbar$, $K\hbar$]. Show
that these states are also eigenstates of $H$.

(v) Calculate the commutators of $J_\pm$ and $N_\pm$ with $\mathbf{J}^2$, $J_z$, $J_\gamma$. Derive
from them the action of $J_\pm$ and $N_\pm$ on $|J, M, K\rangle$. Show that the eigenvalues
of $H$ are at least $2(2J + 1)$-fold degenerate if $K \neq 0$, and $(2J + 1)$-fold degenerate
if $K = 0$.

(vi) Draw the energy diagram of the rigid rotator ($J$ is an integer
since $\mathbf{J}$ is a sum of orbital angular momenta; cf. chapter X). What happens to this
diagram when $I_\parallel = I_\perp$ (spherical rotator)?


5. A system whose state space is $\mathcal{E}_r$ has for its wave function:

    $\psi(x, y, z) = N (x + y + z) \, e^{-r^2/\alpha^2}$

where $\alpha$, which is real, is given and $N$ is a normalization constant.

a. The observables $L_z$ and $\mathbf{L}^2$ are measured; what are the probabilities of
finding $0$ and $2\hbar^2$? Recall that:

    $Y_1^0(\theta, \varphi) = \sqrt{\frac{3}{4\pi}} \cos\theta$

b. If one also uses the fact that:

    $Y_1^{\pm 1}(\theta, \varphi) = \mp \sqrt{\frac{3}{8\pi}} \sin\theta \, e^{\pm i\varphi}$

is it possible to predict directly the probabilities of all possible results of
measurements of $\mathbf{L}^2$ and $L_z$ in the system of wave function $\psi(x, y, z)$?


6. Consider a system of angular momentum $l = 1$. A basis of its state space
is formed by the three eigenvectors of $L_z$: $|+1\rangle$, $|0\rangle$, $|-1\rangle$, whose eigenvalues
are, respectively, $+\hbar$, $0$, and $-\hbar$, and which satisfy:

$$L_\pm |m\rangle = \hbar\sqrt{2}\, |m \pm 1\rangle$$

$$L_+ |1\rangle = L_- |-1\rangle = 0$$

This system, which possesses an electric quadrupole moment, is placed in an electric
field gradient, so that its Hamiltonian can be written:

$$H = \frac{\omega_0}{\hbar}\left(L_u^2 - L_v^2\right)$$

where $L_u$ and $L_v$ are the components of $\mathbf{L}$ along the two directions $Ou$ and $Ov$
of the $xOz$ plane which form angles of 45° with $Ox$ and $Oz$; $\omega_0$ is a real constant.

a. Write the matrix which represents $H$ in the $\{ |+1\rangle, |0\rangle, |-1\rangle \}$ basis.
What are the stationary states of the system, and what are their energies? (These
states are to be written $|E_1\rangle$, $|E_2\rangle$, $|E_3\rangle$, in order of decreasing energies.)

b. At time $t = 0$, the system is in the state:

$$|\psi(0)\rangle = \frac{1}{\sqrt{2}} \left[\, |+1\rangle - |-1\rangle \,\right]$$

What is the state vector $|\psi(t)\rangle$ at time $t$? At $t$, $L_z$ is measured; what are the
probabilities of the various possible results?

c. Calculate the mean values $\langle L_y \rangle(t)$ and $\langle L_z \rangle(t)$ at $t$. What is
the motion performed by the vector $\langle \mathbf{L} \rangle$?

d. At $t$, a measurement of $L_z^2$ is performed.

(i) Do times exist when only one result is possible?

(ii) Assume that this measurement has yielded the result $\hbar^2$. What is the
state of the system immediately after the measurement? Indicate, without
calculation, its subsequent evolution.


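7. Consider rotations in ordinary three-dimensional space, to be denoted
by $\mathcal{R}_{\mathbf u}(\alpha)$, where $\mathbf u$ is the unit vector which defines the axis of rotation and $\alpha$ is the
angle of rotation.

a. Show that, if $M'$ is the transform of $M$ under an infinitesimal rotation
of angle $\varepsilon$, then:

$$\mathbf{OM'} = \mathbf{OM} + \varepsilon\, \mathbf u \times \mathbf{OM}$$

b. If $\mathbf{OM}$ is represented by the column vector $\begin{pmatrix} x \\ y \\ z \end{pmatrix}$, what is the matrix
associated with $\mathcal{R}_{\mathbf u}(\varepsilon)$? Derive from it the matrices which represent the components
of the operator $\boldsymbol{\mathcal{M}}$ defined by:

$$\mathcal{R}_{\mathbf u}(\varepsilon) = 1 + \varepsilon\, \boldsymbol{\mathcal{M}} \cdot \mathbf u$$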




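c. Calculate the commutators:

$$[\mathcal{M}_x, \mathcal{M}_y]\,; \quad [\mathcal{M}_y, \mathcal{M}_z]\,; \quad [\mathcal{M}_z, \mathcal{M}_x]$$

What are the quantum mechanical analogues of the purely geometrical
relations obtained?

d. Starting with the matrix which represents $\mathcal{M}_z$, calculate the one which
represents $e^{\alpha \mathcal{M}_z}$; show that $\mathcal{R}_z(\alpha) = e^{\alpha \mathcal{M}_z}$; what is the analogue of this relation in
quantum mechanics?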

8. Consider a particle in three-dimensional space, whose state vector is $|\psi\rangle$,
and whose wave function is $\psi(\mathbf r) = \langle \mathbf r | \psi \rangle$. Let $A$ be an observable which
commutes with $\mathbf L = \mathbf R \times \mathbf P$, the orbital angular momentum of the particle.
Assuming that $A$, $\mathbf L^2$ and $L_z$ form a C.S.C.O. in $\mathcal{E}_{\mathbf r}$, call $|n, l, m\rangle$ their common
eigenkets, whose eigenvalues are, respectively, $a_n$ (the index $n$ is assumed to be
discrete), $l(l+1)\hbar^2$ and $m\hbar$.

Let $U(\varphi)$ be the unitary operator defined by:

$$U(\varphi) = e^{-i\varphi L_z/\hbar}$$

where $\varphi$ is a real dimensionless parameter. For an arbitrary operator $K$, we
call $\tilde K$ the transform of $K$ by the unitary operator $U(\varphi)$:

$$\tilde K = U(\varphi)\, K\, U^\dagger(\varphi)$$

a. We set $L_+ = L_x + iL_y$, $L_- = L_x - iL_y$. Calculate $\tilde L_+ |n, l, m\rangle$ and
show that $L_+$ and $\tilde L_+$ are proportional; calculate the proportionality constant.
Same question for $L_-$ and $\tilde L_-$.

b. Express $\tilde L_x$, $\tilde L_y$ and $\tilde L_z$ in terms of $L_x$, $L_y$ and $L_z$. What geometrical
transformation can be associated with the transformation of $\mathbf L$ into $\tilde{\mathbf L}$?

c. Calculate the commutators $[X + iY, L_z]$ and $[Z, L_z]$. Show that the
kets $(X + iY)|n, l, m\rangle$ and $Z|n, l, m\rangle$ are eigenvectors of $L_z$ and calculate their
eigenvalues. What relation must exist between $m$ and $m'$ for the matrix element
$\langle n', l', m' | X \pm iY | n, l, m \rangle$ to be non-zero? Same question for
$\langle n', l', m' | Z | n, l, m \rangle$.

d. By comparing the matrix elements of $\tilde X \pm i\tilde Y$ and $\tilde Z$ with those of $X \pm iY$
and $Z$, calculate $\tilde X$, $\tilde Y$, $\tilde Z$ in terms of $X$, $Y$, $Z$. Give a geometrical interpretation.


9. Consider a physical system of fixed angular momentum $l$, whose state
space is $\mathcal{E}_l$, and whose state vector is $|\psi\rangle$; its orbital angular momentum
operator is denoted by $\mathbf L$. We assume that a basis of $\mathcal{E}_l$ is composed of $2l + 1$
eigenvectors $|l, m\rangle$ of $L_z$ ($-l \leq m \leq +l$), associated with the wave
functions $f(r)\, Y_l^m(\theta, \phi)$. We call $\langle \mathbf L \rangle = \langle \psi | \mathbf L | \psi \rangle$ the mean value of $\mathbf L$.

a. We begin by assuming that:

$$\langle L_x \rangle = \langle L_y \rangle = 0$$

Out of all the possible states of the system, what are those for which the
sum $(\Delta L_x)^2 + (\Delta L_y)^2 + (\Delta L_z)^2$ is minimal? Show that, for these states, the
root-mean-square deviation $\Delta L_\alpha$ of the component of $\mathbf L$ along an axis which is at an
angle $\alpha$ with $Oz$ is given by:

$$\Delta L_\alpha = \hbar \sqrt{\frac{l}{2}}\, \sin\alpha$$


b. We now assume that $\langle \mathbf L \rangle$ has an arbitrary direction with respect to
the $Oxyz$ axes. We denote by $OXYZ$ a frame whose $OZ$ axis is directed along $\langle \mathbf L \rangle$,
with the $OY$ axis in the $xOy$ plane.

(i) Show that the state $|\psi_0\rangle$ of the system for which:

$$(\Delta L_x)^2 + (\Delta L_y)^2 + (\Delta L_z)^2$$

is minimal is such that:

$$(L_X + iL_Y)\, |\psi_0\rangle = 0$$

$$L_Z\, |\psi_0\rangle = l\hbar\, |\psi_0\rangle$$


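(ii) Let $\theta_0$ be the angle between $Oz$ and $OZ$, and $\varphi_0$, the angle
between $Oy$ and $OY$; prove the relations:

$$L_X + iL_Y = \cos^2\frac{\theta_0}{2}\, e^{-i\varphi_0} L_+ - \sin^2\frac{\theta_0}{2}\, e^{i\varphi_0} L_- - \sin\theta_0\, L_z$$

$$L_Z = \sin\frac{\theta_0}{2} \cos\frac{\theta_0}{2}\, e^{-i\varphi_0} L_+ + \sin\frac{\theta_0}{2} \cos\frac{\theta_0}{2}\, e^{i\varphi_0} L_- + \cos\theta_0\, L_z$$

If we set:

$$|\psi_0\rangle = \sum_m d_m\, |l, m\rangle$$

show that:

$$d_m = \tan\frac{\theta_0}{2}\, e^{i\varphi_0} \sqrt{\frac{l + m + 1}{l - m}}\; d_{m+1}$$

Express $d_m$ in terms of $d_l$, $\theta_0$, $\varphi_0$ and $l$.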

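(iii) To calculate $d_l$, show that the wave function associated with $|\psi_0\rangle$
is $\psi_0(X, Y, Z) = c_l\, \dfrac{(X + iY)^l}{r^l}\, f(r)$ [where $c_l$ is defined by equation (D-20) of
chapter VI], the one associated with $|l, l\rangle$ being $c_l\, \dfrac{(x + iy)^l}{r^l}\, f(r)$. By replacing $X$, $Y$
and $Z$ in this expression for $\psi_0(X, Y, Z)$ by their values in terms of $x$, $y$, $z$, find
the value of $d_l$ and the relation:

$$d_m = \left(\sin\frac{\theta_0}{2}\right)^{l-m} \left(\cos\frac{\theta_0}{2}\right)^{l+m} e^{-im\varphi_0} \sqrt{\frac{(2l)!}{(l + m)!\,(l - m)!}}$$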

(iv) With the system in the state $|\psi_0\rangle$, $L_z$ is measured. What are the
probabilities of the various possible results? What is the most probable result?
Show that, if $l$ is much greater than 1, the results correspond to the classical limit.


10. Let $\mathbf J$ be the angular momentum operator of an arbitrary physical system
whose state vector is $|\psi\rangle$.

a. Can states of the system be found for which the root-mean-square
deviations $\Delta J_x$, $\Delta J_y$ and $\Delta J_z$ are simultaneously zero?

b. Prove the relation:

$$\Delta J_x \cdot \Delta J_y \geq \frac{\hbar}{2}\, |\langle J_z \rangle|$$

and those obtained by cyclic permutation of $x$, $y$, $z$.

Let $\langle \mathbf J \rangle$ be the mean value of the angular momentum of the system. The
$Oxyz$ axes are assumed to be chosen in such a way that $\langle J_x \rangle = \langle J_y \rangle = 0$. Show
that:

$$(\Delta J_x)^2 + (\Delta J_y)^2 \geq \hbar\, |\langle J_z \rangle|$$

c. Show that the two inequalities proven in question b. both become equalities
if and only if $J_+ |\psi\rangle = 0$ or $J_- |\psi\rangle = 0$.

d. The system under consideration is a spinless particle for which
$\mathbf J = \mathbf L = \mathbf R \times \mathbf P$. Show that it is not possible to have both $\Delta L_x \cdot \Delta L_y = \frac{\hbar}{2} |\langle L_z \rangle|$
and $(\Delta L_x)^2 + (\Delta L_y)^2 = \hbar\, |\langle L_z \rangle|$ unless the wave function of the system is of the
form:

$$\psi(r, \theta, \phi) = F(r, \sin\theta\, e^{\pm i\phi})$$


11. Consider a three-dimensional harmonic oscillator, whose state vector $|\psi\rangle$ is:

$$|\psi\rangle = |\alpha_x\rangle \otimes |\alpha_y\rangle \otimes |\alpha_z\rangle$$

where $|\alpha_x\rangle$, $|\alpha_y\rangle$ and $|\alpha_z\rangle$ are quasi-classical states (cf. complement $G_V$) for
one-dimensional harmonic oscillators moving along $Ox$, $Oy$ and $Oz$, respectively.
Let $\mathbf L = \mathbf R \times \mathbf P$ be the orbital angular momentum of the three-dimensional
oscillator.

a. Prove:

$$\langle L_z \rangle = i\hbar \left( \alpha_x \alpha_y^* - \alpha_x^* \alpha_y \right)$$

$$\Delta L_z = \hbar \sqrt{|\alpha_x|^2 + |\alpha_y|^2}$$

and the analogous expressions for the components of $\mathbf L$ along $Ox$ and $Oy$.

b. We now assume that:

$$\langle L_x \rangle = \langle L_y \rangle = 0\,, \qquad \langle L_z \rangle = \lambda\hbar > 0$$

Show that $\alpha_z$ must be zero. We then fix the value of $\lambda$. Show that, in order to
minimize $\Delta L_x + \Delta L_y$, we must choose:

$$\alpha_x = -i\alpha_y = \sqrt{\frac{\lambda}{2}}\, e^{i\varphi_0}$$

(where $\varphi_0$ is an arbitrary real number). Do the expressions $\Delta L_x \cdot \Delta L_y$ and
$(\Delta L_x)^2 + (\Delta L_y)^2$ in this case have minimum values which are compatible with
the inequalities obtained in question b. of the preceding exercise?

c. Show that the state of a system for which the preceding conditions are
satisfied is necessarily of the form:

$$|\psi\rangle = \sum_k c_k(\alpha_d)\, |\chi_{n_d = k,\, n_g = 0,\, n_z = 0}\rangle$$

with:

$$|\chi_{n_d = k,\, n_g = 0,\, n_z = 0}\rangle = \frac{(a_x^\dagger + i a_y^\dagger)^k}{\sqrt{2^k\, k!}}\, |\varphi_{n_x = 0,\, n_y = 0,\, n_z = 0}\rangle$$

$$c_k(\alpha) = \frac{\alpha^k}{\sqrt{k!}}\, e^{-|\alpha|^2/2}\,; \qquad \alpha_d = e^{i\varphi_0} \sqrt{\lambda}$$

(the results of complement $G_V$ and of § 4 of complement $D_{VI}$ can be used). Show
that the angular dependence of $|\chi_{n_d = k,\, n_g = 0,\, n_z = 0}\rangle$ is $(\sin\theta\, e^{i\phi})^k$.

$\mathbf L^2$ is measured on a system in the state $|\psi\rangle$. Show that the probabilities
of the various possible results are given by a Poisson distribution. What results
can be obtained in a measurement of $L_z$ which follows a measurement of $\mathbf L^2$ whose
result was $l(l+1)\hbar^2$?


Reference: 

Exercise 4 : Landau and Lifshitz (1.19), § 101; Ter Haar (1.23), §§8.13 and 8.14. 


Particle in a central potential.
The hydrogen atom



OUTLINE OF CHAPTER VII 


A. STATIONARY STATES OF A PARTICLE IN A CENTRAL POTENTIAL

1. Outline of the problem
a. Review of some classical results
b. The quantum mechanical Hamiltonian
2. Separation of variables
a. Angular dependence of the eigenfunctions
b. The radial equation
c. Behavior of the solutions of the radial equation at the origin
3. Stationary states of a particle in a central potential
a. Quantum numbers
b. Degeneracy of the energy levels


B. MOTION OF THE CENTER OF MASS AND RELATIVE MOTION FOR A SYSTEM
OF TWO INTERACTING PARTICLES

1. Motion of the center of mass and relative motion in classical mechanics
2. Separation of variables in quantum mechanics
a. Observables associated with the center of mass and with the relative particle
b. Eigenvalues and eigenfunctions of the Hamiltonian


C. THE HYDROGEN ATOM

1. Introduction
2. The Bohr model
3. Quantum mechanical theory of the hydrogen atom
a. Change of variables
b. Solving the radial equation
c. Energy quantization. Radial functions
4. Discussion of the results
a. Order of magnitude of the atomic parameters
b. Energy levels
c. Wave functions


In this chapter, we shall consider the quantum mechanical properties of a particle
placed in a central potential [that is, a potential $V(r)$ which depends only on the
distance $r$ from the origin]. This problem is closely related to the study of angular
momentum presented in the preceding chapter. As we shall see in § A, the fact
that $V(r)$ is invariant under any rotation about the origin means that the
Hamiltonian $H$ of the particle commutes with the three components of the orbital
angular momentum operator $\mathbf L$. This considerably simplifies the determination
of the eigenfunctions and eigenvalues of $H$, since these functions can be required
to be eigenfunctions of $\mathbf L^2$ and $L_z$ as well. This immediately defines their angular
dependence, and the eigenvalue equation of $H$ can be replaced by a differential
equation involving only the variable $r$.

The importance of this problem derives from a property which will be 
established in § B : a two-particle system in which the interaction is described 
by a potential energy which depends only on the relative positions of the particles 
can be reduced to a simpler problem involving only one fictitious particle. In 
addition, when the interaction potential of the two particles depends only on the 
distance between them, the fictitious particle’s motion is governed by a central 
potential. This explains why the problem considered in this chapter is of such 
general interest: it is encountered in quantum mechanics whenever we investigate 
the behavior of an isolated system composed of two interacting particles. 

In §C, we shall apply the general methods already described to a special 
case : that in which V(r) is a Coulomb potential. The hydrogen atom, composed 
of an electron and a proton which electrostatically attract each other, supplies 
the simplest example of a system of this type. It is not the only one: in addition
to hydrogen isotopes (deuterium, tritium), there are the hydrogenoid ions,
which are systems composed of a single electron and a nucleus, such as the ions He$^{+}$,
Li$^{++}$, etc. (other examples will be given in complement $A_{VII}$). For these systems,
we shall explicitly calculate the energies of the bound states and the corresponding 
wave functions. We also recall the fact that, historically, quantum mechanics was 
introduced in order to explain atomic properties (in particular, those of the 
simplest atom, hydrogen), which could not be accounted for by classical mechanics. 
The remarkable agreement between the theoretical predictions and the experimental 
observations constitutes one of the most spectacular successes of this branch of 
physics. Finally, it should be noted that the exact results concerning the hydrogen 
atom serve as the basis of all approximate calculations relating to more complex 
atoms (having several electrons). 


A. STATIONARY STATES OF A PARTICLE 
IN A CENTRAL POTENTIAL 


In this section, we consider a (spinless) particle of mass $\mu$, subject to a central
force derived from the potential $V(r)$ (the center of force is chosen as the origin).


1. Outline of the problem 


a. REVIEW OF SOME CLASSICAL RESULTS 


The force acting on the classical particle situated at the point $M$ (with $\mathbf{OM} = \mathbf r$)
is equal to:

$$\mathbf F = -\nabla V(r) = -\frac{dV}{dr}\, \frac{\mathbf r}{r} \tag{A-1}$$


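$\mathbf F$ is therefore always directed along the line through $O$ (towards $O$ for an attractive
force), and its moment with respect to this point is always zero. If:

$$\boldsymbol{\mathcal{L}} = \mathbf r \times \mathbf p \tag{A-2}$$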


is the angular momentum of the particle with respect to $O$, the angular momentum
theorem then implies that:

$$\frac{d\boldsymbol{\mathcal{L}}}{dt} = 0 \tag{A-3}$$

$\boldsymbol{\mathcal{L}}$ is therefore a constant of the motion, and the particle's trajectory is
necessarily situated in the plane passing through $O$ and perpendicular to $\boldsymbol{\mathcal{L}}$.



Now let us consider (fig. 1) the position (denoted by $\mathbf{OM} = \mathbf r$) and velocity $\mathbf v$
of the particle at the instant $t$. The two vectors $\mathbf r$ and $\mathbf v$ lie in the plane of the trajectory,
and the velocity $\mathbf v$ can be decomposed into the radial component $\mathbf v_r$ (along the axis
defined by $\mathbf r$) and the tangential component $\mathbf v_\perp$ (along the axis perpendicular to $\mathbf r$).
The radial velocity, the algebraic value of $\mathbf v_r$, is the time derivative of the distance
of the particle from the point $O$:

$$v_r = \frac{dr}{dt} \tag{A-4}$$

The tangential velocity can be expressed in terms of $r$ and the angular momentum $\boldsymbol{\mathcal{L}}$,
since:

$$|\mathbf r \times \mathbf v| = r\, |\mathbf v_\perp| \tag{A-5}$$

so that the modulus of the angular momentum $\boldsymbol{\mathcal{L}}$ is equal to:

$$|\boldsymbol{\mathcal{L}}| = |\mathbf r \times \mu \mathbf v| = \mu\, r\, |\mathbf v_\perp| \tag{A-6}$$

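The total energy of the particle:

$$E = \frac{1}{2} \mu \mathbf v^2 + V(r) = \frac{1}{2} \mu \mathbf v_r^2 + \frac{1}{2} \mu \mathbf v_\perp^2 + V(r) \tag{A-7}$$

can then be written:

$$E = \frac{1}{2} \mu v_r^2 + \frac{\boldsymbol{\mathcal{L}}^2}{2\mu r^2} + V(r) \tag{A-8}$$

The classical Hamiltonian of the system is then:

$$\mathcal{H} = \frac{p_r^2}{2\mu} + \frac{\boldsymbol{\mathcal{L}}^2}{2\mu r^2} + V(r) \tag{A-9}$$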


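where:

$$p_r = \mu \frac{dr}{dt} \tag{A-10}$$

is the conjugate momentum of $r$, and $\boldsymbol{\mathcal{L}}^2$ must be expressed in terms of the
variables $r$, $\theta$, $\phi$ and their conjugate momenta $p_r$, $p_\theta$, $p_\phi$. One finds (cf. appendix III,
§ 4-a):

$$\boldsymbol{\mathcal{L}}^2 = p_\theta^2 + \frac{1}{\sin^2\theta}\, p_\phi^2 \tag{A-11}$$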

In expression (A-9), the kinetic energy is broken into two terms: the radial
kinetic energy and the kinetic energy of rotation about $O$. The reason is that,
since $V(r)$ is independent of $\theta$ and $\phi$ in this case, the angular variables and their
conjugate momenta appear only in the $\boldsymbol{\mathcal{L}}^2$ term. In fact, if we are interested in the
evolution of $r$, we can use the fact that $\boldsymbol{\mathcal{L}}$ is a constant of the motion, and
replace $\boldsymbol{\mathcal{L}}^2$ by a constant in expression (A-9). The Hamiltonian then appears as
a function only of the radial variables $r$ and $p_r$ ($\boldsymbol{\mathcal{L}}^2$ plays the role of a parameter),
and the result is a differential equation involving only one variable, $r$:

$$\frac{dp_r}{dt} = \mu \frac{d^2 r}{dt^2} = -\frac{\partial \mathcal{H}}{\partial r} = \frac{\boldsymbol{\mathcal{L}}^2}{\mu r^3} - \frac{dV}{dr} \tag{A-12}$$

It is just as if we had a one-dimensional problem (with $r$ varying only between $0$
and $+\infty$), with a particle of mass $\mu$ subjected to the "effective potential":

$$V_{\mathrm{eff}}(r) = V(r) + \frac{\boldsymbol{\mathcal{L}}^2}{2\mu r^2} \tag{A-13}$$

We shall see that the situation is analogous in quantum mechanics. 
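
The classical reduction above can be checked numerically. The following is a minimal sketch, not part of the original text: the function names, the sample values of the mass, position and velocity, and the choice of an attractive potential $V(r) = -1/r$ in arbitrary units are all assumptions made for illustration. It splits the velocity into its radial and tangential parts and compares the energy written in Cartesian form (A-7) with the radial form (A-8), that is, the radial kinetic energy plus the effective potential (A-13).

```python
import math

def radial_decomposition(mu, r_vec, v_vec):
    """Return (r, v_r, ang_mom): distance, radial velocity, |angular momentum|."""
    r = math.sqrt(sum(x * x for x in r_vec))
    # Radial velocity: projection of v on r/r; this is dr/dt, cf. (A-4).
    v_r = sum(x * v for x, v in zip(r_vec, v_vec)) / r
    # Modulus of the angular momentum |r x (mu v)|, cf. (A-6).
    cx = r_vec[1] * v_vec[2] - r_vec[2] * v_vec[1]
    cy = r_vec[2] * v_vec[0] - r_vec[0] * v_vec[2]
    cz = r_vec[0] * v_vec[1] - r_vec[1] * v_vec[0]
    ang_mom = mu * math.sqrt(cx * cx + cy * cy + cz * cz)
    return r, v_r, ang_mom

def effective_potential(potential, mu, ang_mom, r):
    """V_eff(r) = V(r) + L^2 / (2 mu r^2), cf. (A-13)."""
    return potential(r) + ang_mom ** 2 / (2.0 * mu * r * r)

def coulomb_like(r):
    """Assumed example potential V(r) = -1/r (arbitrary units)."""
    return -1.0 / r

# Illustrative values (assumptions, not taken from the text).
mu = 2.0
r_vec = (1.0, 2.0, -0.5)
v_vec = (0.3, -0.1, 0.7)

r, v_r, ang_mom = radial_decomposition(mu, r_vec, v_vec)
energy_cartesian = 0.5 * mu * sum(v * v for v in v_vec) + coulomb_like(r)
energy_radial = 0.5 * mu * v_r ** 2 + effective_potential(coulomb_like, mu, ang_mom, r)
print(energy_cartesian, energy_radial)
```

The two printed values should agree to rounding error, since the identity $|\mathbf r \times \mathbf v|^2 = r^2 v^2 - (\mathbf r \cdot \mathbf v)^2$ is all that separates (A-7) from (A-8).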


b. THE QUANTUM MECHANICAL HAMILTONIAN 


In quantum mechanics, we want to solve the eigenvalue equation of the
Hamiltonian $H$, the observable associated with the total energy. This equation is
written, in the $\{ |\mathbf r\rangle \}$ representation:

$$\left[ -\frac{\hbar^2}{2\mu} \Delta + V(r) \right] \varphi(\mathbf r) = E\, \varphi(\mathbf r) \tag{A-14}$$

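Since the potential $V$ depends only on the distance $r$ of the particle from
the origin, spherical coordinates (cf. § D-1-a of chapter VI) are best adapted to the
problem. We therefore express the Laplacian $\Delta$ in spherical coordinates*:

$$\Delta = \frac{1}{r} \frac{\partial^2}{\partial r^2}\, r + \frac{1}{r^2} \left( \frac{\partial^2}{\partial \theta^2} + \frac{1}{\tan\theta} \frac{\partial}{\partial \theta} + \frac{1}{\sin^2\theta} \frac{\partial^2}{\partial \phi^2} \right) \tag{A-15}$$

and look for eigenfunctions $\varphi(\mathbf r)$ which are functions of the variables $r$, $\theta$, $\phi$.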

If we compare expression (A-15) with the one for the operator $\mathbf L^2$
[formula (D-6-a) of chapter VI], we see that the quantum mechanical Hamiltonian $H$
can be put in a form which is completely analogous to (A-9):

$$H = -\frac{\hbar^2}{2\mu} \frac{1}{r} \frac{\partial^2}{\partial r^2}\, r + \frac{1}{2\mu r^2}\, \mathbf L^2 + V(r) \tag{A-16}$$


The angular dependence of the Hamiltonian is contained entirely in the $\mathbf L^2$ term,
which is an operator here. We could, in fact, perfect the analogy by defining an
operator $P_r$, which would allow us to write the first term of (A-16) like the one
in (A-9).

We shall now show how one can solve the eigenvalue equation:

$$\left[ -\frac{\hbar^2}{2\mu} \frac{1}{r} \frac{\partial^2}{\partial r^2}\, r + \frac{1}{2\mu r^2}\, \mathbf L^2 + V(r) \right] \varphi(r, \theta, \phi) = E\, \varphi(r, \theta, \phi) \tag{A-17}$$


* Expression (A-15) gives the Laplacian only for non-zero r. This is because of the privileged 
position of the origin in spherical coordinates; it can be seen, moreover, that expression (A-15) is not 
defined for $r = 0$.


2. Separation of variables

a. ANGULAR DEPENDENCE OF THE EIGENFUNCTIONS 

We know [cf. formulas (D-5) of chapter VI] that the three components
of the angular momentum operator $\mathbf L$ act only on the angular variables $\theta$ and $\phi$;
consequently, they commute with all operators which act only on the $r$-dependence.
In addition, they commute with $\mathbf L^2$. Therefore, according to expression (A-16)
for the Hamiltonian, the three components of $\mathbf L$ are constants of the motion* in the
quantum mechanical sense:

[H, L] =0 (A-18) 

Obviously, H also commutes with L 2 . 

Although we have at our disposal four constants of the motion ($L_x$, $L_y$, $L_z$
and $\mathbf L^2$), we cannot use all four of them to solve equation (A-17) because they do not
commute with each other; we shall use only $\mathbf L^2$ and $L_z$. Since the three
observables $H$, $\mathbf L^2$ and $L_z$ commute, we can find a basis of the state space $\mathcal{E}_{\mathbf r}$ of the
particle composed of eigenfunctions common to these three observables. We can,
therefore, without restricting the generality of the problem posed in § 1 above,
require the functions $\varphi(r, \theta, \phi)$, solutions of equation (A-17), to be eigenfunctions
of $\mathbf L^2$ and $L_z$ as well. We must then solve the system of differential equations:

$$H \varphi(\mathbf r) = E\, \varphi(\mathbf r) \tag{A-19-a}$$

$$\mathbf L^2 \varphi(\mathbf r) = l(l+1)\hbar^2\, \varphi(\mathbf r) \tag{A-19-b}$$

$$L_z \varphi(\mathbf r) = m\hbar\, \varphi(\mathbf r) \tag{A-19-c}$$

But we already know the general form of the common eigenfunctions of $\mathbf L^2$
and $L_z$ (chap. VI, § D-1-b-β): the solutions $\varphi(\mathbf r)$ of equations (A-19), corresponding
to fixed values of $l$ and $m$, are necessarily products of a function of $r$ alone and the
spherical harmonic $Y_l^m(\theta, \phi)$:

$$\varphi(\mathbf r) = R(r)\, Y_l^m(\theta, \phi) \tag{A-20}$$

Whatever the radial function $R(r)$, $\varphi(\mathbf r)$ is a solution of equations (A-19-b)
and (A-19-c). The only problem which remains to be solved is therefore how
to determine $R(r)$ such that $\varphi(\mathbf r)$ is also an eigenfunction of $H$ [equation (A-19-a)].


b. THE RADIAL EQUATION 


We shall now substitute expressions (A-16) and (A-20) into equation (A-19-a).
Since $\varphi(\mathbf r)$ is an eigenfunction of $\mathbf L^2$ with the eigenvalue $l(l+1)\hbar^2$, we see
that $Y_l^m(\theta, \phi)$ is a common factor on both sides. After simplifying, we obtain the
radial equation:

$$\left[ -\frac{\hbar^2}{2\mu} \frac{1}{r} \frac{d^2}{dr^2}\, r + \frac{l(l+1)\hbar^2}{2\mu r^2} + V(r) \right] R(r) = E\, R(r) \tag{A-21}$$


* Equation (A-18) expresses the fact that $H$ is a scalar operator with respect to rotations about the
point $O$ (see complement $B_{VI}$). This is true because the potential energy is invariant under rotations about $O$.

Actually, a solution of (A-21), substituted into (A-20), does not necessarily
yield a solution of the eigenvalue equation (A-14) of the Hamiltonian. As we have
already pointed out (cf. the footnote to expression (A-15)), expression (A-15) for the Laplacian is
not necessarily valid at $r = 0$. We must therefore make sure that the behavior of the
solutions $R(r)$ of (A-21) at the origin is sufficiently regular for (A-20) to be in fact
a solution of (A-14).

Instead of solving the partial differential equation (A-17) in the three
variables $r$, $\theta$, $\phi$, we must now solve a differential equation involving only the
variable $r$, but dependent on a parameter $l$: we are looking for eigenvalues and
eigenfunctions of an operator $H_l$ which is different for each value of $l$.

In other words, we consider separately, in the state space $\mathcal{E}_{\mathbf r}$, the
subspaces $\mathcal{E}(l, m)$ corresponding to fixed values of $l$ and $m$ (cf. chap. VI, § C-3-a),
studying the eigenvalue equation of $H$ in each of these subspaces (which is possible
because $H$ commutes with $\mathbf L^2$ and $L_z$). The equation to be solved depends on $l$,
but not on $m$; it is therefore the same in the $(2l + 1)$ subspaces $\mathcal{E}(l, m)$ associated
with a given value of $l$. We shall denote by $E_{k,l}$ the eigenvalues of $H_l$, that is, the
eigenvalues of the Hamiltonian $H$ inside a given subspace $\mathcal{E}(l, m)$. The index $k$,
which can be discrete or continuous, represents the various eigenvalues associated
with the same value of $l$. As for the eigenfunctions of $H_l$, we shall label them with
the same two indices as the eigenvalues: $R_{k,l}(r)$. It is not obvious that this is
sufficient: several radial functions might exist and be eigenfunctions of the same
operator $H_l$ with the same eigenvalue $E_{k,l}$; we shall see in § 3-b that this is not the
case and that, consequently, the two indices $k$ and $l$ are sufficient to characterize the
different radial functions. We shall therefore rewrite equation (A-21) in the form:

$$\left[ -\frac{\hbar^2}{2\mu} \frac{1}{r} \frac{d^2}{dr^2}\, r + \frac{l(l+1)\hbar^2}{2\mu r^2} + V(r) \right] R_{k,l}(r) = E_{k,l}\, R_{k,l}(r) \tag{A-22}$$


We can simplify the differential operator to be studied by a change of
function. We set:

$$R_{k,l}(r) = \frac{1}{r}\, u_{k,l}(r) \tag{A-23}$$

Multiplying both sides of (A-22) by $r$, we obtain for $u_{k,l}(r)$ the following differential
equation:

$$\left[ -\frac{\hbar^2}{2\mu} \frac{d^2}{dr^2} + \frac{l(l+1)\hbar^2}{2\mu r^2} + V(r) \right] u_{k,l}(r) = E_{k,l}\, u_{k,l}(r) \tag{A-24}$$

This equation is analogous to the one which we would have to solve if,
in a one-dimensional problem, a particle of mass $\mu$ were moving in an effective
potential $V_{\mathrm{eff}}(r)$ such that:

$$V_{\mathrm{eff}}(r) = V(r) + \frac{l(l+1)\hbar^2}{2\mu r^2} \tag{A-25}$$


Nevertheless, we must not lose sight of the fact that the variable $r$ can take on
only non-negative real values. The term $l(l+1)\hbar^2/2\mu r^2$ which is added to the
potential $V(r)$ is always positive or zero; the corresponding force (equal to minus
the gradient of this term) always tends to repel the particle from the force center $O$;
this is why this term is called the centrifugal potential (or centrifugal barrier).
Figure 2 represents the shape of the effective potential $V_{\mathrm{eff}}(r)$ for various values of $l$
in the case in which $V(r)$ is an attractive Coulomb potential [$V(r) = -e^2/r$]:
for $l \geq 1$, the presence of the centrifugal term, which predominates for small $r$ values,
causes $V_{\mathrm{eff}}(r)$ to be repulsive for short distances.



FIGURE 2

Shape of the effective potential $V_{\mathrm{eff}}(r)$ for the first values of $l$ in the case
where $V(r) = -e^2/r$. When $l = 0$, $V_{\mathrm{eff}}(r)$ is simply equal to $V(r)$. When $l$
takes on the values 1, 2, etc., $V_{\mathrm{eff}}(r)$ is obtained by adding to $V(r)$ the centrifugal
potential $l(l+1)\hbar^2/2\mu r^2$, which approaches $+\infty$ when $r$ approaches zero.


c. BEHAVIOR OF THE SOLUTIONS OF THE RADIAL EQUATION 

AT THE ORIGIN 

We have already pointed out that it is necessary to examine the behavior of
the solutions $R(r)$ of the radial equation (A-21) at the origin in order to know if
they are really solutions of (A-14).

We shall assume that when $r$ approaches zero, the potential $V(r)$ remains
finite, or at least approaches infinity less rapidly than $1/r$ (this hypothesis is true
in most cases encountered in physics and, in particular, in the case of the Coulomb
potential, to be studied in § C). We shall consider a solution of (A-22) and assume
that it behaves at the origin like $r^s$:

$$R_{k,l}(r) \underset{r \to 0}{\sim} C\, r^s \tag{A-26}$$

Substituting (A-26) into (A-22), and setting the coefficient of the dominant term
equal to zero, we obtain the equation:

$$-s(s+1) + l(l+1) = 0 \tag{A-27}$$

and, consequently:

$$\text{either } s = l \tag{A-28-a}$$

$$\text{or } s = -(l+1) \tag{A-28-b}$$
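
The step leading to (A-27) can be made explicit. The following is a short worked sketch, assuming as in the text that $V(r)$ diverges less rapidly than $1/r$ (and hence less rapidly than the centrifugal term), so that only the kinetic and centrifugal terms contribute to the dominant power $r^{s-2}$:

```latex
\frac{1}{r}\frac{d^2}{dr^2}\left(r \cdot C r^{s}\right) = C\, s(s+1)\, r^{s-2}
\quad\Longrightarrow\quad
\frac{\hbar^2}{2\mu}\left[-s(s+1) + l(l+1)\right] C\, r^{s-2}
+ \left[V(r) - E_{k,l}\right] C\, r^{s} = 0 .
```

Since $r^{s-2}$ dominates as $r \to 0$, its coefficient must vanish, which gives $s(s+1) = l(l+1)$, that is, $(s - l)(s + l + 1) = 0$.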

For a given value of $E_{k,l}$, there are therefore two linearly independent
solutions of the second-order equation (A-22), behaving at the origin like $r^l$ and
$1/r^{l+1}$, respectively. But those which behave like $1/r^{l+1}$ must be rejected, since
it can be shown that $\dfrac{1}{r^{l+1}}\, Y_l^m(\theta, \phi)$ is not a solution of the eigenvalue equation (A-14)
for $r = 0$*. From this, we see that acceptable solutions of (A-24) go to zero at
the origin for all $l$, since:

$$u_{k,l}(r) \underset{r \to 0}{\sim} C\, r^{l+1} \tag{A-29}$$

Consequently, to equation (A-24) must be added the condition:

$$u_{k,l}(0) = 0 \tag{A-30}$$


comment: 

In equation (A-24), $r$, the distance of the particle from the origin, varies only
between $0$ and $+\infty$. However, thanks to condition (A-30), we can assume that we are
actually dealing with a one-dimensional problem, in which the particle can theoretically
move along the entire axis, but in which the effective potential is infinite for all negative
values of the variable. We know that, in such a case, the wave function must be
identically zero on the negative half-axis; condition (A-30) ensures the continuity of the
wave function at $r = 0$.


3. Stationary states of a particle in a central potential 
a. QUANTUM NUMBERS 

We can summarize the results of § 2 as follows: the fact that the potential
$V(r)$ is independent of $\theta$ and $\phi$ makes it possible:

(i) to require the eigenfunctions of $H$ to be simultaneous eigenfunctions
of $\mathbf L^2$ and $L_z$, which determines their angular dependence:

$$\varphi_{k,l,m}(\mathbf r) = R_{k,l}(r)\, Y_l^m(\theta, \phi) = \frac{1}{r}\, u_{k,l}(r)\, Y_l^m(\theta, \phi) \tag{A-31}$$

(ii) to replace the eigenvalue equation of $H$, an equation involving partial
derivatives with respect to $r$, $\theta$, $\phi$, by a differential equation involving only the
variable $r$ and depending on a parameter $l$ [equation (A-24)], with condition (A-30)
imposed.


* This is because the Laplacian of $\dfrac{1}{r^{l+1}}\, Y_l^m(\theta, \phi)$ involves the $l$th derivatives of $\delta(\mathbf r)$ (cf. appendix II,
end of § 4).

These results can be compared with those recalled in § 1-a, of which they are
the quantum mechanical analogues.

In principle, the functions $\varphi_{k,l,m}(r, \theta, \phi)$ must be square-integrable, that is,
normalizable:

$$\int |\varphi_{k,l,m}(r, \theta, \phi)|^2\, r^2\, dr\, d\Omega = 1 \tag{A-32}$$


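Their form (A-31) allows us to separate radial and angular integrations:

$$\int |\varphi_{k,l,m}(r, \theta, \phi)|^2\, r^2\, dr\, d\Omega = \int_0^\infty r^2\, dr\, |R_{k,l}(r)|^2 \int d\Omega\, |Y_l^m(\theta, \phi)|^2 \tag{A-33}$$

But the spherical harmonics $Y_l^m(\theta, \phi)$ are normalized with respect to $\theta$ and $\phi$;
condition (A-32) therefore reduces to:

$$\int_0^\infty r^2\, dr\, |R_{k,l}(r)|^2 = \int_0^\infty dr\, |u_{k,l}(r)|^2 = 1 \tag{A-34}$$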

Actually, we know that it is often convenient to accept eigenfunctions of
the Hamiltonian which are not square-integrable. If the spectrum of $H$ has a
continuous part, we shall require only that the corresponding eigenfunctions be
orthonormalized in the extended sense, that is, that they satisfy a condition of the
form:

$$\int_0^\infty r^2\, dr\, R^*_{k',l}(r)\, R_{k,l}(r) = \int_0^\infty dr\, u^*_{k',l}(r)\, u_{k,l}(r) = \delta(k' - k) \tag{A-35}$$

where $k$ is a continuous index.

In (A-34) and (A-35), the integrals converge at their lower limit, $r = 0$
[condition (A-30)]. This is physically satisfying since the probability of finding the
particle in any volume of finite dimensions is then always finite. It is therefore
only because of the behavior of the wave functions for $r \to \infty$ that, in the case of
a continuous spectrum, the normalization integrals (A-35) diverge if $k = k'$.

Finally, the eigenfunctions of the Hamiltonian $H$ of a particle placed in
a central potential $V(r)$ depend on at least three indices [formula (A-31)]:
$\varphi_{k,l,m}(r, \theta, \phi) = R_{k,l}(r)\, Y_l^m(\theta, \phi)$ is a simultaneous eigenfunction of $H$, $\mathbf L^2$ and $L_z$
with the respective eigenvalues $E_{k,l}$, $l(l+1)\hbar^2$ and $m\hbar$. $k$ is called the radial
quantum number; $l$, the azimuthal quantum number; and $m$, the magnetic quantum
number. The radial part $R_{k,l}(r) = \frac{1}{r}\, u_{k,l}(r)$ of the eigenfunction and the
eigenvalue $E_{k,l}$ of $H$ are independent of the magnetic quantum number and are given
by the radial equation (A-24). The angular part of the eigenfunction depends only
on $l$ and $m$ and not on $k$; it does not depend on the form of the potential $V(r)$.


b. DEGENERACY OF THE ENERGY LEVELS 

Finally, we shall consider the degeneracy of the energy levels, that is, of the
eigenvalues of the Hamiltonian $H$. The $(2l + 1)$ functions $\varphi_{k,l,m}(r, \theta, \phi)$ with $k$
and $l$ fixed and $m$ varying from $-l$ to $+l$ are eigenfunctions of $H$ with the same
eigenvalue $E_{k,l}$ [these $(2l + 1)$ functions are clearly orthogonal, since they correspond
to different eigenvalues of $L_z$]. The level $E_{k,l}$ is therefore at least $(2l + 1)$-fold
degenerate. This degeneracy, which exists for all potentials $V(r)$, is called an
essential degeneracy*: it is due to the fact that the Hamiltonian $H$ contains $\mathbf L^2$,
but not $L_z$, which means that $m$ does not appear in the radial equation. It is also
possible for one of the eigenvalues $E_{k,l}$ of the radial equation corresponding to
a given value of $l$ to be the same as an eigenvalue $E_{k',l'}$ associated with another
radial equation, characterized by $l' \neq l$. This occurs only for certain
potentials $V(r)$. The resulting degeneracies are called accidental (we shall see in § C
that the energy states of the hydrogen atom present accidental degeneracies).
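
The counting of essential and accidental degeneracies can be sketched as follows. This is an illustration only: the spectrum $E_{k,l} = -1/[2(k+l)^2]$ (units with $\hbar = \mu = e^2 = 1$, $k = 1, 2, \ldots$) is the standard bound-state spectrum of the attractive Coulomb potential, assumed here ahead of its derivation in § C, and the function names are hypothetical.

```python
from collections import defaultdict
from fractions import Fraction

def level_degeneracies(levels):
    """Group (k, l, energy) triples by energy; each (k, l) contributes 2l + 1 states."""
    table = defaultdict(int)
    for _k, l, energy in levels:
        table[energy] += 2 * l + 1  # essential degeneracy of E_{k,l}
    return dict(table)

def coulomb_energy(k, l):
    """Assumed spectrum E_{k,l} = -1 / (2 (k + l)^2), in units hbar = mu = e^2 = 1."""
    return Fraction(-1, 2 * (k + l) ** 2)

# All (k, l) pairs with k + l <= 3.
levels = [(k, l, coulomb_energy(k, l))
          for l in range(3) for k in range(1, 4 - l)]
degeneracies = level_degeneracies(levels)
for energy in sorted(degeneracies):
    print(energy, degeneracies[energy])
```

Exact fractions are used so that equal energies coming from different values of $l$ are grouped reliably. Levels with the same $k + l$ coincide, which is the accidental degeneracy; each of them individually carries only the essential factor $2l + 1$.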

We must now show that, for a fixed value of $l$, the radial equation has at
most one physically acceptable solution for each eigenvalue $E_{k,l}$. This actually results
from condition (A-30). The radial equation, since it is a second-order differential
equation, has a priori two linearly independent solutions for each value of $E_{k,l}$.
Condition (A-30) eliminates one of them, so there is at most one acceptable
solution for each value of $E_{k,l}$. We must also consider the behavior of the solutions
for $r$ approaching infinity; if $V(r) \to 0$ when $r \to \infty$, the negative values of $E_{k,l}$
for which the solution we have just chosen is also acceptable at infinity (that is,
bounded) form a discrete set (see example of § C below and complement $B_{VII}$).
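
The way in which the two boundary conditions select a discrete set of energies can be illustrated numerically. The following is a minimal sketch under stated assumptions, not the method of the text: units with $\hbar = \mu = e^2 = 1$, an attractive Coulomb potential $V(r) = -1/r$, outward integration of (A-24) by the Numerov scheme starting from the regular behavior (A-29), and bisection on the energy until the solution stops diverging at a large cutoff radius. The function names, step size, cutoff and energy brackets are all illustrative choices.

```python
def shoot(energy, l, potential, h=0.0025, r_max=25.0):
    """Integrate u'' = g(r) u outward with the Numerov scheme; return u(r_max).

    With hbar = mu = 1 (assumed units), equation (A-24) reads
    u'' = [l(l+1)/r^2 + 2 V(r) - 2 E] u.
    """
    def g(r):
        return l * (l + 1) / (r * r) + 2.0 * (potential(r) - energy)

    c = h * h / 12.0
    # Start from the regular behavior u ~ r^(l+1) near the origin, cf. (A-29).
    r_prev, r_curr = h, 2.0 * h
    u_prev, u_curr = r_prev ** (l + 1), r_curr ** (l + 1)
    g_prev, g_curr = g(r_prev), g(r_curr)
    n_steps = int(round(r_max / h)) - 2
    for _ in range(n_steps):
        r_next = r_curr + h
        g_next = g(r_next)
        u_next = (2.0 * (1.0 + 5.0 * c * g_curr) * u_curr
                  - (1.0 - c * g_prev) * u_prev) / (1.0 - c * g_next)
        u_prev, u_curr = u_curr, u_next
        g_prev, g_curr = g_curr, g_next
        r_curr = r_next
    return u_curr

def bound_state_energy(l, potential, e_low, e_high, iterations=30):
    """Bisect on E: u(r_max) changes sign when E crosses an acceptable value."""
    f_low = shoot(e_low, l, potential)
    if f_low * shoot(e_high, l, potential) > 0:
        raise ValueError("u(r_max) has the same sign at both ends of the bracket")
    for _ in range(iterations):
        e_mid = 0.5 * (e_low + e_high)
        f_mid = shoot(e_mid, l, potential)
        if f_low * f_mid > 0:
            e_low, f_low = e_mid, f_mid
        else:
            e_high = e_mid
    return 0.5 * (e_low + e_high)

def coulomb(r):
    """Assumed potential V(r) = -1/r in units hbar = mu = e^2 = 1."""
    return -1.0 / r

e_first_l0 = bound_state_energy(0, coulomb, -0.7, -0.3)
e_second_l0 = bound_state_energy(0, coulomb, -0.2, -0.08)
e_first_l1 = bound_state_energy(1, coulomb, -0.2, -0.08)
print(e_first_l0, e_second_l0, e_first_l1)
```

For a generic energy the solution that is regular at the origin diverges at large $r$; only near isolated values does it remain bounded. In these units the first two values for $l = 0$ should come out close to $-1/2$ and $-1/8$, and the lowest value for $l = 1$ close to $-1/8$ as well, an instance of the accidental degeneracy mentioned above.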

It follows from the preceding considerations that $H$, $\mathbf L^2$ and $L_z$ constitute
a C.S.C.O.**. If we fix three eigenvalues $E_{k,l}$, $l(l+1)\hbar^2$ and $m\hbar$, there corresponds
to them a single function $\varphi_{k,l,m}(\mathbf r)$. The eigenvalue of $\mathbf L^2$ indicates which equation
yields the radial function; the eigenvalue of $H$ determines this radial function $R_{k,l}(r)$
uniquely, as we have just seen; finally, there exists only one spherical
harmonic $Y_l^m(\theta, \phi)$ for a given $l$ and $m$.


B. MOTION OF THE CENTER OF MASS 

AND RELATIVE MOTION FOR A SYSTEM 
OF TWO INTERACTING PARTICLES 


Consider a system of two spinless particles, of masses $m_1$ and $m_2$ and positions $\mathbf r_1$
and $\mathbf r_2$. We assume that the forces exerted on these particles are derived from
a potential energy $V(\mathbf r_1 - \mathbf r_2)$ which depends only on $\mathbf r_1 - \mathbf r_2$. This is true if there
are no forces originating outside the system (that is, the system is isolated), and if the
interactions between the two particles are derived from a potential. This potential
must depend only on $\mathbf r_1 - \mathbf r_2$, since only the relative positions of the two particles
are involved. We shall show that the study of such a system can be reduced to
that of a single particle placed in the potential $V(\mathbf r)$.


* This essential degeneracy appears whenever the Hamiltonian is rotation-invariant (cf. complement $B_{VI}$). This is why it is encountered in numerous physical problems.

** Actually, we have not proven that these operators are observables, that is, that the set of $\varphi_{k,l,m}(\mathbf{r})$ forms a basis in the state space $\mathscr{E}_{\mathbf{r}}$.

1. Motion of the center of mass and relative motion in classical mechanics


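In classical mechanics, the two-particle system is described by the Lagrangian (cf. appendix III):

$$\mathscr{L}(\mathbf{r}_1, \dot{\mathbf{r}}_1; \mathbf{r}_2, \dot{\mathbf{r}}_2) = T - V = \frac{1}{2} m_1 \dot{\mathbf{r}}_1^2 + \frac{1}{2} m_2 \dot{\mathbf{r}}_2^2 - V(\mathbf{r}_1 - \mathbf{r}_2) \tag{B-1}$$

and the conjugate momenta of the six coordinates of the two particles are the components of the mechanical momenta:

$$\mathbf{p}_1 = m_1 \dot{\mathbf{r}}_1, \qquad \mathbf{p}_2 = m_2 \dot{\mathbf{r}}_2 \tag{B-2}$$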

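The study of the motion of the two particles is simplified by replacing the positions $\mathbf{r}_i$ by the three coordinates of the center of mass (or center of gravity):

$$\mathbf{r}_G = \frac{m_1 \mathbf{r}_1 + m_2 \mathbf{r}_2}{m_1 + m_2} \tag{B-3}$$

and the three relative coordinates*:

$$\mathbf{r} = \mathbf{r}_1 - \mathbf{r}_2 \tag{B-4}$$

Formulas (B-3) and (B-4) can be inverted to yield:

$$\mathbf{r}_1 = \mathbf{r}_G + \frac{m_2}{m_1 + m_2}\,\mathbf{r}, \qquad \mathbf{r}_2 = \mathbf{r}_G - \frac{m_1}{m_1 + m_2}\,\mathbf{r} \tag{B-5}$$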

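The Lagrangian can then be written, in terms of the new variables $\mathbf{r}_G$ and $\mathbf{r}$:

$$\mathscr{L}(\mathbf{r}_G, \dot{\mathbf{r}}_G; \mathbf{r}, \dot{\mathbf{r}}) = \frac{1}{2} m_1 \left[ \dot{\mathbf{r}}_G + \frac{m_2}{m_1 + m_2}\,\dot{\mathbf{r}} \right]^2 + \frac{1}{2} m_2 \left[ \dot{\mathbf{r}}_G - \frac{m_1}{m_1 + m_2}\,\dot{\mathbf{r}} \right]^2 - V(\mathbf{r})$$

$$= \frac{1}{2} M \dot{\mathbf{r}}_G^2 + \frac{1}{2} \mu\, \dot{\mathbf{r}}^2 - V(\mathbf{r}) \tag{B-6}$$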


where:

$$M = m_1 + m_2 \tag{B-7}$$

is the total mass of the system, and:

$$\mu = \frac{m_1 m_2}{m_1 + m_2} \tag{B-8-a}$$


* Definition (B-4) introduces a slight asymmetry between the two particles. 
is its reduced mass (half the harmonic mean of the two masses $m_1$ and $m_2$), which is also given by:

$$\frac{1}{\mu} = \frac{1}{m_1} + \frac{1}{m_2} \tag{B-8-b}$$

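The conjugate momenta of the variables $\mathbf{r}_G$ and $\mathbf{r}$ are obtained by differentiating expression (B-6) with respect to the components of $\dot{\mathbf{r}}_G$ and $\dot{\mathbf{r}}$. Using (B-3), (B-4) and (B-2), we find:

$$\mathbf{p}_G = M \dot{\mathbf{r}}_G = m_1 \dot{\mathbf{r}}_1 + m_2 \dot{\mathbf{r}}_2 = \mathbf{p}_1 + \mathbf{p}_2 \tag{B-9-a}$$

$$\mathbf{p} = \mu\, \dot{\mathbf{r}} = \frac{m_2 \mathbf{p}_1 - m_1 \mathbf{p}_2}{m_1 + m_2} \tag{B-9-b}$$

$$\frac{\mathbf{p}}{\mu} = \frac{\mathbf{p}_1}{m_1} - \frac{\mathbf{p}_2}{m_2} \tag{B-9-c}$$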

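$\mathbf{p}_G$ is the total momentum of the system, and $\mathbf{p}$ is called the relative momentum of the two particles.

We can then express the classical Hamiltonian of the system in terms of the new dynamical variables which we have just introduced:

$$\mathscr{H}(\mathbf{r}_G, \mathbf{p}_G; \mathbf{r}, \mathbf{p}) = \frac{\mathbf{p}_G^2}{2M} + \frac{\mathbf{p}^2}{2\mu} + V(\mathbf{r}) \tag{B-10}$$

From this, the equations of motion can immediately be derived [formulas (27) of appendix III]:

$$\dot{\mathbf{p}}_G = 0 \tag{B-11}$$

$$\dot{\mathbf{p}} = -\nabla V(\mathbf{r}) \tag{B-12}$$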

The first term of expression (B-10) represents the kinetic energy of a fictitious particle whose mass $M$ would be the sum $m_1 + m_2$ of the masses of the two real particles, whose position would be that of the center of mass of the system [formula (B-3)], and whose momentum $\mathbf{p}_G$ would be the total momentum $\mathbf{p}_1 + \mathbf{p}_2$ of the system. Equation (B-11) indicates that this fictitious particle is in uniform rectilinear motion (free particle). This result is well known in classical mechanics: the center of mass of a system of particles moves like a single particle whose mass is the total mass of this system, subjected to the resultant of all the forces exerted on the various particles. Here, this resultant is zero since the only forces present are internal ones obeying the principle of action and reaction.

Since the center of mass is in uniform rectilinear motion with respect to the initially chosen frame, the frame in which it is at rest ($\mathbf{p}_G = 0$) is also an inertial frame. In this center of mass frame, the first term of (B-10) is zero. The classical Hamiltonian, that is, the total energy of the system, then reduces to:

$$\mathscr{H}_r = \frac{\mathbf{p}^2}{2\mu} + V(\mathbf{r}) \tag{B-13}$$

$\mathscr{H}_r$ is the energy associated with the relative motion of the two particles. It is obviously this relative motion that is the most interesting in the study of the two interacting particles. It can be described by introducing a fictitious particle, called the relative particle: its mass is the reduced mass $\mu$ of the two real particles, its position is characterized by the relative coordinates $\mathbf{r}$, and its momentum is the relative momentum $\mathbf{p}$. Since its motion obeys equation (B-12), it behaves as if it were subject to a potential $V(\mathbf{r})$ equal to the potential energy of interaction between the two real particles.

The study of the relative motion of two interacting particles therefore reduces to that of the motion of a single fictitious particle, characterized by formulas (B-4), (B-8) and (B-9-c). This last equation expresses the fact that the velocity $\mathbf{p}/\mu$ of the relative particle is indeed the difference between the velocities of the two particles, that is, their relative velocity.
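
The decomposition of the kinetic energy in (B-6) and the interpretation of $\mathbf{p}/\mu$ as the relative velocity can be checked numerically. The following is only a minimal sketch: the masses and velocities are arbitrary illustrative numbers, and the helper names (`center_of_mass_split`, `dot`) are hypothetical, not taken from the text.

```python
# Numerical check of the change of variables (B-3), (B-4) on arbitrary values.

def center_of_mass_split(m1, m2, v1, v2):
    """Return (M, mu, v_G, v) for two particles with 3D velocities v1 and v2."""
    M = m1 + m2                      # total mass
    mu = m1 * m2 / (m1 + m2)         # reduced mass, (B-8-a)
    v_G = [(m1 * a + m2 * b) / M for a, b in zip(v1, v2)]  # center-of-mass velocity
    v = [a - b for a, b in zip(v1, v2)]                    # relative velocity
    return M, mu, v_G, v

def dot(a, b):
    """Euclidean scalar product of two 3D vectors given as lists."""
    return sum(x * y for x, y in zip(a, b))

# Illustrative values only (arbitrary units).
m1, m2 = 2.0, 3.0
v1, v2 = [1.0, -2.0, 0.5], [-0.5, 4.0, 2.0]
M, mu, v_G, v = center_of_mass_split(m1, m2, v1, v2)

# Kinetic energy written with the original and with the new variables.
kinetic_original = 0.5 * m1 * dot(v1, v1) + 0.5 * m2 * dot(v2, v2)
kinetic_split = 0.5 * M * dot(v_G, v_G) + 0.5 * mu * dot(v, v)

# Relative momentum built from the individual momenta, as in (B-9-b).
p1 = [m1 * a for a in v1]
p2 = [m2 * b for b in v2]
p = [(m2 * a - m1 * b) / M for a, b in zip(p1, p2)]

print(kinetic_original, kinetic_split)
print([pi / mu for pi in p], v)
```

Both printed pairs should agree to rounding error: the two kinetic energies coincide, and $\mathbf{p}/\mu$ reproduces $\mathbf{v}_1 - \mathbf{v}_2$.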


2. Separation of variables in quantum mechanics 

The considerations of the preceding section can easily be transposed to 
quantum mechanics, as we shall now show. 


a. OBSERVABLES ASSOCIATED WITH THE CENTER OF MASS 
AND THE RELATIVE PARTICLE 


The operators $\mathbf{R}_1$, $\mathbf{P}_1$ and $\mathbf{R}_2$, $\mathbf{P}_2$, which describe the positions and momenta of the two particles of the system, satisfy the canonical commutation relations:

$$[X_1, P_{1x}] = i\hbar, \qquad [X_2, P_{2x}] = i\hbar \tag{B-14}$$

with analogous expressions for the components along $Oy$ and $Oz$. All the observables labeled by the index 1 commute with all those of index 2, and all the observables relating to one of the axes $Ox$, $Oy$ or $Oz$ commute with those corresponding to another one of these axes.

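Now let us define the observables $\mathbf{R}_G$ and $\mathbf{R}$ by formulas similar to (B-3) and (B-4):

$$\mathbf{R}_G = \frac{m_1 \mathbf{R}_1 + m_2 \mathbf{R}_2}{m_1 + m_2} \tag{B-15-a}$$

$$\mathbf{R} = \mathbf{R}_1 - \mathbf{R}_2 \tag{B-15-b}$$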

and the observables $\mathbf{P}_G$ and $\mathbf{P}$ by formulas similar to (B-9):

$$\mathbf{P}_G = \mathbf{P}_1 + \mathbf{P}_2 \tag{B-16-a}$$

$$\mathbf{P} = \frac{m_2 \mathbf{P}_1 - m_1 \mathbf{P}_2}{m_1 + m_2} \tag{B-16-b}$$

It is easy to calculate the various commutators of these new observables. The results are as follows:

$$[X_G, P_{Gx}] = i\hbar \tag{B-17-a}$$

$$[X, P_x] = i\hbar \tag{B-17-b}$$

with analogous expressions for the components along $Oy$ and $Oz$; all the other commutators are zero. Consequently, $\mathbf{R}$ and $\mathbf{P}$, like $\mathbf{R}_G$ and $\mathbf{P}_G$, satisfy canonical commutation relations. Moreover, every observable of the set $\{\mathbf{R}, \mathbf{P}\}$ commutes with every observable of the set $\{\mathbf{R}_G, \mathbf{P}_G\}$.

We can also interpret $\mathbf{R}$ and $\mathbf{P}$, on the one hand, and $\mathbf{R}_G$ and $\mathbf{P}_G$, on the other, as being the position and momentum observables of two distinct fictitious particles.
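
The commutators of the new observables follow from bilinearity alone, because each new observable is a linear combination of $X_1$, $X_2$, $P_{1x}$, $P_{2x}$. The sketch below checks this along one axis with exact rational arithmetic. The tuple representation and the function name `bracket` are illustrative assumptions, and the masses are arbitrary.

```python
from fractions import Fraction

def bracket(A, B):
    """Commutator [A, B] in units of i*hbar.

    A and B are linear combinations a1*X1 + a2*X2 + alpha1*P1x + alpha2*P2x,
    stored as tuples (a1, a2, alpha1, alpha2). Only [X_i, P_ix] = i*hbar and
    [P_ix, X_i] = -i*hbar contribute, as stated in (B-14).
    """
    a1, a2, al1, al2 = A
    b1, b2, be1, be2 = B
    return (a1 * be1 - al1 * b1) + (a2 * be2 - al2 * b2)

# Arbitrary illustrative masses (exact rationals).
m1, m2 = Fraction(2), Fraction(3)
M = m1 + m2

X_G = (m1 / M, m2 / M, 0, 0)             # x component of R_G
X = (Fraction(1), Fraction(-1), 0, 0)    # x component of R = R1 - R2
P_G = (0, 0, Fraction(1), Fraction(1))   # x component of P_G = P1 + P2
P = (0, 0, m2 / M, -m1 / M)              # x component of the relative momentum

print(bracket(X_G, P_G), bracket(X, P))  # canonical pairs
print(bracket(X_G, P), bracket(X, P_G))  # cross terms
```

The canonical pairs give 1 (that is, $i\hbar$) and the cross terms vanish, whatever masses are chosen.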

b. EIGENVALUES AND EIGENFUNCTIONS OF THE HAMILTONIAN 

The Hamiltonian operator of the system is obtained from formulas (B-1) and (B-2) and the quantization rules of chapter III:

$$H = \frac{\mathbf{P}_1^2}{2m_1} + \frac{\mathbf{P}_2^2}{2m_2} + V(\mathbf{R}_1 - \mathbf{R}_2) \tag{B-18}$$

Since definitions (B-15) and (B-16) are formally identical to (B-3), (B-4) and (B-9), and since all the momentum operators commute, a simple algebraic calculation yields the equivalent of expression (B-10):

$$H = \frac{\mathbf{P}_G^2}{2M} + \frac{\mathbf{P}^2}{2\mu} + V(\mathbf{R}) \tag{B-19}$$

The Hamiltonian $H$ then appears as the sum of two terms:

$$H = H_G + H_r \tag{B-20}$$

with:

$$H_G = \frac{\mathbf{P}_G^2}{2M} \tag{B-21-a}$$

$$H_r = \frac{\mathbf{P}^2}{2\mu} + V(\mathbf{R}) \tag{B-21-b}$$

which commute, according to the results of §a:

$$[H_G, H_r] = 0 \tag{B-22}$$


$H_G$ and $H_r$ therefore commute with $H$. It follows that there exists a basis of eigenvectors of $H$ which are also eigenvectors of $H_G$ and $H_r$; we shall therefore look for solutions of the system:

$$H_G\,|\varphi\rangle = E_G\,|\varphi\rangle, \qquad H_r\,|\varphi\rangle = E_r\,|\varphi\rangle \tag{B-23}$$

which immediately implies, according to (B-20):

$$H\,|\varphi\rangle = E\,|\varphi\rangle \tag{B-24}$$

with:

$$E = E_G + E_r \tag{B-25}$$

Consider the $\{\,|\mathbf{r}_G, \mathbf{r}\rangle\,\}$ representation, whose basis vectors are the eigenvectors common to the observables $\mathbf{R}_G$ and $\mathbf{R}$. In this representation, a state is characterized by a wave function $\varphi(\mathbf{r}_G, \mathbf{r})$ which is a function of six variables. The action of the operators $\mathbf{R}_G$ and $\mathbf{R}$ is expressed by the multiplication of the wave functions by the variables $\mathbf{r}_G$ and $\mathbf{r}$ respectively. $\mathbf{P}_G$ and $\mathbf{P}$ become the differential operators $\frac{\hbar}{i}\nabla_G$ and $\frac{\hbar}{i}\nabla$ (where $\nabla_G$ denotes the set of three operators $\partial/\partial x_G$, $\partial/\partial y_G$ and $\partial/\partial z_G$). The state space $\mathscr{E}$ of the system can then be considered to be the tensor product $\mathscr{E}_{\mathbf{r}_G} \otimes \mathscr{E}_{\mathbf{r}}$ of the state space $\mathscr{E}_{\mathbf{r}_G}$ associated with the observable $\mathbf{R}_G$ and the space $\mathscr{E}_{\mathbf{r}}$ associated with $\mathbf{R}$. $H_G$ and $H_r$ then appear as the extensions into $\mathscr{E}$ of operators actually acting only in $\mathscr{E}_{\mathbf{r}_G}$ and $\mathscr{E}_{\mathbf{r}}$ respectively. We can therefore, as we saw in §F of chapter II, find a basis of eigenvectors $|\varphi\rangle$ satisfying (B-23), in the form:

$$|\varphi\rangle = |\chi_G\rangle \otimes |\omega_r\rangle \tag{B-26}$$

with:

$$H_G\,|\chi_G\rangle = E_G\,|\chi_G\rangle, \qquad |\chi_G\rangle \in \mathscr{E}_{\mathbf{r}_G} \tag{B-27-a}$$

$$H_r\,|\omega_r\rangle = E_r\,|\omega_r\rangle, \qquad |\omega_r\rangle \in \mathscr{E}_{\mathbf{r}} \tag{B-27-b}$$


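Writing these equations in the $\{\,|\mathbf{r}_G\rangle\,\}$ and $\{\,|\mathbf{r}\rangle\,\}$ representations respectively, we obtain:

$$-\frac{\hbar^2}{2M}\,\Delta_G\,\chi_G(\mathbf{r}_G) = E_G\,\chi_G(\mathbf{r}_G) \tag{B-28-a}$$

$$\left[ -\frac{\hbar^2}{2\mu}\,\Delta + V(\mathbf{r}) \right] \omega_r(\mathbf{r}) = E_r\,\omega_r(\mathbf{r}) \tag{B-28-b}$$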


The first of these equations, (B-28-a), shows that the particle associated with the center of mass of the system is free, as in classical mechanics. We know its solutions: they are, for example, the plane waves:

$$\chi_G(\mathbf{r}_G) = \frac{1}{(2\pi\hbar)^{3/2}}\; e^{\,i\,\mathbf{p}_G \cdot \mathbf{r}_G/\hbar} \tag{B-29}$$

whose energy is equal to:

$$E_G = \frac{\mathbf{p}_G^2}{2M} \tag{B-30}$$

$E_G$ can take on any positive value or zero; it is the kinetic energy corresponding to a translation of the system as a whole.

The more interesting equation from a physical point of view is the second one, (B-28-b), which concerns the relative particle. It describes the behavior of the system of the two interacting particles in the center of mass frame. If the interaction potential of the two real particles depends only on the distance between them, $|\mathbf{r}_1 - \mathbf{r}_2|$, and not on the direction of the vector $\mathbf{r}_1 - \mathbf{r}_2$, the relative particle is subject to a central potential $V(r)$; the problem is then reduced to the one treated in §A.

comment:

The total angular momentum of the system of the two real particles is:

$$\mathbf{J} = \mathbf{L}_1 + \mathbf{L}_2 \tag{B-31}$$

with:

$$\mathbf{L}_1 = \mathbf{R}_1 \times \mathbf{P}_1, \qquad \mathbf{L}_2 = \mathbf{R}_2 \times \mathbf{P}_2 \tag{B-32}$$

It can easily be shown that it can also be written:

$$\mathbf{J} = \mathbf{L}_G + \mathbf{L} \tag{B-33}$$

where:

$$\mathbf{L}_G = \mathbf{R}_G \times \mathbf{P}_G, \qquad \mathbf{L} = \mathbf{R} \times \mathbf{P} \tag{B-34}$$

are the angular momenta of the fictitious particles (according to the results of §a, $\mathbf{L}_G$ and $\mathbf{L}$ satisfy the commutation relations which characterize angular momenta, and the components of $\mathbf{L}$ commute with those of $\mathbf{L}_G$).


C. THE HYDROGEN ATOM 

1. Introduction 


The hydrogen atom consists of a proton, of mass:

$$m_p \simeq 1.7 \times 10^{-27}\ \text{kg} \tag{C-1}$$

and charge:

$$q \simeq 1.6 \times 10^{-19}\ \text{Coulomb} \tag{C-2}$$

and of an electron, of mass:

$$m_e \simeq 0.91 \times 10^{-30}\ \text{kg} \tag{C-3}$$

and charge $-q$. The interaction between these two particles is essentially electrostatic. The corresponding potential energy is:

$$V(r) = -\frac{q^2}{4\pi\varepsilon_0}\,\frac{1}{r} = -\frac{e^2}{r} \tag{C-4}$$

where $r$ denotes the distance between the two particles, and:

$$\frac{q^2}{4\pi\varepsilon_0} = e^2 \tag{C-5}$$

Using the results of §B, we confine ourselves to the study of this system in the center of mass frame. The classical Hamiltonian which describes the relative motion of the two particles is then*:

$$\mathscr{H}(\mathbf{r}, \mathbf{p}) = \frac{\mathbf{p}^2}{2\mu} - \frac{e^2}{r} \tag{C-6}$$

Since $m_p \gg m_e$ [formulas (C-1) and (C-3)], the reduced mass $\mu$ of the system is very close to $m_e$:

$$\mu = \frac{m_e m_p}{m_e + m_p} \simeq m_e \left( 1 - \frac{m_e}{m_p} \right) \tag{C-7}$$

(the correction term $m_e/m_p$ is on the order of 1/1 800). This means that the center of mass of the system is practically in the same place as the proton, and that the relative particle can be identified, to a very good approximation, with the electron. This is why we shall adopt the slightly inaccurate convention of calling the relative particle the electron and the center of mass the proton.
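
As a quick numerical illustration of (C-7), the sketch below evaluates the reduced mass with the more precise mass values quoted in the table of constants at the front of the book. The variable names are illustrative.

```python
# Reduced mass of the electron-proton system, using tabulated masses (kg).
m_e = 9.10953e-31   # electron mass
m_p = 1.67265e-27   # proton mass

mu = m_e * m_p / (m_e + m_p)        # exact reduced mass, (C-7)
ratio = m_e / m_p                   # correction term m_e / m_p
first_order = m_e * (1 - ratio)     # first-order approximation in (C-7)

print(f"m_p / m_e       = {1 / ratio:.2f}")
print(f"mu / m_e        = {mu / m_e:.6f}")
print(f"first order / mu = {first_order / mu:.9f}")
```

The error of the first-order expression is of second order in $m_e/m_p$, that is, a few parts in $10^7$.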


2. The Bohr model 

We shall briefly review the results of the Bohr model which relate to the hydrogen atom. This model, which is based on the concept of a trajectory, is incompatible with the ideas of quantum mechanics. However, it allows us to introduce, in a very simple way, fundamental quantities such as the ionization energy $E_I$ of the hydrogen atom and a parameter which characterizes atomic dimensions (the Bohr radius $a_0$). In addition, it so happens that the energies $E_n$ given by the Bohr theory are the same as the eigenvalues of the Hamiltonian which we shall calculate in §3. Finally, quantum mechanical theory is in agreement with some of the intuitive images of the Bohr model (cf. §4-c-β below).

This semi-classical model is based on the hypothesis that the electron describes a circular orbit of radius $r$ about the proton, obeying the following equations:

$$E = \frac{1}{2}\mu v^2 - \frac{e^2}{r} \tag{C-8}$$

$$\frac{\mu v^2}{r} = \frac{e^2}{r^2} \tag{C-9}$$

$$\mu v r = n\hbar\,; \qquad n \text{ a positive integer} \tag{C-10}$$

The first two equations are classical ones. (C-8) expresses the fact that the total energy $E$ of the electron is the sum of its kinetic energy $\mu v^2/2$ and its potential energy $-e^2/r$. (C-9) is none other than the fundamental equation of Newtonian dynamics ($e^2/r^2$ is the Coulomb force exerted on the electron, and $v^2/r$ is the acceleration of its uniform circular motion). The third equation expresses the quantization condition, introduced empirically by Bohr in order to explain the existence of discrete energy levels: he postulated that only circular orbits satisfying this condition are possible trajectories for the electron. The different orbits, as well as the corresponding values of the various physical quantities, are labeled by the integer $n$ which is associated with them.

* Henceforth, we shall omit the index $r$ which was used in §B to label the quantities corresponding to the relative motion.

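A very simple algebraic calculation then yields the expressions for $E_n$, $r_n$ and $v_n$:

$$E_n = -\frac{1}{n^2}\,E_I \tag{C-11-a}$$

$$r_n = n^2 a_0 \tag{C-11-b}$$

$$v_n = \frac{1}{n}\,v_0 \tag{C-11-c}$$

with:

$$E_I = \frac{\mu e^4}{2\hbar^2} \tag{C-12-a}$$

$$a_0 = \frac{\hbar^2}{\mu e^2} \tag{C-12-b}$$

$$v_0 = \frac{e^2}{\hbar} \tag{C-12-c}$$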
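
That expressions (C-11) and (C-12) indeed solve the three Bohr equations can be verified exactly. The sketch below works in units where $\mu = e = \hbar = 1$ (an assumption made here for convenience, so that $a_0 = 1$, $v_0 = 1$ and $E_I = 1/2$); the function name `bohr_orbit` is illustrative.

```python
from fractions import Fraction

def bohr_orbit(n):
    """Radius, speed and energy of Bohr orbit n in units mu = e = hbar = 1."""
    r = Fraction(n * n)            # (C-11-b): r_n = n^2 a_0, with a_0 = 1
    v = Fraction(1, n)             # (C-11-c): v_n = v_0 / n, with v_0 = 1
    E = Fraction(-1, 2 * n * n)    # (C-11-a): E_n = -E_I / n^2, with E_I = 1/2
    return r, v, E

for n in range(1, 6):
    r, v, E = bohr_orbit(n)
    energy_ok = (E == v * v / 2 - 1 / r)    # (C-8)
    newton_ok = (v * v / r == 1 / r**2)     # (C-9)
    quantum_ok = (v * r == n)               # (C-10)
    print(n, r, v, E, energy_ok, newton_ok, quantum_ok)
```

Using `Fraction` keeps the check exact, so each of the three flags is a strict equality rather than a floating-point comparison.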


When this model was proposed by Bohr, it marked an important step towards the understanding of atomic phenomena, since it yielded the correct values for the energy levels of the hydrogen atom. These values indeed follow the $1/n^2$ law (the Balmer formula) indicated by expression (C-11-a). Moreover, the experimentally measured ionization energy (the energy which must be supplied to the hydrogen atom in its ground state in order to remove the electron) is equal to the numerical value of $E_I$:

$$E_I \simeq 13.6\ \text{eV} \tag{C-13}$$

Finally, the Bohr radius $a_0$ indeed characterizes atomic dimensions:

$$a_0 \simeq 0.52\ \text{Å} \tag{C-14}$$

comment: 

Complement $C_I$ shows how the uncertainty principle, applied to the hydrogen atom, explains the existence of a stable ground state and permits the evaluation of the order of magnitude of its energy and its spatial extension.


3. Quantum mechanical theory of the hydrogen atom 


We shall now take up the question of the determination of the eigenvalues and eigenfunctions of the Hamiltonian $H$ which describes the relative motion of the proton and the electron in the center of mass frame [formula (C-6)]. In the $\{\,|\mathbf{r}\rangle\,\}$ representation, the eigenvalue equation of the Hamiltonian $H$ is written:

$$\left[ -\frac{\hbar^2}{2\mu}\,\Delta - \frac{e^2}{r} \right] \varphi(\mathbf{r}) = E\,\varphi(\mathbf{r}) \tag{C-15}$$

Since the potential $-e^2/r$ is central, we can apply the results of §A: the eigenfunctions $\varphi(\mathbf{r})$ are of the form:

$$\varphi_{k,l,m}(\mathbf{r}) = \frac{1}{r}\,u_{k,l}(r)\,Y_l^m(\theta, \varphi) \tag{C-16}$$

$u_{k,l}(r)$ is given by the radial equation, (A-24), that is:

$$\left[ -\frac{\hbar^2}{2\mu}\frac{d^2}{dr^2} + \frac{l(l+1)\hbar^2}{2\mu r^2} - \frac{e^2}{r} \right] u_{k,l}(r) = E_{k,l}\,u_{k,l}(r) \tag{C-17}$$

We add to this equation condition (A-30):

$$u_{k,l}(0) = 0 \tag{C-18}$$


It can be shown that the spectrum of $H$ includes a discrete part (negative eigenvalues) and a continuous part (positive eigenvalues). Consider figure 3, which shows the effective potential for a given value of $l$ (the figure is drawn for $l \neq 0$, but the reasoning remains valid for $l = 0$).

For a positive value of $E$, the classical motion is not bounded in space: for the value $E > 0$ chosen in figure 3, it is limited on the left by the abscissa of point $A$, but it is not limited on the right*. As a result (cf. complement $M_{III}$), equation (C-17) has acceptable solutions for any $E > 0$. The spectrum of $H$ is therefore continuous for $E > 0$, and the corresponding eigenfunctions are not square-integrable.

On the other hand, for $E < 0$, the classical motion is bounded: it is confined to the region between the abscissas of the two points $B$ and $C$**. We shall see later that equation (C-17) has acceptable solutions only for certain discrete values of $E$. The spectrum of $H$ is therefore discrete for $E < 0$, and the corresponding eigenfunctions are square-integrable.



FIGURE 3

For a positive value of the energy $E$, the classical motion is not bounded. The spectrum of the quantum mechanical Hamiltonian $H$ is therefore continuous for $E > 0$, and the corresponding eigenfunctions are not normalizable. On the other hand, for negative $E$, the classical motion is limited to the interval $BC$. The spectrum of $H$ is therefore discrete for $E < 0$, and the corresponding eigenfunctions are normalizable.

* For a $-1/r$ potential, the classical trajectories are conic sections; unbounded motion follows a hyperbola or a parabola.

** The classical trajectory is then an ellipse or a circle.

a. CHANGE OF VARIABLES

To simplify the reasoning, we shall choose $a_0$ and $E_I$ [formulas (C-12)] as the units of length and energy. That is, we shall introduce the dimensionless quantities:

$$\rho = r/a_0 \tag{C-19}$$

$$\lambda_{k,l} = \sqrt{-E_{k,l}/E_I} \tag{C-20}$$

(the quantity under the radical sign is positive, since we are looking for the bound states).

With expressions (C-12-a) and (C-12-b) for $E_I$ and $a_0$ taken into account, the radial equation (C-17) becomes simply:

$$\left[ \frac{d^2}{d\rho^2} - \frac{l(l+1)}{\rho^2} + \frac{2}{\rho} - \lambda_{k,l}^2 \right] u_{k,l}(\rho) = 0 \tag{C-21}$$


b. SOLVING THE RADIAL EQUATION 


In order to solve equation (C-21), we shall use the method illustrated in complement $C_V$, expanding $u_{k,l}(\rho)$ in a power series.

α. Asymptotic behavior

Let us determine the asymptotic behavior of $u_{k,l}(\rho)$ qualitatively. When $\rho$ approaches infinity, the terms in $1/\rho$ and $1/\rho^2$ become negligible compared to the constant term $\lambda_{k,l}^2$, so that equation (C-21) practically reduces to:

$$\left[ \frac{d^2}{d\rho^2} - \lambda_{k,l}^2 \right] u_{k,l}(\rho) = 0 \tag{C-22}$$

whose solutions are $e^{\pm\rho\lambda_{k,l}}$. This argument is not rigorous, since we have completely neglected the terms in $1/\rho$ and $1/\rho^2$; actually, it can be shown that $u_{k,l}(\rho)$ is equal to $e^{\pm\rho\lambda_{k,l}}$ multiplied by a power of $\rho$.

We shall later be led by physical considerations to require the function $u_{k,l}(\rho)$ to be bounded at infinity, and hence to reject the solutions of (C-21) whose asymptotic behavior is governed by $e^{+\rho\lambda_{k,l}}$. This is why we perform the change of function:

$$u_{k,l}(\rho) = e^{-\rho\lambda_{k,l}}\,y_{k,l}(\rho) \tag{C-23}$$

Although this change of function singles out $e^{-\rho\lambda_{k,l}}$, it clearly does not eliminate solutions in $e^{+\rho\lambda_{k,l}}$, which must be identified and then rejected at the end of the calculation. The differential equation which $y_{k,l}(\rho)$ must satisfy can easily be derived from (C-21):

$$\left\{ \frac{d^2}{d\rho^2} - 2\lambda_{k,l}\frac{d}{d\rho} + \left[ \frac{2}{\rho} - \frac{l(l+1)}{\rho^2} \right] \right\} y_{k,l}(\rho) = 0 \tag{C-24}$$

Condition (C-18) must be associated with this equation, that is:

$$y_{k,l}(0) = 0 \tag{C-25}$$


β. Solutions in the form of power series

Consider the expansion of $y_{k,l}(\rho)$ in powers of $\rho$:

$$y_{k,l}(\rho) = \rho^s \sum_{q=0}^{\infty} c_q\,\rho^q \tag{C-26}$$

By definition, $c_0$ is the first non-zero coefficient of this expansion:

$$c_0 \neq 0 \tag{C-27}$$

Condition (C-25) implies that $s$ is strictly positive.

We calculate $\frac{d}{d\rho}y_{k,l}(\rho)$ and $\frac{d^2}{d\rho^2}y_{k,l}(\rho)$ from (C-26):

$$\frac{d}{d\rho}\,y_{k,l}(\rho) = \sum_{q=0}^{\infty} (q+s)\,c_q\,\rho^{q+s-1} \tag{C-28-a}$$

$$\frac{d^2}{d\rho^2}\,y_{k,l}(\rho) = \sum_{q=0}^{\infty} (q+s)(q+s-1)\,c_q\,\rho^{q+s-2} \tag{C-28-b}$$


To obtain the left-hand side of (C-24), we multiply expressions (C-26), (C-28-a) and (C-28-b) respectively by the factors $\left[ \frac{2}{\rho} - \frac{l(l+1)}{\rho^2} \right]$, $-2\lambda_{k,l}$ and 1. According to (C-24), the series so determined must be identically zero, that is, all its coefficients must be zero.

The lowest order term is in $\rho^{s-2}$. Taking its coefficient as zero, we obtain:

$$\left[ -l(l+1) + s(s-1) \right] c_0 = 0 \tag{C-29}$$

If we take (C-27) into account, we see that $s$ can take on one of two values:

$$s = l + 1 \tag{C-30-a}$$

$$s = -l \tag{C-30-b}$$

(in agreement with the general result of §A-2-c). We have seen that only (C-30-a) gives a behavior at the origin which can lead to an acceptable solution [condition (C-25)]. Setting the coefficient of the general term in $\rho^{q+s-2}$ equal to zero, we obtain (with $s = l + 1$) the following recurrence relation:

$$q(q + 2l + 1)\,c_q = 2\left[ (q + l)\lambda_{k,l} - 1 \right] c_{q-1} \tag{C-31}$$


If we fix $c_0$, this relation enables us to calculate $c_1$, then $c_2$, and thus by recurrence all the coefficients $c_q$. Since $c_q/c_{q-1}$ approaches zero when $q \to \infty$, the corresponding series is convergent for all $\rho$. Thus we have determined, for any value of $\lambda_{k,l}$, the solution of (C-24) which satisfies condition (C-25).
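
Recurrence relation (C-31) is easy to iterate by machine. The following minimal sketch tabulates the coefficients exactly for two illustrative values of $\lambda_{k,l}$ (chosen here only as examples): one for which the bracket in (C-31) never vanishes, and one for which it vanishes at $q = 3$. The function name `series_coefficients` and the choice $c_0 = 1$ are assumptions.

```python
from fractions import Fraction

def series_coefficients(l, lam, q_max, c0=Fraction(1)):
    """Coefficients c_0 .. c_{q_max} of expansion (C-26), with s = l + 1.

    Implements q (q + 2l + 1) c_q = 2 [(q + l) lam - 1] c_{q-1}, i.e. (C-31).
    """
    coeffs = [Fraction(c0)]
    for q in range(1, q_max + 1):
        numerator = 2 * ((q + l) * lam - 1) * coeffs[-1]
        coeffs.append(numerator / (q * (q + 2 * l + 1)))
    return coeffs

# lam = 2/5 with l = 0: the bracket (2q/5 - 1) is never zero for integer q.
generic = series_coefficients(l=0, lam=Fraction(2, 5), q_max=6)

# lam = 1/3 with l = 0: the bracket (q/3 - 1) vanishes at q = 3.
truncated = series_coefficients(l=0, lam=Fraction(1, 3), q_max=6)

print(generic)
print(truncated)
```

In the second case every coefficient from $c_3$ onwards is zero, so the series reduces to a polynomial.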

c. ENERGY QUANTIZATION. RADIAL FUNCTIONS

We are now going to require the preceding solution to have a physically acceptable asymptotic behavior (cf. §b-α above). This will involve quantization of the possible values of $\lambda_{k,l}$.

If the term in brackets on the right-hand side of (C-31) does not go to zero for any integer $q$, expansion (C-26) is a true infinite series, for which:

$$\frac{c_q}{c_{q-1}} \underset{q \to \infty}{\sim} \frac{2\lambda_{k,l}}{q} \tag{C-32}$$

Now, the power series expansion of the function $e^{2\rho\lambda_{k,l}}$ is written:

$$e^{2\rho\lambda_{k,l}} = \sum_{q=0}^{\infty} d_q\,\rho^q, \qquad d_q = \frac{(2\lambda_{k,l})^q}{q!} \tag{C-33}$$

which implies:

$$\frac{d_q}{d_{q-1}} = \frac{2\lambda_{k,l}}{q} \tag{C-34}$$

If we compare (C-32) and (C-34), it is easy to see* that, for large values of $\rho$, the series being considered behaves like $e^{2\rho\lambda_{k,l}}$. The corresponding function $u_{k,l}$ [formula (C-23)] is then proportional to $e^{+\rho\lambda_{k,l}}$, which is not physically acceptable.

Consequently, we must reject all cases in which expansion (C-26) is an infinite series. The only possible values of $\lambda_{k,l}$ are those for which (C-26) has only a finite number of terms, that is, those for which $y_{k,l}$ reduces to a polynomial. The corresponding function $u_{k,l}$ is then physically acceptable, since its asymptotic behavior is dominated by $e^{-\rho\lambda_{k,l}}$. Therefore, all we need is an integer $k$ such that the term in brackets of the right-hand side of (C-31) goes to zero for $q = k$: the corresponding coefficient $c_k$ is then zero, as are all those of higher order, since the fact that $c_k$ is zero means that $c_{k+1}$ is as well, and so on. For fixed $l$, we label the corresponding values of $\lambda_{k,l}$ by this integer $k$ (note that $k$ is greater than or equal to 1, since $c_0$ never goes to zero). We then have, according to (C-31):

$$\lambda_{k,l} = \frac{1}{k + l} \tag{C-35}$$


For a given $l$, the only negative energies possible are therefore [formula (C-20)]:

$$E_{k,l} = \frac{-E_I}{(k + l)^2}\,; \qquad k = 1, 2, 3, \ldots \tag{C-36}$$


We shall discuss this result in §4. 


* The reader can find a fuller discussion relating to an analogous problem in complement $C_V$.

$y_{k,l}$ is therefore a polynomial, whose term of lowest order is in $\rho^{l+1}$ and whose term of highest order is in $\rho^{k+l}$. Its various coefficients can be calculated in terms of $c_0$ by solving recurrence relation (C-31), which can be written, using (C-35):

$$c_q = -\frac{2(k - q)}{q(q + 2l + 1)(k + l)}\,c_{q-1} \tag{C-37}$$

It is easy to show that:

$$c_q = (-1)^q \left( \frac{2}{k + l} \right)^q \frac{(k-1)!}{(k-q-1)!}\;\frac{(2l+1)!}{q!\,(q + 2l + 1)!}\;c_0 \tag{C-38}$$

$u_{k,l}(\rho)$ is then given by formula (C-23), and $c_0$ is determined (to within a phase factor) by normalization condition (A-34) [we must first, of course, return to the variable $r$ by using (C-19)]. Finally, we obtain the true function $R_{k,l}(r)$ by dividing $u_{k,l}(r)$ by $r$. The following three examples give an idea of the form of these radial functions:

$$R_{k=1,\,l=0}(r) = 2\,(a_0)^{-3/2}\,e^{-r/a_0} \tag{C-39-a}$$

$$R_{k=2,\,l=0}(r) = 2\,(2a_0)^{-3/2}\left( 1 - \frac{r}{2a_0} \right) e^{-r/2a_0} \tag{C-39-b}$$

$$R_{k=1,\,l=1}(r) = (2a_0)^{-3/2}\,\frac{1}{\sqrt{3}}\,\frac{r}{a_0}\,e^{-r/2a_0} \tag{C-39-c}$$
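
The normalization of the three radial functions (C-39), in the sense $\int_0^\infty r^2 |R(r)|^2\,dr = 1$, can be checked by numerical quadrature. This is only a sketch: lengths are measured in units of $a_0$ (so `a0 = 1.0`), the integral is cut off at $r = 60\,a_0$, and the composite Simpson routine and all function names are illustrative choices.

```python
import math

def simpson(f, a, b, n=4000):
    """Composite Simpson rule on [a, b] with n (even) subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3

a0 = 1.0  # lengths in units of the Bohr radius

def R_k1_l0(r):
    """(C-39-a)"""
    return 2 * a0 ** -1.5 * math.exp(-r / a0)

def R_k2_l0(r):
    """(C-39-b)"""
    return 2 * (2 * a0) ** -1.5 * (1 - r / (2 * a0)) * math.exp(-r / (2 * a0))

def R_k1_l1(r):
    """(C-39-c)"""
    return (2 * a0) ** -1.5 / math.sqrt(3) * (r / a0) * math.exp(-r / (2 * a0))

# Integrate r^2 |R|^2 from 0 to 60 a0; the neglected tail is exponentially small.
norms = [simpson(lambda r, R=R: r * r * R(r) ** 2, 0.0, 60.0)
         for R in (R_k1_l0, R_k2_l0, R_k1_l1)]
print(norms)
```

The same routine can be used to check that the two $l = 0$ functions are orthogonal, as expected for eigenfunctions of the same radial equation with different eigenvalues.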


4. Discussion of the results 

a. ORDER OF MAGNITUDE OF ATOMIC PARAMETERS 

Formulas (C-36) and (C-39) show that, for the hydrogen atom, the ionization energy $E_I$, defined by (C-12-a), and the Bohr radius, given by (C-12-b), play an important role. These quantities give an order of magnitude of the energies and spatial extensions of the wave functions associated with the bound states of the hydrogen atom.

(C-12-a) and (C-12-b) can be written in the form:

$$E_I = \frac{1}{2}\,\alpha^2 \mu c^2 \tag{C-40-a}$$

$$a_0 = \frac{1}{\alpha}\,\lambdabar_c \tag{C-40-b}$$

where $\alpha$ is the fine structure constant, a dimensionless constant which plays a very important role in physics:

$$\alpha = \frac{e^2}{\hbar c} = \frac{q^2}{4\pi\varepsilon_0 \hbar c} \simeq \frac{1}{137} \tag{C-41}$$

and where $\lambdabar_c$ is defined by:

$$\lambdabar_c = \frac{\hbar}{\mu c} \tag{C-42}$$

Since $\mu$ is almost the same as $m_e$, the rest mass of the electron, $\lambdabar_c$ is practically equal to the Compton wavelength of the electron, which is given by:

$$\frac{\hbar}{m_e c} \simeq 3.8 \times 10^{-3}\ \text{Å} \tag{C-43}$$

Relation (C-40-b) therefore indicates that $a_0$ is on the order of one hundred times the Compton wavelength of the electron. Relation (C-40-a) shows that the order of magnitude of the binding energy of the electron is between $10^{-4}\,\mu c^2$ and $10^{-5}\,\mu c^2$, where $\mu c^2$ is practically equal to the rest energy of the electron:

$$m_e c^2 \simeq 0.51 \times 10^6\ \text{eV} \tag{C-44}$$

It follows that:

$$E_I \ll m_e c^2 \tag{C-45}$$

This justifies our choice of the non-relativistic Schrödinger equation to describe the hydrogen atom. Of course, relativistic effects, although small, do exist; nevertheless, their smallness allows them to be studied by perturbation theory (cf. chap. XI and XII).
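
The orders of magnitude discussed above can be reproduced from the constants tabulated at the front of the book. In the sketch below, the value of $\varepsilon_0$ is an assumption (it is not quoted in the text), and $\mu$ is replaced by $m_e$, which changes the results only at the level of $m_e/m_p$.

```python
import math

# Constants (SI units); all except epsilon_0 are the values tabulated in the book.
hbar = 1.054589e-34          # J s
c = 2.99792458e8             # m / s
q = 1.602189e-19             # C
m_e = 9.10953e-31            # kg
epsilon_0 = 8.854187817e-12  # F / m  (assumed value, not quoted in the text)

e2 = q ** 2 / (4 * math.pi * epsilon_0)   # e^2 as defined in (C-5), in J m
alpha = e2 / (hbar * c)                   # fine structure constant, (C-41)
lambda_c = hbar / (m_e * c)               # (C-43), in m (mu replaced by m_e)
a_0 = lambda_c / alpha                    # Bohr radius, (C-40-b), in m
E_I = 0.5 * alpha ** 2 * m_e * c ** 2     # ionization energy, (C-40-a), in J

E_I_eV = E_I / q                          # joule -> electron volt
rest_energy_eV = m_e * c ** 2 / q

print(f"1/alpha   = {1 / alpha:.3f}")
print(f"lambda_c  = {lambda_c * 1e10:.4e} angstrom")
print(f"a_0       = {a_0 * 1e10:.4f} angstrom")
print(f"E_I       = {E_I_eV:.3f} eV")
print(f"E_I/mc^2  = {E_I_eV / rest_energy_eV:.2e}")
```

The last ratio, $\alpha^2/2$, lies between $10^{-5}$ and $10^{-4}$, which is the quantitative content of inequality (C-45).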

b. ENERGY LEVELS 

α. Possible values of the quantum numbers; degeneracies

For fixed $l$, there exists an infinite number of possible energy values [formula (C-36)], corresponding to $k = 1, 2, 3, \ldots$ Each of them is at least $(2l + 1)$-fold degenerate: this is an essential degeneracy related to the fact that the radial equation depends only on the quantum number $l$ and not on $m$ (§A-3). But, in addition, there exist accidental degeneracies: equation (C-36) indicates that two eigenvalues $E_{k,l}$ and $E_{k',l'}$ corresponding to different radial equations ($l' \neq l$) are equal if $k + l = k' + l'$. Figure 4, in which the first eigenvalues associated with $l = 0, 1, 2$ and 3 are shown on a common energy scale, clearly reveals several accidental degeneracies.

FIGURE 4

Energy levels of the hydrogen atom. The energy $E_n$ of each level depends only on $n$. If $n$ is fixed, several values of $l$ are possible: $l = 0, 1, 2, \ldots, n - 1$. To each of these values of $l$ correspond $(2l + 1)$ possible values for $m$: $m = -l, -l + 1, \ldots, l$. Consequently, the level $E_n$ is $n^2$-fold degenerate.

In the special case of the hydrogen atom, $E_{k,l}$ does not depend on $k$ and $l$ separately, but only on their sum. We set:

$$n = k + l \tag{C-46}$$

The various energy states are labeled by the integer $n$ (greater than or equal to 1), and (C-36) becomes:

$$E_n = -\frac{1}{n^2}\,E_I \tag{C-47}$$

According to (C-46), it is equivalent to specify $k$ and $l$ or $n$ and $l$ to determine the eigenfunctions. Following convention, from now on we shall use the quantum numbers $n$ and $l$. The energy is fixed by $n$, which is called the principal quantum number; a given value of $n$ characterizes what is called an electron shell.

Since $k$ is necessarily an integer which is greater than or equal to 1 (§3-c above), there is only a finite number of values of $l$ associated with the same value of $n$. According to (C-46), if $n$ is fixed, one can have:

$$l = 0, 1, 2, \ldots, n - 1 \tag{C-48}$$

The shell characterized by $n$ is said to contain $n$ sub-shells*, each one corresponding to one of the values of $l$ given in (C-48). Finally, each sub-shell contains $(2l + 1)$ distinct states, associated with the $(2l + 1)$ possible values of $m$ for fixed $l$.

The total degeneracy of the energy level $E_n$ is therefore:

$$g_n = \sum_{l=0}^{n-1} (2l + 1) = 2\,\frac{(n-1)\,n}{2} + n = n^2 \tag{C-49}$$

We shall see in chapter IX that the existence of electron spin multiplies this number by 2 (if we also take into account the proton spin, which is equal to that of the electron, we obtain another factor of 2).
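
The counting that leads to (C-49) can be reproduced by listing the allowed pairs $(l, m)$ explicitly. A minimal sketch, with illustrative function names:

```python
def states_in_shell(n):
    """All (l, m) pairs allowed for principal quantum number n, following (C-48)."""
    return [(l, m) for l in range(n) for m in range(-l, l + 1)]

def degeneracy(n, include_electron_spin=False):
    """Number of states in shell n; the electron spin doubles the count."""
    g = len(states_in_shell(n))
    return 2 * g if include_electron_spin else g

for n in range(1, 5):
    print(n, degeneracy(n), degeneracy(n, include_electron_spin=True))
```

For $n = 1, 2, 3, 4$ the orbital degeneracies are 1, 4, 9 and 16, that is, $n^2$.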


* The concept of a sub-shell can even be found in the semi-classical model of Sommerfeld. This model assigns to each value $n$ of Bohr's quantum number $n$ elliptical orbits of the same energy and different angular momenta. One of these orbits is circular; it is the one which corresponds to the maximum value of the angular momentum.

β. Spectroscopic notation

For historical reasons (dating from the period, before the development of quantum mechanics, in which the study of spectra resulted in an empirical classification of the numerous lines observed), letters of the alphabet are associated with the various values of $l$. The correspondence is as follows:

    l = 0  ↔  s
    l = 1  ↔  p
    l = 2  ↔  d
    l = 3  ↔  f
    l = 4  ↔  g                (C-50)
      ⋮        ⋮  (alphabetical order)

Therefore, spectroscopic notation labels a sub-shell by the corresponding number $n$ followed by the letter which characterizes the value of $l$. Thus, the ground level [which is non-degenerate, according to (C-49)], sometimes called the "K shell", includes only the 1s sub-shell; the first excited level, or "L shell", includes the 2s and 2p sub-shells; the second excited level ("M shell") includes the 3s, 3p and 3d sub-shells, etc. (The capital letters sometimes associated with the successive shells follow an alphabetical order, starting with the letter K.)
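
The labeling rule can be written as a small lookup. This sketch only covers the letters listed explicitly in (C-50); the names `SUBSHELL_LETTERS`, `subshell_label` and `subshells` are illustrative.

```python
SUBSHELL_LETTERS = "spdfg"   # letters listed in (C-50), for l = 0 .. 4

def subshell_label(n, l):
    """Spectroscopic label of the sub-shell (n, l), e.g. (2, 1) -> '2p'."""
    if not 0 <= l < n:
        raise ValueError("l must satisfy 0 <= l <= n - 1")
    if l >= len(SUBSHELL_LETTERS):
        raise ValueError("letter not tabulated in this sketch")
    return f"{n}{SUBSHELL_LETTERS[l]}"

def subshells(n):
    """Labels of all sub-shells of the shell with principal quantum number n."""
    return [subshell_label(n, l) for l in range(n)]

for n in (1, 2, 3):
    print(n, subshells(n))
```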

c. WAVE FUNCTIONS 

The wave functions associated with the eigenstates common to $\mathbf{L}^2$, $L_z$ and the Hamiltonian $H$ of the hydrogen atom are generally labeled, not by the three quantum numbers $k$, $l$, $m$, as we have done until now, but by $n$, $l$ and $m$ [passage from one set to the other simply involves use of relation (C-46)]. Since the operators $H$, $\mathbf{L}^2$ and $L_z$ constitute a C.S.C.O. (cf. §A-3), specification of the three integers $n$, $l$ and $m$, which is equivalent to that of the eigenvalues of $H$, $\mathbf{L}^2$ and $L_z$, unambiguously determines the corresponding eigenfunction $\varphi_{n,l,m}(\mathbf{r})$.

α. Angular dependence

As is the case for any central potential, the functions $\varphi_{n,l,m}(\mathbf{r})$ are products of a radial function and a spherical harmonic $Y_l^m(\theta, \varphi)$. To visualize their angular dependence, we can measure off a distance on the axis characterized by the polar angles $\theta$ and $\varphi$ which is proportional to $|\varphi_{n,l,m}(r, \theta, \varphi)|^2$ for any fixed $r$, that is, proportional to $|Y_l^m(\theta, \varphi)|^2$. Thus, we obtain a surface of revolution about the $Oz$ axis, since we know that $Y_l^m(\theta, \varphi)$ depends on $\varphi$ only through the factor $e^{im\varphi}$ (§D-1-b of chapter VI); consequently, $|Y_l^m(\theta, \varphi)|^2$ is independent of $\varphi$. We can therefore represent its cross-section by a plane containing $Oz$. This is what is done in figure 5, for $m = 0$ and $l = 0$, 1 and 2 [the corresponding spherical harmonics are given in complement $A_{VI}$, formulas (31), (32) and (33)]: $Y_0^0$ is a constant, and is therefore spherically symmetric; $|Y_1^0|^2$ is proportional to $\cos^2\theta$, and $|Y_2^0|^2$, to $(3\cos^2\theta - 1)^2$.
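
The three angular densities for $m = 0$ can be tabulated and their normalization over the unit sphere checked numerically. In this sketch the prefactors $1/4\pi$, $3/4\pi$ and $5/16\pi$ are the standard normalization constants of $Y_0^0$, $Y_1^0$ and $Y_2^0$, supplied here as an assumption since the text only quotes the angular dependence; the function names are illustrative.

```python
import math

def Y00_sq(theta):
    """|Y_0^0|^2: constant, hence spherically symmetric."""
    return 1 / (4 * math.pi)

def Y10_sq(theta):
    """|Y_1^0|^2, proportional to cos^2(theta)."""
    return 3 / (4 * math.pi) * math.cos(theta) ** 2

def Y20_sq(theta):
    """|Y_2^0|^2, proportional to (3 cos^2(theta) - 1)^2."""
    return 5 / (16 * math.pi) * (3 * math.cos(theta) ** 2 - 1) ** 2

def integrate_over_sphere(density, n=2000):
    """Integrate a density independent of the azimuth over all directions.

    Uses the midpoint rule in theta; the azimuthal integral contributes 2*pi.
    """
    h = math.pi / n
    total = sum(density((i + 0.5) * h) * math.sin((i + 0.5) * h) for i in range(n))
    return 2 * math.pi * total * h

for name, f in (("l=0", Y00_sq), ("l=1", Y10_sq), ("l=2", Y20_sq)):
    print(name, integrate_over_sphere(f), f(0.0), f(math.pi / 2))
```

The values printed at $\theta = 0$ and $\theta = \pi/2$ show the anisotropy described in the text: the $l = 1$ density vanishes in the equatorial plane, while the $l = 2$ density vanishes on the cones where $3\cos^2\theta = 1$.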

P- Radial dependence 

The radial functions R n fir), each of which characterizes a sub-shell, can be 
calculated from the results of §3-c, paying attention, however, to the change in 
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Z 



l = 0 
m = 0 



FIGURE 5 

Angular dependence, >7(0, <p), of some 
stationary wave functions of the hydrogen 
atom, corresponding to well-defined values 
of / and m. For each direction of polar 
angles 0 , <p, the value of \Y^(9,q>)\ 2 is 
recorded; a surface of revolution about 
the Oz axis is thus obtained. When / = 0, 
this surface is a sphere centered at O; it 
becomes more complicated for higher 
values of /. 


notation introduced by formula (C-46). Figure 6 represents the variation with 
respect to r of the three radial functions given in (C-39): 

^Je = l,J = 0 = = 1,1 = 0 ! ^k = 2,1 = 0 = R/i = 2,1 = 0 ’ ^*=1,1=1 = = 2,1= 1 (C-51) 

The behavior of $R_{n,l}(r)$ in the neighborhood of $r = 0$ is that of $r^l$ (see discussion of §A-2-c). Consequently, only states belonging to s sub-shells ($l = 0$) give a non-zero probability of presence at the origin. The greater $l$, the larger the region around the proton in which the position probability of the electron is negligible. This has a certain number of physical consequences, particularly in the phenomenon of electron capture by certain nuclei, and in the hyperfine structure of lines (cf. chap. XII, §B-2).



FIGURE 6

Radial dependence $R_{n,l}(r)$ of wave functions associated with the first few levels of the hydrogen atom. When $r \to 0$, $R_{n,l}(r)$ behaves like $r^l$; only the s states (for which $l = 0$) have a non-zero position probability at the origin.
Finally, we can derive formula (C-11-b) for the successive Bohr radii. To do so, consider the various states for which $l = n - 1$ *. We calculate the variation with $r$ of the probability density for each of the preceding levels in an infinitesimal solid angle $d\Omega$ about a fixed direction of polar angles $\theta$ and $\varphi$. In general, the position probability for the electron in the volume element $d^3r = r^2\,dr\,d\Omega$ situated at the point $(r, \theta, \varphi)$ is given by:

$$d^3\mathscr{P}_{n,l,m}(r, \theta, \varphi) = |\varphi_{n,l,m}(r, \theta, \varphi)|^2\, r^2\, dr\, d\Omega = |R_{n,l}(r)|^2\, r^2\, dr \times |Y_l^m(\theta, \varphi)|^2\, d\Omega \tag{C-52}$$


Here, we have fixed $\theta$, $\varphi$ and $d\Omega$. The probability of finding the electron between $r$ and $r + dr$, inside the solid angle under consideration, is then proportional to $r^2 |R_{n,l}(r)|^2\, dr$. The corresponding density is therefore, to within a constant factor, $r^2 |R_{n,l}(r)|^2$ (the factor $r^2$ arises from the expression for the volume element in spherical coordinates). The cases which interest us here are those for which $l = n - 1$, that is, $k = n - l = 1$; §3-c then indicates that the polynomial which enters into $R_{n,l}(r)$ contains only one term in $(r/a_0)^{n-1}$. The desired probability density is therefore proportional to:

$$r^2 \left[ \left( \frac{r}{a_0} \right)^{n-1} e^{-r/na_0} \right]^2 = \frac{r^{2n}}{a_0^{2n-2}}\, e^{-2r/na_0} \tag{C-53}$$

This function has a maximum for:

$$r = r_n = n^2 a_0 \tag{C-54}$$

which is the radius of the Bohr orbit corresponding to the energy $E_n$.
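
As a quick numerical check of (C-54), the sketch below scans the unnormalized density $r^2[(r/a_0)^{n-1}e^{-r/na_0}]^2$ on a uniform grid and locates its maximum. It is only an illustration: lengths are measured in units of $a_0$, and the function names, the grid step and the scan range are choices made here, not part of the text.

```python
import math

def radial_density(r, n, a0=1.0):
    """Unnormalized r^2 |R_{n,n-1}(r)|^2 = r^2 [(r/a0)^(n-1) exp(-r/(n a0))]^2."""
    return r**2 * ((r / a0) ** (n - 1) * math.exp(-r / (n * a0))) ** 2

def argmax_on_grid(n, a0=1.0, step=1e-3):
    """Brute-force scan of a uniform grid (illustrative, not efficient)."""
    r_max = 4 * n * n * a0          # scan well beyond the expected maximum
    best_r, best_val = 0.0, 0.0
    for i in range(1, int(r_max / step) + 1):
        r = i * step
        val = radial_density(r, n, a0)
        if val > best_val:
            best_r, best_val = r, val
    return best_r

for n in (1, 2, 3):
    # The located maximum should sit at n^2 (in units of a0), up to the grid step.
    print(n, round(argmax_on_grid(n), 2), n * n)
```

Setting the derivative of $r^{2n} e^{-2r/na_0}$ to zero gives $2n/r = 2/(na_0)$, that is, $r = n^2 a_0$, which is what the scan reproduces.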

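Finally, the table below gives the expressions for the wave functions of the first energy levels:

1s level:

$$\varphi_{n=1,l=0,m=0} = \frac{1}{\sqrt{\pi a_0^3}}\; e^{-r/a_0}$$

2s level:

$$\varphi_{n=2,l=0,m=0} = \frac{1}{\sqrt{8\pi a_0^3}} \left( 1 - \frac{r}{2a_0} \right) e^{-r/2a_0}$$

2p level:

$$\varphi_{n=2,l=1,m=1} = -\frac{1}{8\sqrt{\pi a_0^3}}\; \frac{r}{a_0}\; e^{-r/2a_0} \sin\theta\; e^{i\varphi}$$

$$\varphi_{n=2,l=1,m=0} = \frac{1}{4\sqrt{2\pi a_0^3}}\; \frac{r}{a_0}\; e^{-r/2a_0} \cos\theta$$

$$\varphi_{n=2,l=1,m=-1} = \frac{1}{8\sqrt{\pi a_0^3}}\; \frac{r}{a_0}\; e^{-r/2a_0} \sin\theta\; e^{-i\varphi}$$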
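
The normalization of the 1s, 2s and 2p ($m = 0$) wave functions, written in their standard form $\frac{1}{\sqrt{\pi a_0^3}}e^{-r/a_0}$, $\frac{1}{\sqrt{8\pi a_0^3}}(1 - r/2a_0)e^{-r/2a_0}$ and $\frac{1}{4\sqrt{2\pi a_0^3}}\frac{r}{a_0}e^{-r/2a_0}\cos\theta$, can be checked numerically. The following is a minimal sketch, assuming lengths in units of $a_0$ and a simple midpoint rule in $r$ and $\theta$ (the integration over $\varphi$ just gives $2\pi$ for these three functions); the grid sizes and function names are illustrative choices.

```python
import math

A0 = 1.0  # lengths in units of the Bohr radius (assumption made for this sketch)

def phi_1s(r, theta):
    return math.exp(-r / A0) / math.sqrt(math.pi * A0**3)

def phi_2s(r, theta):
    return (1 - r / (2 * A0)) * math.exp(-r / (2 * A0)) / math.sqrt(8 * math.pi * A0**3)

def phi_2p0(r, theta):
    return (r / A0) * math.exp(-r / (2 * A0)) * math.cos(theta) / (4 * math.sqrt(2 * math.pi * A0**3))

def norm(phi, r_max=40.0, n_r=1000, n_theta=100):
    """Midpoint-rule estimate of the integral of |phi|^2 r^2 sin(theta) dr dtheta dphi."""
    dr = r_max / n_r
    dth = math.pi / n_theta
    total = 0.0
    for i in range(n_r):
        r = (i + 0.5) * dr
        for j in range(n_theta):
            th = (j + 0.5) * dth
            total += phi(r, th) ** 2 * r * r * math.sin(th)
    return 2 * math.pi * total * dr * dth

for name, phi in (("1s", phi_1s), ("2s", phi_2s), ("2p, m=0", phi_2p0)):
    print(name, round(norm(phi), 3))
```

Each integral should come out close to 1, within the accuracy of the grid.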


* These states correspond to the circular orbits of the Sommerfeld theory (cf note on page 799). 
References and suggestions for further reading:

Particle in a central potential: Messiah (1.17), chap. IX; Schiff (1.18), §16.

The Bohr-Sommerfeld atom and the old quantum theory: Cagnac and Pebay-Peyroula (11.2), chaps. V, VI and XIII; Born (11.4), chap. V, §§1 and 2; Pauling and Wilson (1.9), chap. II; Tomonaga (1.8), Vol. I; Eisberg and Resnick (1.3), chap. 4.

Hydrogen-like wave functions: Levine (12.3), §6.5; Karplus and Porter (12.1), §§3.8 and 3.10; Eisberg and Resnick (1.3), §§7.6 and 7.7.

Degeneracy related to a $1/r$ potential (dynamical group): Borowitz (1.7), §13.7; Schiff (1.18), §30; Bacry (10.31), §6.11.

Mathematical treatment of differential equations : Morse and Feshbach (10.13), 
chaps. 5 and 6; Courant and Hilbert (10.11), vol. I, §V-11. 
COMPLEMENTS OF CHAPTER VII

$A_{VII}$: HYDROGEN-LIKE SYSTEMS

$A_{VII}$: presentation of various hydrogen-like systems to which the calculations of chapter VII can be applied directly. The accent is placed on physical discussions and on the influence of the masses of the particles involved in the system. Simple, advised for a first reading.

$B_{VII}$: A SOLUBLE EXAMPLE OF A CENTRAL POTENTIAL: THE ISOTROPIC THREE-DIMENSIONAL HARMONIC OSCILLATOR

$B_{VII}$: study of another case (a three-dimensional harmonic oscillator) in which it is possible to calculate exactly the energy levels of a particle in a central potential by the method of chapter VII (solution of the radial equation). Not theoretically difficult; may be considered as a worked example.

$C_{VII}$: PROBABILITY CURRENTS ASSOCIATED WITH THE STATIONARY STATES OF THE HYDROGEN ATOM

$C_{VII}$: completes the results of §C-4-c of chapter VII concerning the properties of the stationary states of the hydrogen atom by calculating their probability currents. Short and simple, useful for complement $D_{VII}$.

$D_{VII}$: THE HYDROGEN ATOM PLACED IN A UNIFORM MAGNETIC FIELD. PARAMAGNETISM AND DIAMAGNETISM. THE ZEEMAN EFFECT

$E_{VII}$: STUDY OF SOME ATOMIC ORBITALS. HYBRID ORBITALS

$F_{VII}$: VIBRATIONAL-ROTATIONAL LEVELS OF DIATOMIC MOLECULES

$D_{VII}$, $E_{VII}$, $F_{VII}$: discussion of a certain number of physical phenomena, using the results of chapter VII.

$D_{VII}$: the properties of an atom in a magnetic field (diamagnetism, paramagnetism, Zeeman effect). Moderately difficult, important because of its numerous applications.

$E_{VII}$: complement intended to introduce the concept of an atomic hybrid orbital, essential for the understanding of certain properties of the chemical bond. No theoretical difficulty. Stresses the geometrical aspect of wave functions.

$F_{VII}$: direct application of the theory of chapter VII to the study of the vibrational-rotational spectrum of heteropolar diatomic molecules. Sequel to complements $A_V$ (§1) and $C_{VI}$; moderately difficult.

$G_{VII}$: EXERCISES

$G_{VII}$: exercise 2 studies the influence of a uniform magnetic field on the levels of a simple physical system, in an exactly soluble case. It thus provides a concrete illustration of the general considerations of complements $C_{VII}$ and $D_{VII}$ concerning the influence of the paramagnetic and diamagnetic terms of the Hamiltonian.
Complement $A_{VII}$
HYDROGEN-LIKE SYSTEMS


1. Hydrogen-like systems with one electron

a. Electrically neutral systems

α. Heavy isotopes of hydrogen
β. Muonium
γ. Positronium
δ. Hydrogen-like systems in solid state physics

b. Hydrogen-like ions

2. Hydrogen-like systems without an electron

a. Muonic atoms

b. Hadronic atoms


The calculations of chapter VII, which enabled us to determine numerous 
physical properties of the hydrogen atom (energy levels, spatial distribution of 
the wave functions, etc.) are based on the fact that the system under study is 
composed of two particles (an electron and a proton) whose mutual attraction 
energy is inversely proportional to the distance between them. There exist 
numerous other systems in physics which fulfil this condition: deuterium or 
tritium, muonium, positronium, muonic atoms, etc. The results obtained in 
chapter VII are therefore directly applicable to these examples. All we need to do 
is change the constants introduced in the calculations (masses and charges of the 
two particles). This is what we shall do in this complement, in which we shall 
study, in particular, how the Bohr radius $a_0$ and ionization energy $E_I$ are modified
in each of the systems to be considered. The wave functions associated with
their stationary states and the corresponding energies will then be obtained by
replacing, in formulas (C-39) and (C-47) of chapter VII, $a_0$ and $E_I$ by their new
values, which give the order of magnitude of the spatial extension of the wave
functions and the binding energies of these systems.

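We recall the expressions for $a_0$ and $E_I$:

$$a_0 = \frac{\hbar^2}{\mu e^2} \tag{1}$$

$$E_I = \frac{1}{2}\,\mu c^2 \alpha^2 = \frac{\mu e^4}{2\hbar^2} \tag{2}$$

where $\mu$ is the reduced mass of the electron-proton system:

$$\mu = \mu(\mathrm{H}) = \frac{m_e m_p}{m_e + m_p} \simeq m_e \left( 1 - \frac{m_e}{m_p} \right) \tag{3}$$

and $e^2$ characterizes the intensity of the attractive potential $V(r)$:

$$V(r) = -\frac{e^2}{r} \tag{4}$$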
In the case of hydrogen, we have seen that:

$$a_0(\mathrm{H}) \simeq 0.52\ \text{Å} \tag{5-a}$$

$$E_I(\mathrm{H}) \simeq 13.6\ \text{eV} \simeq 2.2 \times 10^{-18}\ \text{J} \tag{5-b}$$

How can we obtain the corresponding values for a system of two arbitrary particles, of masses $m_1$ and $m_2$, whose attraction energy is:

$$V(r) = -\frac{Z e^2}{r} \tag{6}$$

(where $Z$ is a dimensionless parameter)? All we must do is calculate the reduced mass $\mu$ of the system by replacing $m_e$ and $m_p$ by $m_1$ and $m_2$ in (3):

$$\mu = \frac{m_1 m_2}{m_1 + m_2} \tag{7-a}$$

and substitute the result obtained into (1) and (2), being careful to perform the substitution:

$$e^2 \longrightarrow Z e^2 \tag{7-b}$$

This is what we shall do in a certain number of physical examples.
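
The recipe just described (compute the reduced mass, then replace $e^2$ by $Ze^2$ in the hydrogen formulas $a_0 = \hbar^2/\mu e^2$ and $E_I = \mu e^4/2\hbar^2$) can be sketched in a few lines. This is only an illustration: masses are expressed in units of the electron mass, results are given relative to hydrogen, and the function names as well as the rounded mass ratios (1836.15 for the proton, 207 for the muon) are choices made for this sketch.

```python
M_E = 1.0        # electron mass, used as the unit
M_P = 1836.15    # proton mass in electron masses (rounded)
M_MU = 207.0     # muon mass in electron masses (value quoted in this complement)

def reduced_mass(m1, m2):
    """Reduced mass of a two-body system."""
    return m1 * m2 / (m1 + m2)

MU_H = reduced_mass(M_E, M_P)   # reduced mass of hydrogen

def bohr_radius_ratio(m1, m2, Z=1.0):
    """a0(system) / a0(H), using a0 = hbar^2 / (mu Z e^2)."""
    return MU_H / (reduced_mass(m1, m2) * Z)

def ionization_energy_ratio(m1, m2, Z=1.0):
    """E_I(system) / E_I(H), using E_I = mu Z^2 e^4 / (2 hbar^2)."""
    return reduced_mass(m1, m2) * Z**2 / MU_H

systems = {
    "muonium (mu+, e-)": (M_MU, M_E, 1.0),
    "positronium (e+, e-)": (M_E, M_E, 1.0),
    "muonic hydrogen (p, mu-)": (M_P, M_MU, 1.0),
}
for name, (m1, m2, Z) in systems.items():
    print(name, bohr_radius_ratio(m1, m2, Z), ionization_energy_ratio(m1, m2, Z))
```

Working with ratios avoids committing to particular values of $\hbar$ and $e$; multiplying by $a_0(\mathrm{H})$ and $E_I(\mathrm{H})$ gives absolute numbers.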


1. Hydrogen-like systems with one electron 

a. ELECTRICALLY NEUTRAL SYSTEMS 

α. Heavy isotopes of hydrogen

The physical systems closest to the hydrogen atom are its two isotopes, deuterium and tritium. In these atoms, the proton is replaced by a nucleus having the same charge but possessing either one or two neutrons in addition to the proton. The mass of the deuterium nucleus is approximately $2m_p$, and that of the tritium nucleus, $3m_p$. The reduced masses therefore become, in these two cases:

$$\mu(\text{deuterium}) \simeq m_e \left( 1 - \frac{m_e}{2m_p} \right) \tag{8-a}$$

$$\mu(\text{tritium}) \simeq m_e \left( 1 - \frac{m_e}{3m_p} \right) \tag{8-b}$$

Since:

$$\frac{m_e}{m_p} \simeq \frac{1}{1\,836} \ll 1 \tag{9}$$

it is clear that the reduced masses of deuterium and tritium are very close to that of hydrogen, and that they can be replaced by $m_e$ without great inaccuracy.

If we substitute either (3), or (8-a), or (8-b) into formulas (1) and (2), we see that the Bohr radii and energies of the hydrogen, deuterium and tritium atoms are practically the same. Nevertheless, there are slight differences, of the order of a thousandth in relative value. These differences can be detected experimentally. For example, with an optical spectrograph of sufficient resolution, it can be observed that the wavelengths of the lines emitted by hydrogen atoms are slightly greater than those emitted by deuterium, which are in turn greater than those emitted by tritium. This slight shift in the emitted wavelengths is related to the fact that the nucleus is not infinitely heavy, and does not remain fixed while the electron moves; this is called the “nuclear finite mass effect”. Experiments have verified that formulas (7-a), (1) and (2) account very precisely for this effect.
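
To make the ordering of the wavelengths concrete, the sketch below evaluates the reduced masses exactly (rather than through the first-order expressions) and uses the fact that all level energies scale like $\mu$, so that every wavelength scales like $1/\mu$. It assumes, as in the text, nuclear masses of $m_p$, $2m_p$ and $3m_p$, with the rounded ratio $m_p/m_e = 1836.15$; the function names are illustrative.

```python
M_P = 1836.15  # proton mass in units of the electron mass (rounded)

def reduced_mass(m_nucleus):
    """Reduced mass of an electron (mass 1) bound to a nucleus of mass m_nucleus."""
    return m_nucleus / (1.0 + m_nucleus)

mu_h = reduced_mass(M_P)        # hydrogen
mu_d = reduced_mass(2 * M_P)    # deuterium, nucleus taken as 2 m_p
mu_t = reduced_mass(3 * M_P)    # tritium, nucleus taken as 3 m_p

def wavelength_ratio_to_hydrogen(mu):
    """lambda(isotope) / lambda(H) for a given line: energies scale like mu."""
    return mu_h / mu

print("deuterium:", wavelength_ratio_to_hydrogen(mu_d))
print("tritium:  ", wavelength_ratio_to_hydrogen(mu_t))
```

Both ratios are slightly below 1, with tritium below deuterium, in agreement with the ordering stated above; under these assumptions the relative shifts are a few parts in $10^4$.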

β. Muonium

The muon is a particle whose fundamental properties are the same as those of the electron, except for a difference in mass (the mass $m_\mu$ of the muon is equal to $207\,m_e$). In particular, the muon is not sensitive to nuclear forces (strong interactions). There are two types of muons, the $\mu^-$ and the $\mu^+$, whose charges are respectively equal to those of the electron $e^-$ and the positron $e^+$ *. Like all other charged particles, the muon is sensitive to electromagnetic interactions.

We can therefore consider a physical system formed by a $\mu^+$ muon and an electron $e^-$, in which the electrostatic attraction is the same as for a proton and an electron. Bound states therefore exist. This is, so to speak, a light isotope of hydrogen, in which the $\mu^+$ muon replaces the proton. This “isotope” is called muonium (its atomic mass is on the order of $m_\mu/m_p \simeq 0.1$).

It is not difficult to use the results in chapter VII to find the ionization energy and Bohr radius associated with muonium; (1), (2) and (7) yield:

$$a_0(\text{muonium}) = a_0(\mathrm{H})\, \frac{1 + m_e/m_\mu}{1 + m_e/m_p} \simeq a_0(\mathrm{H}) \left( 1 + \frac{1}{200} \right) \tag{10-a}$$

$$E_I(\text{muonium}) = E_I(\mathrm{H})\, \frac{1 + m_e/m_p}{1 + m_e/m_\mu} \simeq E_I(\mathrm{H}) \left( 1 - \frac{1}{200} \right) \tag{10-b}$$

Since the muon is approximately ten times lighter than the proton, the nuclear finite mass effect is about ten times greater for muonium than for hydrogen; however, since the electron is distinctly lighter than the muon, this effect remains small (on the order of 0.5 %). For example, the wavelengths of the optical lines emitted by muonium should be close to those of the corresponding lines for hydrogen. Actually, the optical emission spectrum of muonium has not yet been observed.

Experimentally, the existence of muonium was revealed by its instability: the $\mu^+$ muon decays, emitting a positron and two neutrinos, and the lifetime of muonium is $2.2 \times 10^{-6}$ seconds. The positron resulting from this decay can be detected. It is emitted preferentially in the direction of the $\mu^+$ muon spin** (non-conservation of parity in weak interactions). Detection of the positrons then leads to the experimental determination of this direction. Since, in addition, the spin of the $\mu^+$ muon of a muonium atom is coupled to that of the electron (hyperfine structure coupling; cf. chap. XII and complements), its precession frequency in a magnetic field is different from that of a free muon. Measurement of this frequency thus reveals the existence of muonium atoms.

* In addition, like $e^-$ and $e^+$, $\mu^-$ and $\mu^+$ are antiparticles of each other.

** Like the electron, the muon has a spin of 1/2, with which is associated a magnetic moment $\mathbf{M}_\mu = \dfrac{q_\mu}{m_\mu}\,\mathbf{S}$.

The study of muonium is of very great interest, theoretically as well as experimentally. The two particles which constitute this system are not subject to strong interactions, so that its energy levels (in particular, the hyperfine structure of the ground state 1s) can be calculated with great precision, without bringing in any “nuclear correction” (for the hydrogen atom, on the other hand, one must take into account the internal structure and polarizability of the proton, which are due to strong interactions). Comparison between theoretical predictions and experimental results provides a very severe test of quantum electrodynamics. Recent measurement of the hyperfine structure of muonium led to one of the best determinations to date of the fine structure constant $\alpha = e^2/\hbar c$.

γ. Positronium

Positronium is a bound system composed of an electron $e^-$ and a positron $e^+$. Like muonium, it can be said by extension to be an isotope of hydrogen, with the proton being replaced by a positron. However, it must be noted that the situation is not quite the same: in the hydrogen atom, the proton (which is much heavier than the electron) remains almost motionless, while in positronium, the positron, the antiparticle of the electron, has the same mass and consequently the same speed as the electron when the center of mass of positronium is fixed (cf. fig. 1-b).

According to (7-a), the reduced mass associated with positronium is:

$$\mu(\text{positronium}) = \frac{m_e}{2} \tag{11}$$

Therefore:

$$a_0(\text{positronium}) \simeq 2\,a_0(\mathrm{H}) \tag{12-a}$$

$$E_I(\text{positronium}) \simeq \frac{1}{2}\,E_I(\mathrm{H}) \tag{12-b}$$

Thus, for a given state of positronium, the average electron-positron distance is twice the electron-proton distance for the corresponding state of the hydrogen atom (cf. fig. 1). The differences between the energies of the stationary states, however, are twice as small, and the optical line spectrum emitted by positronium is obtained by doubling all the wavelengths of that of hydrogen.



FIGURE 1

Schematic representation of the hydrogen (electron + proton system) and positronium (electron + positron system) atoms. Since the proton is much heavier than the electron, it is located practically at the center of mass of the hydrogen atom; the electron “revolves” about the proton at a distance $a_0(\mathrm{H})$. On the other hand, the positron is equal in mass to the electron; these particles therefore both revolve about their center of mass, the distance between them being $a_0(\text{positronium}) = 2a_0(\mathrm{H})$.
COMMENT:

One should not conclude from formula (12-a) that the radius of positronium is twice that of the hydrogen atom. The Bohr radius gives an idea of the extension of the wave functions associated with the “relative particle” (cf. chap. VII, §B), whose position $\mathbf{r} = \mathbf{r}_1 - \mathbf{r}_2$ is related to the distance between the two particles and not to the distance between them and the center of mass G. Figure 1 clearly shows, moreover, that the hydrogen and positronium atoms are of equal size. In general, all hydrogen-like systems for which the attractive potential is given by (6) with $Z = 1$ have exactly the same radius, since formula (B-5) of chapter VII shows that:

$$\mathbf{r}_1 - \mathbf{r}_G = \frac{m_2}{m_1 + m_2}\,\mathbf{r} \tag{13}$$

Using (1), which gives the order of magnitude of the spatial extension of the wave function $\varphi_{1,0,0}(\mathbf{r})$ of the ground state, we see that the “radius” $\rho$ of the atom can be defined by:

$$\rho = \frac{m_2}{m_1 + m_2}\,\frac{\hbar^2}{\mu Z e^2} = \frac{\hbar^2}{m_1 Z e^2} \tag{14}$$

where $m_1$ is the mass of the lighter particle (the heavier particle is found closer to the center of mass). In all the systems considered until now, $Z = 1$ and $m_1 = m_e$; their radii are therefore the same. We shall see cases later in which the radius $\rho$ is smaller, either because $m_1 \neq m_e$ or because $Z \neq 1$.


The optical spectrum of positronium has only been observed very recently [cf. ref. (11-23)]. 
As for the (hyperfine) structure of the ground state (due to the interaction between the 
magnetic moments of the electron and the positron), it has been accurately determined 
(cf. complement $C_{XII}$).

The fact that positronium, like muonium, is a purely electrodynamic system (neither the 
electron nor the positron is sensitive to strong interactions) explains the importance attached 
to its theoretical and experimental study. 

Let us also point out that positronium is an unstable system. Since the ground state is a 1s state, the electron and positron come into contact and annihilate, yielding two or three photons, depending on the hyperfine structure level they started from. The study of the corresponding decay rate is also of great interest in quantum electrodynamics.


δ. Hydrogen-like systems in solid state physics

Atomic physics is not the only domain of application of the theory presented in chapter VII. For example, the “donor atoms” localized in semiconductors constitute approximately hydrogen-like systems in solid state physics. Consider a silicon crystal; in the silicon lattice, each atom uses its four valence electrons to form four tetrahedral bonds with its neighbors. If a pentavalent atom like phosphorus (a donor atom) is introduced into the lattice in place of a silicon atom, it must lose a valence electron, and its overall charge becomes positive. It then behaves like a center which can retain the electron and form a hydrogen-like system with it. Actually, the force acting on the electron cannot be calculated directly from Coulomb’s law in a vacuum since silicon has a large dielectric constant $\varepsilon \simeq 12$, so that (4) must be replaced by:

$$V(r) = -\frac{e^2}{\varepsilon r} \tag{15}$$

To be completely rigorous, we should have to replace the electron mass by the “effective mass” $m^*$ of the electron in silicon, which is different from the free electron mass because of interactions with the charges of the nuclei in the crystal. Nevertheless, we shall confine ourselves to a qualitative discussion, noting that the effect of the high value of $\varepsilon$ is to decrease $e^2$ in (15), that is, according to (1), to increase the Bohr radius by a factor on the order of 10. The donor atom impurity is therefore similar to a very large hydrogen atom, whose wave functions are spread over distances much greater than the unit cell length of the silicon lattice.
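
The orders of magnitude quoted here follow from replacing $e^2$ by $e^2/\varepsilon$ in the hydrogen formulas: the Bohr radius is multiplied by $\varepsilon$ and the ionization energy divided by $\varepsilon^2$. The sketch below evaluates this for $\varepsilon = 12$. It is a rough estimate only: the effective-mass ratio $m^*/m_e$ is left as a parameter and set to 1 by default, which is the simplification made in the qualitative discussion above and not a measured value for silicon; the function name is illustrative.

```python
A0_H = 0.52    # Bohr radius of hydrogen in Angstrom, as quoted in (5-a)
E_I_H = 13.6   # ionization energy of hydrogen in eV, as quoted in (5-b)

def donor_estimates(epsilon, mass_ratio=1.0):
    """Hydrogen-like estimate for a donor electron.

    epsilon    : dielectric constant of the host crystal
    mass_ratio : m*/m_e (assumed 1.0 here, a deliberate simplification)
    Returns (orbit radius in Angstrom, binding energy in eV).
    """
    radius = A0_H * epsilon / mass_ratio
    energy = E_I_H * mass_ratio / epsilon**2
    return radius, energy

radius, energy = donor_estimates(12.0)
print(f"radius ~ {radius:.1f} Angstrom, binding energy ~ {energy:.3f} eV")
```

With these inputs the orbit radius is several Ångströms and the binding energy is of the order of a tenth of an electron volt; a realistic effective mass would change both numbers.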

Let us briefly describe another hydrogen-like system in solid state physics: 
the exciton. Consider a semiconductor crystal. In the absence of external perturbations,
the outside electrons of the atoms forming the crystal are all in states
belonging to the “valence band” (the temperature is assumed to be sufficiently
low; cf. complement $C_{XIV}$). By suitably illuminating the crystal, we can, through
the absorption of a photon, cause an electron to go into the “conduction band” 
(which contains a set of energy levels which are higher than those of the valence 
band). There is then one electron missing from the valence band. We can treat 
this band as if it contained a particle of charge opposite to that of an electron, 
called a “hole”. The hole can attract an electron of the conduction band and form
a bound system with it: the exciton. The exciton, like hydrogen, has a series of
energy levels between which it can undergo transitions. Its existence can be 
demonstrated by a measurement of light absorption by the crystal. 


b. HYDROGEN-LIKE IONS 


The neutral helium atom is composed of two electrons and a positively charged nucleus of charge $-2q_e$. Such a system, which consists of three particles, cannot be studied with the theory of chapter VII. However, if an electron is somehow removed from the helium atom, the $\mathrm{He}^+$ ion is similar to a hydrogen atom; the only differences are in the nuclear charge, which is twice that of the proton (the total charge of the ion is positive and equal to $-q_e$) and its mass (which, for $^4\mathrm{He}$, is approximately four times that of the proton). Of course, there are other hydrogen-like ions: the $\mathrm{Li}^{++}$ ion (the lithium atom, when it is not ionized, has $Z = 3$ electrons), the $\mathrm{Be}^{+++}$ ion ($Z = 4$), etc.

Let us then consider a system formed by a nucleus, of mass $M$ and positive charge $-Zq_e$, and an electron. If we substitute (7-b) in (1) and (2), we obtain:

$$a_0(Z) \simeq \frac{a_0(\mathrm{H})}{Z} \tag{16}$$

$$E_I(Z) \simeq Z^2\, E_I(\mathrm{H}) \tag{17}$$

(since $M \gg m_e$, we have neglected the difference between the reduced mass of hydrogen and that of the hydrogen-like ion under study; the consequences of the nuclear finite mass effect on $a_0$ and $E_I$ are, in effect, negligible compared to those due to the charge variation). Hydrogen-like ions are therefore all smaller than the hydrogen atom, as one would expect since the nucleus and the electron are more strongly bound. Moreover, their energy increases rapidly with $Z$ (quadratically): for example, the energy which must be supplied to a $\mathrm{Li}^{++}$ ion to remove its last electron is greater than 100 eV. This is why the electromagnetic frequencies which can be emitted or absorbed by a hydrogen-like ion fall into the ultraviolet, and even, when $Z$ is large enough, into the domain of X-rays.
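
The $Z^2$ scaling can be illustrated numerically. The sketch below uses $E_I(\mathrm{H}) = 13.6$ eV and the correspondence "1 eV corresponds to a wavelength of about 12 400 Å" from the table of units; the level energies $-Z^2 E_I(\mathrm{H})/n^2$ are the hydrogen ones rescaled, neglecting the nuclear finite mass effect as in the text. The function names are illustrative.

```python
E_I_H = 13.6               # eV, hydrogen ionization energy
HC_EV_ANGSTROM = 12398.5   # eV * Angstrom: wavelength associated with 1 eV

def ionization_energy(Z):
    """Ionization energy (eV) of a one-electron ion of nuclear charge Z."""
    return Z**2 * E_I_H

def transition_wavelength(Z, n_low, n_high):
    """Wavelength (Angstrom) of the n_high -> n_low line, with E_n = -Z^2 E_I / n^2."""
    delta_e = ionization_energy(Z) * (1.0 / n_low**2 - 1.0 / n_high**2)
    return HC_EV_ANGSTROM / delta_e

for name, Z in (("H", 1), ("He+", 2), ("Li++", 3), ("Be+++", 4)):
    print(name, ionization_energy(Z), transition_wavelength(Z, 1, 2))
```

For $\mathrm{Li}^{++}$ this gives an ionization energy of about 122 eV, consistent with the statement that it exceeds 100 eV, and the $2 \to 1$ line moves from about 1 200 Å for hydrogen to wavelengths nine times shorter.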


2. Hydrogen-like systems without an electron 

Thus far, the systems we have considered all include an electron. Nevertheless, 
there exist numerous other particles having the same charge q e , able to form a 
hydrogen-like system with a nucleus of charge — Zq e . We shall give a few 
examples. The “atoms” which we are going to describe here are of course less 
common than the “usual” atoms which appear in Mendeleev’s classification. They 
are unstable, and, in order to study them, it is necessary to use high-energy particle 
accelerators to produce the particles needed for their formation. This is why they 
are sometimes called “exotic atoms”. 


a. MUONIC ATOMS 


We have already mentioned some essential features of the muon and pointed out the existence of muonium. When a $\mu^-$ muon is attracted by a positively charged atomic nucleus, it can form a bound system with it which is called a “muonic atom”*.

Consider, for example, the simplest muonic atom, which is composed of a $\mu^-$ muon and a proton; this is a neutral system whose Bohr radius is:

$$a_0(\mu^-, p^+) \simeq \frac{a_0(\mathrm{H})}{200} \tag{18}$$

and whose ionization energy is:

$$E_I(\mu^-, p^+) \simeq 200\, E_I(\mathrm{H}) \tag{19}$$

The size of this muonic atom is therefore on the order of several thousandths of an Ångström. Its spectrum is obtained from that of hydrogen by dividing the wavelengths by 200; it therefore falls into the domain of soft X-rays.


* We could also imagine a bound system composed of a $\mu^+$ muon and a $\mu^-$ muon. However,
given the low intensity of muon beams which we know how to produce, such an atom is difficult to create
and has never been observed.
What happens if, instead of revolving about a proton, the $\mu^-$ is captured by a nucleus N whose charge is $Z$ times greater, like lead, for example, for which $Z = 82$*? Formulas (1) and (2) then yield:

$$a_0(\mu^-, \mathrm{N}) \simeq \frac{a_0(\mathrm{H})}{200\,Z} \tag{20}$$

$$E_I(\mu^-, \mathrm{N}) \simeq 200\,Z^2\, E_I(\mathrm{H}) \tag{21}$$

Setting $Z = 82$ in these formulas, we find for the transitions of the lead muonic atom energies equal to several MeV (1 MeV = $10^6$ electron volts). However, it must be noted that formulas (1) and (2) are no longer valid in this case, since equation (20) would yield:

$$a_0(\mu^-, \mathrm{Pb}) \simeq 3 \times 10^{-5}\ \text{Å} = 3\ \text{Fermi} \tag{22}$$

that is, a radius slightly smaller than the radius of the lead nucleus. The calculations of chapter VII are therefore no longer valid. This is because they are based on form (6) of the potential $V(r)$, which is correct** only when the particles under study, separated by distances much greater than their dimensions, can be considered to be point particles. This hypothesis, very well satisfied for hydrogen, is not valid in the case studied here.
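
The breakdown of the point-nucleus picture can be seen by putting numbers into $a_0(\mathrm{H})/(200\,Z)$ and $200\,Z^2 E_I(\mathrm{H})$ for lead. In the sketch below, the comparison radius of the nucleus is obtained from the empirical rule $R \simeq 1.2\,A^{1/3}$ Fermi with $A = 208$; this rule and the value of $A$ are assumptions introduced here for illustration and do not appear in the text. The function names are likewise illustrative.

```python
A0_H_ANGSTROM = 0.52     # Bohr radius of hydrogen, as quoted in (5-a)
E_I_H_EV = 13.6          # ionization energy of hydrogen, as quoted in (5-b)
MASS_RATIO = 200         # m_mu / m_e, rounded as in the text
FM_PER_ANGSTROM = 1e5    # 1 Angstrom = 1e5 Fermi

def muonic_bohr_radius_fm(Z):
    """Point-nucleus Bohr radius of a muonic atom, in Fermi."""
    return A0_H_ANGSTROM * FM_PER_ANGSTROM / (MASS_RATIO * Z)

def muonic_ionization_energy_mev(Z):
    """Point-nucleus ionization energy of a muonic atom, in MeV."""
    return MASS_RATIO * Z**2 * E_I_H_EV / 1e6

def nuclear_radius_fm(A, r0=1.2):
    # Assumption (not from the text): empirical rule R ~ r0 * A^(1/3), r0 in Fermi.
    return r0 * A ** (1.0 / 3.0)

print("a0(mu-, Pb)  ~", muonic_bohr_radius_fm(82), "Fermi")
print("E_I(mu-, Pb) ~", muonic_ionization_energy_mev(82), "MeV")
print("R(Pb)        ~", nuclear_radius_fm(208), "Fermi")
```

The computed Bohr radius comes out at about 3 Fermi, below the estimated nuclear radius, so the muon spends much of its time inside the charge distribution and the pure $-Ze^2/r$ potential cannot be trusted.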

However, (20) and (21) give the correct order of magnitude of the energies and radius of the lead muonic atom. The physical consequences of the existence of a non-zero spatial extension of the nucleus (“volume effect”) will be studied in greater detail in complement $D_{XI}$. We take this occasion to note, however, that one of the reasons for interest in muonic atoms is precisely related to this type of effect: the $\mu^-$ muon “explores”, as it were, the internal structure of the nucleus***, and the energy levels of muonic atoms depend on the electrical charge distribution and magnetism inside the nucleus (recall that the muon is not sensitive to nuclear forces). Thus, the study of these states can furnish information which is very useful in nuclear physics.


b. HADRONIC ATOMS 

“Hadrons” are those particles which are subject to strong interactions, as 
opposed to “leptons”, which are not. Electrons and muons, whose bound states 
in a Coulomb potential we have studied thus far, are leptons. Protons, neutrons 
and mesons such as the $\pi$ meson, etc. are hadrons. Of the latter, those which are
negatively charged can form a hydrogen-like bound system with an atomic nucleus, 


* Such a system can be formed by directing a $\mu^-$ beam onto a target of lead atoms. When a $\mu^-$
is captured by a lead nucleus, it revolves about it at a distance which is about 200 times smaller
than the distance to the electrons of the innermost shell of the atom. The nuclear charge is therefore
practically the only one to affect the muon. Thus, in studying the states of the muonic atom, we can simply
ignore the electrons.

** Inside the nucleus, the potential is approximately parabolic (cf. complements $A_V$, §4 and $D_{XI}$).

*** The concept of the impenetrability of two solid bodies is macroscopic. In quantum mechanics, 
nothing prevents the wave functions of two particles of different nature from overlapping. 
called a “hadronic atom”. For example, the nucleus-$\pi^-$ meson system yields a “pionic atom”; the nucleus-$\Sigma^-$ particle system, a “sigmaonic atom”*; the nucleus-$K^-$ meson system, a “kaonic atom”; the nucleus-antiproton system, an “antiprotonic atom”, etc. All of the systems just cited have actually been observed and studied. They are all unstable, but their lifetimes are long enough for us to observe some of their spectral lines. Hydrogen atom theory, which takes into account only the electrostatic interaction between the two particles under consideration, does not, of course, apply to such systems, in which strong interactions play an important role. However, since they are of very short range, strong interactions can be neglected in the study of excited states of the hadronic atom (other than the s states), in which the two particles are far apart. The theory of chapter VII is then applicable, as are formulas (1) and (2), which, in all these cases, lead to much smaller Bohr radii and much greater energies than for hydrogen. Thus, measurement of the spectral frequencies emitted by pionic atoms gives a very precise determination of the mass of the $\pi^-$ meson.


References and suggestions for further reading: 

Exotic atoms: see the subsection “Exotic atoms” of section 11 of the bibliography; 
see also Cagnac and Pebay-Peyroula (11.2), chap. XIX, §7; Weissenberg (16.19), chap. 4, 
§2 and chap. 6. 

Excitons: Kittel (13.2), chap. 17, p. 538; Ziman (13.3), §6.7. 


* The term “mesic atom” is sometimes used to denote a system involving a meson. Similarly,
since the $\Sigma^-$ is a hyperon (a particle heavier than the proton), the sigmaonic atom is sometimes called a
“hyperonic atom”.
Complement $B_{VII}$

A SOLUBLE EXAMPLE OF A CENTRAL POTENTIAL:

THE ISOTROPIC THREE-DIMENSIONAL HARMONIC OSCILLATOR


1. Solving the radial equation 

2. Energy levels and stationary wave functions 


In this complement we shall examine a special case of a central potential for which the radial equation is exactly soluble: the isotropic three-dimensional harmonic oscillator. We have already treated this problem (complement $E_V$) by considering the state space $\mathscr{E}_{\mathbf{r}}$ as the tensor product $\mathscr{E}_x \otimes \mathscr{E}_y \otimes \mathscr{E}_z$; this amounts, in the $\{ |\mathbf{r}\rangle \}$ representation, to separating the variables in Cartesian coordinates. We thus obtained three differential equations, one in the $x$-variable, one in $y$, and the third in $z$. Here we intend to seek the stationary states which are also eigenstates of $\mathbf{L}^2$ and $L_z$ by separating the variables in polar coordinates. We shall then indicate how the two bases of $\mathscr{E}_{\mathbf{r}}$ obtained by these two different methods are related to each other.

We shall also study, in complement $A_{VIII}$, the stationary states of well-defined angular momentum of a free particle. This can be considered to be another special case of a central potential [$V(r) \equiv 0$] which leads to an exactly soluble radial equation.

A three-dimensional harmonic oscillator is composed of a (spinless) particle of mass $\mu$ subjected to the potential:

$$V(x, y, z) = \frac{\mu}{2} \left[ \omega_x^2 x^2 + \omega_y^2 y^2 + \omega_z^2 z^2 \right] \tag{1}$$

where $\omega_x$, $\omega_y$ and $\omega_z$ are real positive constants. The oscillator is said to be isotropic if:

$$\omega_x = \omega_y = \omega_z = \omega \tag{2}$$

Since the potential (1) is the sum of a function of $x$ alone, a function of $y$ alone and a function of $z$ alone, we can solve the eigenvalue equation of the Hamiltonian:

$$H = \frac{\mathbf{P}^2}{2\mu} + V(\mathbf{R}) \tag{3}$$

by separating the variables $x$, $y$ and $z$ in the $\{ |\mathbf{r}\rangle \}$ representation. This is what was done in complement $E_V$. The energy levels, for an isotropic oscillator, are then found to be of the form:

$$E_n = \left( n + \frac{3}{2} \right) \hbar\omega \tag{4}$$

where $n$ is any positive integer or zero. The degree of degeneracy $g_n$ of the level $E_n$ is equal to:

$$g_n = \frac{1}{2}(n + 1)(n + 2) \tag{5}$$

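and the associated eigenfunctions are:

$$\varphi_{n_x, n_y, n_z}(x, y, z) = \left( \frac{\beta^2}{\pi} \right)^{3/4} \frac{1}{\sqrt{2^{\,n_x + n_y + n_z}\; n_x!\; n_y!\; n_z!}}\; e^{-\frac{\beta^2}{2}(x^2 + y^2 + z^2)}\; H_{n_x}(\beta x)\, H_{n_y}(\beta y)\, H_{n_z}(\beta z) \tag{6}$$

with:

$$\beta = \sqrt{\frac{\mu\omega}{\hbar}} \tag{7}$$

[$H_p(u)$ denotes the Hermite polynomial of degree $p$; cf. complement $B_V$]. $\varphi_{n_x, n_y, n_z}$ is an eigenfunction of the Hamiltonian $H$ with the eigenvalue $E_n$ such that:

$$n = n_x + n_y + n_z \tag{8}$$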
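
The degeneracy $g_n = \frac{1}{2}(n+1)(n+2)$ is simply the number of triples of non-negative integers $(n_x, n_y, n_z)$ whose sum is $n$. A minimal sketch of this counting, with an illustrative function name, is:

```python
def count_states(n):
    """Number of triples (nx, ny, nz) of non-negative integers with nx + ny + nz = n."""
    count = 0
    for nx in range(n + 1):
        for ny in range(n + 1 - nx):
            # nz = n - nx - ny is then fixed and non-negative
            count += 1
    return count

for n in range(6):
    print(n, count_states(n), (n + 1) * (n + 2) // 2)
```

For each value of $n_x$ there are $n - n_x + 1$ possible values of $n_y$, and summing these from $n_x = 0$ to $n$ gives $(n+1)(n+2)/2$.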

If the oscillator under consideration is isotropic*, the potential (1) is a function only of the distance $r$ between the particle and the origin:

$$V(r) = \frac{1}{2}\,\mu\omega^2 r^2 \tag{9}$$

Consequently, the three components of the orbital angular momentum $\mathbf{L}$ are constants of the motion. We want to find the common eigenstates of $H$, $\mathbf{L}^2$ and $L_z$. To do so, we could proceed, as in complement $D_{VI}$, by introducing operators related to right and left circular quanta and to “longitudinal” quanta corresponding to the third degree of freedom along Oz (an outline of this method is given at the end of this complement). However, we prefer to use this example to illustrate the method elaborated in chapter VII (§A) and solve the radial equation by the polynomial method.


1. Solving the radial equation 


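For a fixed value of the quantum number l, the radial functions $R_{k,l}(r)$ and
energies $E_{k,l}$ are given by the equation:

$$\left[-\frac{\hbar^2}{2\mu}\,\frac{1}{r}\frac{d^2}{dr^2}\,r + \frac{1}{2}\mu\omega^2 r^2 + \frac{l(l+1)\hbar^2}{2\mu r^2}\right] R_{k,l}(r) = E_{k,l}\, R_{k,l}(r) \tag{10}$$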


* Separation of the polar variables $r$, $\theta$, $\varphi$ is possible only for an isotropic oscillator.

We set:

$$R_{k,l}(r) = \frac{1}{r}\, u_{k,l}(r) \tag{11-a}$$

$$\varepsilon_{k,l} = \frac{2\mu E_{k,l}}{\hbar^2} \tag{11-b}$$

Equation (10) then becomes:

$$\left[\frac{d^2}{dr^2} - \beta^4 r^2 - \frac{l(l+1)}{r^2} + \varepsilon_{k,l}\right] u_{k,l}(r) = 0 \tag{12}$$


[where $\beta$ is the constant defined in (7)]. We must add the condition at the origin:

$$u_{k,l}(0) = 0 \tag{13}$$

For large r, (12) virtually reduces to:

$$\left[\frac{d^2}{dr^2} - \beta^4 r^2\right] u_{k,l}(r) \underset{r\to\infty}{\simeq} 0 \tag{14}$$

The asymptotic behavior of the solutions of equation (12) is therefore dominated
by $e^{\beta^2 r^2/2}$ or $e^{-\beta^2 r^2/2}$. Only the second possibility is physically acceptable. This
leads us to the change of functions:

$$u_{k,l}(r) = e^{-\beta^2 r^2/2}\, y_{k,l}(r) \tag{15}$$


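It is easy to find that $y_{k,l}(r)$ must satisfy:

$$\frac{d^2}{dr^2}y_{k,l} - 2\beta^2 r\,\frac{d}{dr}y_{k,l} + \left[\varepsilon_{k,l} - \beta^2 - \frac{l(l+1)}{r^2}\right] y_{k,l} = 0 \tag{16-a}$$

$$y_{k,l}(0) = 0 \tag{16-b}$$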


Now we shall seek $y_{k,l}(r)$ in the form of a power series in r:

$$y_{k,l}(r) = r^s \sum_{q=0}^{\infty} a_q\, r^q \tag{17}$$

where, by definition, $a_0$ is the coefficient of the first non-zero term:

$$a_0 \neq 0 \tag{18}$$

When we substitute expansion (17) into equation (16-a), the term of lowest order
is in $r^{s-2}$; its coefficient is zero if:

$$[s(s - 1) - l(l + 1)]\, a_0 = 0 \tag{19}$$

With conditions (18) and (16-b) taken into account, the only way to satisfy 
relation (19) is to choose: 

$$s = l + 1 \tag{20}$$

(this result could have been predicted; cf. §A-2-c of chapter VII). The next term
in the expansion of equation (16-a) is in $r^{s-1}$, and its coefficient is equal to:

$$[s(s + 1) - l(l + 1)]\, a_1 \tag{21}$$

Since s is already fixed by (20), this coefficient can go to zero only if:

$$a_1 = 0 \tag{22}$$

Finally, let us set the coefficient of the general term in $r^{q+s}$ equal to zero:

$$[(q + s + 2)(q + s + 1) - l(l + 1)]\, a_{q+2} + [\varepsilon_{k,l} - \beta^2 - 2\beta^2 (q + s)]\, a_q = 0 \tag{23}$$

that is, using (20):

$$(q + 2)(q + 2l + 3)\, a_{q+2} = [(2q + 2l + 3)\beta^2 - \varepsilon_{k,l}]\, a_q \tag{24}$$


We therefore obtain a recurrence relation for the coefficients a q of expansion (17). 

Note, first of all, that this recurrence relation, combined with result (22),
implies that all coefficients $a_q$ of odd rank q are zero. As for the coefficients of
even rank, they must all be proportional to $a_0$. If the value of $\varepsilon_{k,l}$ is such that no
integer q makes the term in brackets on the right-hand side of (24) go to zero, we find
the solution $y_{k,l}$ of (16) in the form of an infinite power series, for which:

$$\frac{a_{q+2}}{a_q} \underset{q\to\infty}{\sim} \frac{2\beta^2}{q} \tag{25}$$

This behavior is the same as that of the coefficients appearing in the expansion
of the function $e^{\beta^2 r^2}$, since:

$$e^{\beta^2 r^2} = \sum_{p=0}^{\infty} c_{2p}\, r^{2p} \tag{26}$$

with:

$$c_{2p} = \frac{\beta^{2p}}{p!} \tag{27}$$

and, consequently:

$$\frac{c_{2p+2}}{c_{2p}} \underset{p\to\infty}{\sim} \frac{\beta^2}{p} \tag{28}$$

Since it is 2p which corresponds to the even integer q of the expansion of $y_{k,l}$, (28) is
indeed identical to (25). From this, we can see that if (17) really contains an infinite
number of terms, the asymptotic behavior of $y_{k,l}$ is dominated by $e^{\beta^2 r^2}$, which
renders this function physically unacceptable [cf. relation (15)].

The only cases which are interesting from a physical point of view are
therefore those in which there exists an even integer k, positive or zero, such that:

$$\varepsilon_{k,l} = (2k + 2l + 3)\beta^2 \tag{29}$$

Recurrence relation (24) indicates that the coefficients of even rank greater than k
are then zero. Since all the coefficients of odd rank are also zero, expansion (17)
reduces to a polynomial, and the radial function $u_{k,l}(r)$ given by (15) decreases
exponentially as r goes to infinity.
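The truncation mechanism can be made concrete by iterating recurrence (24) with exact rational arithmetic. The sketch below works in units where $\beta = 1$ and $a_0 = 1$ (both are assumptions made for illustration), and the function name is hypothetical. It returns the even-rank coefficients $a_0, a_2, a_4, \ldots$ for $\varepsilon_{k,l} = 2k + 2l + 3$, as in (29).

```python
from fractions import Fraction


def series_coefficients(k, l, n_terms=6):
    """Even-rank coefficients a_0, a_2, ... of expansion (17), from recurrence (24).

    Units: beta = 1 and a_0 = 1 (illustrative normalization).
    epsilon is fixed to (2k + 2l + 3), i.e. relation (29).
    """
    eps = 2 * k + 2 * l + 3
    coeffs = [Fraction(1)]
    q = 0
    for _ in range(n_terms - 1):
        # (q + 2)(q + 2l + 3) a_{q+2} = [(2q + 2l + 3) - eps] a_q
        factor = Fraction((2 * q + 2 * l + 3) - eps, (q + 2) * (q + 2 * l + 3))
        coeffs.append(factor * coeffs[-1])
        q += 2
    return coeffs


if __name__ == "__main__":
    print(series_coefficients(2, 0))  # terminates after a_2
    print(series_coefficients(1, 0))  # odd k: the series never terminates
```

With k even, the bracket in (24) vanishes at q = k and every later coefficient is zero. With k odd, no even q cancels the bracket, so the series stays infinite, which is the physically unacceptable case discussed above.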


2. Energy levels and stationary wave functions 

Using definitions (7) and (11-b), relation (29) gives the energies $E_{k,l}$ associated
with a given value of l:

$$E_{k,l} = \hbar\omega\left(k + l + \frac{3}{2}\right) \tag{30}$$

where k is any even positive integer or zero. Since $E_{k,l}$ actually depends only on the
sum:

$$n = k + l \tag{31}$$

accidental degeneracies appear: the energy levels of the isotropic three-dimensional
harmonic oscillator are of the form:

$$E_n = \left(n + \frac{3}{2}\right)\hbar\omega \tag{32}$$

l is any positive integer or zero, and k is any even positive integer or zero; n can
therefore take on all integral values, positive or zero. This is in agreement with
result (4).

We shall fix an energy $E_n$, that is, an integer n, positive or zero. The values
of k and l which can be associated with it according to (31) are the following:

$$(k, l) = (0, n),\ (2, n - 2),\ \ldots,\ (n - 2, 2),\ (n, 0) \quad \text{if } n \text{ is even} \tag{33-a}$$

$$(k, l) = (0, n),\ (2, n - 2),\ \ldots,\ (n - 3, 3),\ (n - 1, 1) \quad \text{if } n \text{ is odd} \tag{33-b}$$

From this, we can immediately get the values of l associated with the first values
of n:

n = 0 : l = 0

n = 1 : l = 1

n = 2 : l = 0, 2 (34)

n = 3 : l = 1, 3

n = 4 : l = 0, 2, 4

Figure 1 represents, with the same conventions as for the hydrogen atom (cf. figure 4 
of chapter VII), the lowest energy levels of an isotropic three-dimensional harmonic 
oscillator. 

For each pair (k, l), there exists one and only one radial function $u_{k,l}(r)$, that
is, (2l + 1) common eigenfunctions of H, $\mathbf{L}^2$ and $L_z$:

$$\varphi_{k,l,m}(\mathbf{r}) = \frac{1}{r}\, u_{k,l}(r)\, Y_l^m(\theta, \varphi) \tag{35}$$

FIGURE 1

Lowest energy levels of the three-dimensional harmonic oscillator. When n is even, l can take on
$\frac{n}{2} + 1$ values: l = n, n − 2, ..., 0. When n is odd, l can take on $\frac{n+1}{2}$ values: l = n, n − 2, ..., 1.
With the possible values of m ($-l \leq m \leq l$) taken into account, the degree of degeneracy of the
level $E_n$ is $\frac{(n+1)(n+2)}{2}$.


Consequently, the degree of degeneracy of the energy $E_n$ under consideration is
equal to:

$$g_n = \sum_{l = 0, 2, \ldots, n} (2l + 1) \quad \text{if } n \text{ is even} \tag{36-a}$$

$$g_n = \sum_{l = 1, 3, \ldots, n} (2l + 1) \quad \text{if } n \text{ is odd} \tag{36-b}$$

These sums are simple to calculate, and we again obtain result (5):

$$\text{for even } n: \quad g_n = \sum_{p=0}^{n/2} (4p + 1) = \frac{1}{2}(n + 1)(n + 2) \tag{37-a}$$

$$\text{for odd } n: \quad g_n = \sum_{p=0}^{(n-1)/2} (4p + 3) = \frac{1}{2}(n + 1)(n + 2) \tag{37-b}$$
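The sums over l can also be evaluated directly from the list of allowed pairs (k, l) of (33). This is a minimal Python sketch under the conventions of this section (k even, k + l = n); the helper names are illustrative only.

```python
def allowed_pairs(n):
    """Pairs (k, l) with k even, k >= 0, l >= 0 and k + l = n, as in (33)."""
    return [(k, n - k) for k in range(0, n + 1, 2)]


def degeneracy_from_l(n):
    """Sum of (2l + 1) over the allowed values of l, as in (36)."""
    return sum(2 * l + 1 for _, l in allowed_pairs(n))


if __name__ == "__main__":
    for n in range(5):
        # Each row: n, allowed (k, l) pairs, degeneracy obtained from the sum
        print(n, allowed_pairs(n), degeneracy_from_l(n))
```

For n = 4, for instance, the pairs are (0, 4), (2, 2) and (4, 0), and 9 + 5 + 1 = 15 = (5)(6)/2, in agreement with (37-a).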

For each of the pairs (k, l) given in (33), the results of §1 enable us to determine
the corresponding radial function $u_{k,l}(r)$ (to within the factor $a_0$) and, therefore,
the (2l + 1) common eigenfunctions of H and $\mathbf{L}^2$, of eigenvalues $E_n$ and $l(l + 1)\hbar^2$.
We shall calculate, for example, the wave functions associated in this way with the
three lowest energy levels.

For the ground state $E_0 = \frac{3}{2}\hbar\omega$, we must have:

$$k = l = 0 \tag{38}$$

$y_{0,0}(r)$ then reduces to $a_0 r$. If we choose $a_0$ to be real and positive, the normalized
function $\varphi_{k=l=m=0}$ can be written:

$$\varphi_{0,0,0}(\mathbf{r}) = \left(\frac{\beta^2}{\pi}\right)^{3/4} e^{-\beta^2 r^2/2} \tag{39}$$

Since the ground state is not degenerate ($g_0 = 1$), $\varphi_{0,0,0}$ is the same as the
function $\varphi_{n_x=n_y=n_z=0}$ which is found by separating the Cartesian variables x, y
and z [cf. formula (6)].

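With the first excited state $E_1 = \frac{5}{2}\hbar\omega$, which is three-fold degenerate, is
again associated a single pair (k, l):

$$k = 0, \quad l = 1 \tag{40}$$

and $y_{0,1}(r) = a_0 r^2$. The three functions of the basis defined by $\mathbf{L}^2$ and $L_z$ are therefore:

$$\varphi_{0,1,m}(\mathbf{r}) = \sqrt{\frac{8}{3}}\,\frac{\beta^{3/2}}{\pi^{1/4}}\,\beta r\, e^{-\beta^2 r^2/2}\, Y_1^m(\theta, \varphi) \qquad m = 1, 0, -1 \tag{41}$$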

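We know [cf. complement $A_{VI}$, formulas (32)] that the spherical harmonics $Y_1^m$ are
such that:

$$r\, Y_1^0(\theta, \varphi) = \sqrt{\frac{3}{4\pi}}\; z$$

$$\frac{r}{\sqrt{2}}\left[Y_1^{-1} - Y_1^{1}\right] = \sqrt{\frac{3}{4\pi}}\; x$$

$$\frac{r}{\sqrt{2}}\left[Y_1^{-1} + Y_1^{1}\right] = -i\sqrt{\frac{3}{4\pi}}\; y \tag{42}$$

and that the Hermite polynomial of order 1 is [cf. complement $B_V$, formulas (18)]:

$$H_1(u) = 2u \tag{43}$$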

Consequently, it is clear that the three functions $\varphi_{0,1,m}$ are related to the
functions $\varphi_{n_x,n_y,n_z}$ of basis (6) by the equations:

$$\varphi_{n_x=0,\,n_y=0,\,n_z=1} = \varphi_{k=0,\,l=1,\,m=0}$$

$$\varphi_{n_x=1,\,n_y=0,\,n_z=0} = \frac{1}{\sqrt{2}}\left[\varphi_{k=0,\,l=1,\,m=-1} - \varphi_{k=0,\,l=1,\,m=1}\right]$$

$$\varphi_{n_x=0,\,n_y=1,\,n_z=0} = \frac{i}{\sqrt{2}}\left[\varphi_{k=0,\,l=1,\,m=-1} + \varphi_{k=0,\,l=1,\,m=1}\right] \tag{44}$$

Finally, consider the second excited state, of energy $E_2 = \frac{7}{2}\hbar\omega$. It is six-fold
degenerate, and the quantum numbers k and l can take on the values:

$$k = 0, \quad l = 2 \tag{45-a}$$

$$k = 2, \quad l = 0 \tag{45-b}$$

The function $y_{0,2}(r)$ which corresponds to the values (45-a) is simply $a_0 r^3$. For
the values (45-b), $y_{2,0}$ contains two terms; using (24) and (29), we easily find:

$$y_{2,0}(r) = a_0\, r\left[1 - \frac{2}{3}\beta^2 r^2\right] \tag{46}$$

The six basis functions in the eigensubspace associated with $E_2$ are thus of the form:

$$\varphi_{0,2,m}(\mathbf{r}) = \sqrt{\frac{16}{15}}\,\frac{\beta^{3/2}}{\pi^{1/4}}\,\beta^2 r^2\, e^{-\beta^2 r^2/2}\, Y_2^m(\theta, \varphi) \qquad m = 2, 1, 0, -1, -2 \tag{47-a}$$

$$\varphi_{2,0,0}(\mathbf{r}) = \sqrt{\frac{3}{2}}\,\frac{\beta^{3/2}}{\pi^{3/4}}\left[1 - \frac{2}{3}\beta^2 r^2\right] e^{-\beta^2 r^2/2} \tag{47-b}$$


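Knowing the explicit expressions for the spherical harmonics [formulas (33) of
complement $A_{VI}$] and the Hermite polynomials [formulas (18) of complement $B_V$],
we can easily prove the following relations:

$$\varphi_{k=2,\,l=0,\,m=0} = -\frac{1}{\sqrt{3}}\left[\varphi_{n_x=2,\,n_y=0,\,n_z=0} + \varphi_{n_x=0,\,n_y=2,\,n_z=0} + \varphi_{n_x=0,\,n_y=0,\,n_z=2}\right]$$

$$\frac{1}{\sqrt{2}}\left[\varphi_{k=0,\,l=2,\,m=2} + \varphi_{k=0,\,l=2,\,m=-2}\right] = \frac{1}{\sqrt{2}}\left[\varphi_{n_x=2,\,n_y=0,\,n_z=0} - \varphi_{n_x=0,\,n_y=2,\,n_z=0}\right]$$

$$\frac{1}{\sqrt{2}}\left[\varphi_{k=0,\,l=2,\,m=2} - \varphi_{k=0,\,l=2,\,m=-2}\right] = i\,\varphi_{n_x=1,\,n_y=1,\,n_z=0}$$

$$\frac{1}{\sqrt{2}}\left[\varphi_{k=0,\,l=2,\,m=1} - \varphi_{k=0,\,l=2,\,m=-1}\right] = -\,\varphi_{n_x=1,\,n_y=0,\,n_z=1}$$

$$\frac{1}{\sqrt{2}}\left[\varphi_{k=0,\,l=2,\,m=1} + \varphi_{k=0,\,l=2,\,m=-1}\right] = -i\,\varphi_{n_x=0,\,n_y=1,\,n_z=1}$$

$$\varphi_{k=0,\,l=2,\,m=0} = \sqrt{\frac{2}{3}}\left[\varphi_{n_x=0,\,n_y=0,\,n_z=2} - \frac{1}{2}\,\varphi_{n_x=2,\,n_y=0,\,n_z=0} - \frac{1}{2}\,\varphi_{n_x=0,\,n_y=2,\,n_z=0}\right] \tag{48}$$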


comment: 


As we pointed out in the beginning of this complement, we can apply
a method analogous to the one presented in complement $D_{VI}$ to the isotropic
three-dimensional harmonic oscillator. If $a_x$, $a_y$ and $a_z$ are the annihilation
operators which act in the state spaces $\mathcal{E}_x$, $\mathcal{E}_y$ and $\mathcal{E}_z$ respectively, we define:

$$a_d = \frac{1}{\sqrt{2}}(a_x - i a_y) \tag{49-a}$$

$$a_g = \frac{1}{\sqrt{2}}(a_x + i a_y) \tag{49-b}$$

It can be shown that $a_d$ and $a_g$ behave like independent annihilation operators
(complement $D_{VI}$, §3-b). The Hamiltonian H and the angular
momentum operators can then be expressed in terms of $a_d$, $a_g$, $a_z$ and their
adjoints:

$$H = \hbar\omega\left(N_d + N_g + N_z + \frac{3}{2}\right) \tag{50-a}$$

$$L_z = \hbar(N_d - N_g) \tag{50-b}$$

$$L_+ = \hbar\sqrt{2}\,(a_z^\dagger a_g - a_d^\dagger a_z) \tag{50-c}$$

$$L_- = \hbar\sqrt{2}\,(a_g^\dagger a_z - a_z^\dagger a_d) \tag{50-d}$$

The common eigenvectors $|\chi_{n_d,n_g,n_z}\rangle$ of the observables $N_d$, $N_g$ and $N_z$ can be
obtained through the action of the creation operators $a_d^\dagger$, $a_g^\dagger$ and $a_z^\dagger$ on the
ground state $|0, 0, 0\rangle$ of the Hamiltonian H [this state is unique to within
a constant factor; cf. formulas (6) and (39)]:

$$|\chi_{n_d,n_g,n_z}\rangle = \frac{1}{\sqrt{n_d!\, n_g!\, n_z!}}\,(a_d^\dagger)^{n_d} (a_g^\dagger)^{n_g} (a_z^\dagger)^{n_z}\, |0, 0, 0\rangle \tag{51}$$

According to (50-a) and (50-b), $|\chi_{n_d,n_g,n_z}\rangle$ is an eigenvector of H and $L_z$
with the eigenvalues $(n_d + n_g + n_z + 3/2)\hbar\omega$ and $(n_d - n_g)\hbar$. The eigensubspace
$\mathcal{E}_n$ associated with a given energy $E_n$ can therefore be spanned by
the set of vectors $|\chi_{n_d,n_g,n_z}\rangle$ such that:

$$n_d + n_g + n_z = n \tag{52}$$

Of these, the eigenvector of $L_z$ with the largest eigenvalue compatible
with $E_n$ is $|\chi_{n,0,0}\rangle$, whose eigenvalue is $n\hbar$. This ket, according to (50-c),
satisfies:

$$L_+\, |\chi_{n,0,0}\rangle = 0 \tag{53}$$

Consequently*, it is an eigenvector of $\mathbf{L}^2$ with the eigenvalue $n(n + 1)\hbar^2$,
and it can be identified with the ket of the $\{|\varphi_{k,l,m}\rangle\}$ basis such that:

$$k + l = n, \qquad l = m = n \tag{54}$$

Therefore:

$$|\varphi_{k=0,\,l=n,\,m=n}\rangle = |\chi_{n_d=n,\,n_g=0,\,n_z=0}\rangle \tag{55}$$

Application of the operator $L_-$ [formula (50-d)] to both sides of
relation (55) yields:

$$|\varphi_{0,n,n-1}\rangle = -\,|\chi_{n-1,0,1}\rangle \tag{56}$$

* This result follows directly from relation (C-7-b) of chapter VI, which, applied to $|\chi_{n,0,0}\rangle$,
yields: $\mathbf{L}^2\, |\chi_{n,0,0}\rangle = \hbar^2 (n^2 + n)\, |\chi_{n,0,0}\rangle$.
The eigenvalue $(n - 2)\hbar$ of $L_z$, unlike the two preceding ones, is
two-fold degenerate in $\mathcal{E}_n$: two orthogonal vectors, $|\chi_{n-2,0,2}\rangle$ and $|\chi_{n-1,1,0}\rangle$,
correspond to it. Using (50-d) again in order to apply $L_-$ to (56), we find that:

$$|\varphi_{0,n,n-2}\rangle = \sqrt{\frac{2(n-1)}{2n-1}}\; |\chi_{n-2,0,2}\rangle - \frac{1}{\sqrt{2n-1}}\; |\chi_{n-1,1,0}\rangle \tag{57}$$

It can be shown that the action of $L_+$ on the linear combination orthogonal
to (57) yields the null vector. This linear combination must therefore be an
eigenvector of $\mathbf{L}^2$ with the eigenvalue $(n - 2)(n - 1)\hbar^2$. This gives, to
within a phase factor:

$$|\varphi_{2,n-2,n-2}\rangle = \frac{1}{\sqrt{2n-1}}\; |\chi_{n-2,0,2}\rangle + \sqrt{\frac{2(n-1)}{2n-1}}\; |\chi_{n-1,1,0}\rangle \tag{58}$$

We can thus relate, by iteration*, the two bases, $\{|\chi_{n_d,n_g,n_z}\rangle\}$
and $\{|\varphi_{k,l,m}\rangle\}$. Of course, replacing $a_d^\dagger$ and $a_g^\dagger$ by functions of $a_x^\dagger$ and $a_y^\dagger$
in (51), we can express $|\chi_{n_d,n_g,n_z}\rangle$ as a linear combination of the vectors
$|\varphi_{n_x,n_y,n_z}\rangle$ whose wave functions are given by (6).
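The coefficients in (57) and (58) can be checked numerically. The sketch below represents a vector of the two-dimensional subspace by its components on ($|\chi_{n-2,0,2}\rangle$, $|\chi_{n-1,1,0}\rangle$) and uses the fact, read off from (50-c), that $L_+$ sends $|\chi_{n-2,0,2}\rangle$ to $-\hbar\sqrt{2}\sqrt{2(n-1)}\,|\chi_{n-1,0,1}\rangle$ and $|\chi_{n-1,1,0}\rangle$ to $\hbar\sqrt{2}\,|\chi_{n-1,0,1}\rangle$. The function names are illustrative, and amplitudes are expressed in units of $\hbar\sqrt{2}$.

```python
import math


def phi_0_n_nm2(n):
    """Components of |phi_{0,n,n-2}> on (chi_{n-2,0,2}, chi_{n-1,1,0}), relation (57)."""
    return (math.sqrt(2 * (n - 1) / (2 * n - 1)), -1 / math.sqrt(2 * n - 1))


def phi_2_nm2_nm2(n):
    """Components of |phi_{2,n-2,n-2}> on the same two vectors, relation (58)."""
    return (1 / math.sqrt(2 * n - 1), math.sqrt(2 * (n - 1) / (2 * n - 1)))


def l_plus_amplitude(components, n):
    """Amplitude of L_+ (c1 chi_{n-2,0,2} + c2 chi_{n-1,1,0}) on chi_{n-1,0,1},
    in units of hbar * sqrt(2), using (50-c)."""
    c1, c2 = components
    return -math.sqrt(2 * (n - 1)) * c1 + c2


if __name__ == "__main__":
    n = 4  # any integer n >= 2 works
    a, b = phi_0_n_nm2(n), phi_2_nm2_nm2(n)
    print("norms:", a[0] ** 2 + a[1] ** 2, b[0] ** 2 + b[1] ** 2)
    print("overlap:", a[0] * b[0] + a[1] * b[1])
    print("L_+ on (58):", l_plus_amplitude(b, n))
```

Both vectors are normalized and mutually orthogonal, and $L_+$ annihilates the combination (58), as required for a state with l = m = n − 2.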


References and suggestions for further reading: 

Other soluble examples (spherical square well, etc.): Messiah (1.17), chap. IX, § 10; 
Schiff (1.18), §15; see also Flügge (1.24), §§58 to 79.


* An argument analogous to the one just outlined will be used in chapter X to add two angular 
momenta. 
Complement $C_{VII}$

PROBABILITY CURRENTS ASSOCIATED 

WITH THE STATIONARY STATES OF THE HYDROGEN ATOM 


1. General expression for the probability current 

2. Application to the stationary states of the hydrogen atom 

a. Structure of the probability current 

b. Effect of a magnetic field 


The normalized wave functions $\varphi_{n,l,m}(\mathbf{r})$ associated with the stationary states
of the hydrogen atom were determined in chapter VII. $\varphi_{n,l,m}(\mathbf{r})$ is the product of
the spherical harmonic $Y_l^m(\theta, \varphi)$ and the function $R_{n,l}(r)$ calculated in §C-3 of this
chapter:

$$\varphi_{n,l,m}(\mathbf{r}) = R_{n,l}(r)\, Y_l^m(\theta, \varphi) \tag{1}$$

The spatial variation of the probability density:

$$\rho_{n,l,m}(\mathbf{r}) = |\varphi_{n,l,m}(\mathbf{r})|^2 \tag{2}$$

was then studied, at least for the lowest energy states. 

It is important, however, to understand that a stationary state is not characterized
only by the value of the probability density $\rho_{n,l,m}(\mathbf{r})$ at all points of space.
We must also associate with it a probability current which can be expressed as:

$$\mathbf{J}_{n,l,m}(\mathbf{r}) = \frac{1}{2\mu}\,\varphi^*_{n,l,m}(\mathbf{r})\,\frac{\hbar}{i}\nabla\varphi_{n,l,m}(\mathbf{r}) + \text{c.c.} \tag{3}$$

[we assume here that the vector potential $\mathbf{A}(\mathbf{r}, t)$ is zero; $\mu$ denotes the mass of the
particle].

Thus, we can associate with the quantum state of a particle a “fluid” (called
the “probability fluid”) whose density at each point of space is $\rho(\mathbf{r})$. This fluid is
not motionless; it is in a state of flow characterized by the current density $\mathbf{J}$. In
a stationary state, $\rho$ and $\mathbf{J}$ are not time-dependent: the fluid is in a steady state
of flow.

To complete the results of chapter VII concerning the physical properties
of the stationary states of the hydrogen atom, we shall now study the probability
currents $\mathbf{J}_{n,l,m}(\mathbf{r})$.


1. General expression for the probability current 

Consider an arbitrary normalized wave function $\psi(\mathbf{r})$. We introduce the
real quantities $\alpha(\mathbf{r})$ [the modulus of $\psi(\mathbf{r})$] and $\xi(\mathbf{r})$ [the argument of $\psi(\mathbf{r})$] by
setting:

$$\psi(\mathbf{r}) = \alpha(\mathbf{r})\, e^{i\xi(\mathbf{r})} \tag{4}$$

with:

$$\alpha(\mathbf{r}) \geq 0\ ; \qquad 0 \leq \xi(\mathbf{r}) < 2\pi \tag{5}$$

If we substitute (4) into the expressions for the probability density $\rho(\mathbf{r})$ and the
current $\mathbf{J}(\mathbf{r})$, we obtain [still assuming the vector potential $\mathbf{A}(\mathbf{r})$ to be zero]:

$$\rho(\mathbf{r}) = \alpha^2(\mathbf{r}) \tag{6}$$

$$\mathbf{J}(\mathbf{r}) = \frac{\hbar}{\mu}\,\alpha^2(\mathbf{r})\,\nabla\xi(\mathbf{r}) \tag{7}$$

Therefore, $\rho(\mathbf{r})$ depends only on the modulus of the wave function, while $\mathbf{J}(\mathbf{r})$ brings
in its phase [for example, $\mathbf{J}(\mathbf{r})$ is zero if this phase is constant throughout all space].
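Relation (7) can be illustrated on a one-dimensional example. The sketch below takes a hypothetical wave function with modulus $e^{-x^2/2}$ and phase $kx + 0.3x^2$ (both chosen only for illustration, in units where $\hbar = \mu = 1$, and with the phase left unreduced modulo $2\pi$). It compares the current computed from the definition, $\mathrm{Im}(\psi^* \, d\psi/dx)$ evaluated by a central finite difference, with the expression $\alpha^2\, d\xi/dx$.

```python
import cmath
import math


def psi(x, k=1.5):
    """Illustrative wave function alpha(x) exp(i xi(x)), not normalized."""
    alpha = math.exp(-x * x / 2)
    xi = k * x + 0.3 * x * x
    return alpha * cmath.exp(1j * xi)


def current_from_definition(x, h=1e-5):
    """(1/2)[psi* (1/i) dpsi/dx + c.c.] = Im(psi* dpsi/dx), with hbar = mu = 1."""
    dpsi = (psi(x + h) - psi(x - h)) / (2 * h)
    return (psi(x).conjugate() * dpsi).imag


def current_from_phase(x, k=1.5):
    """alpha^2 dxi/dx, the one-dimensional form of relation (7)."""
    return math.exp(-x * x) * (k + 0.6 * x)


if __name__ == "__main__":
    for x in (-1.0, 0.0, 0.7):
        print(x, current_from_definition(x), current_from_phase(x))
```

The two columns agree to within the finite-difference error, and setting the phase to a constant would make both vanish, as stated above.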


comment: 


If the wave function $\psi(\mathbf{r})$ is given, it is obvious that $\rho(\mathbf{r})$ and $\mathbf{J}(\mathbf{r})$ are perfectly
well-defined. Conversely, is there always one and only one function $\psi(\mathbf{r})$ corresponding
to given values of $\rho(\mathbf{r})$ and $\mathbf{J}(\mathbf{r})$?

According to (6), the modulus $\alpha(\mathbf{r})$ of the wave function can be obtained directly
from $\rho(\mathbf{r})$*; the argument $\xi(\mathbf{r})$ must satisfy the equation:

$$\nabla\xi(\mathbf{r}) = \frac{\mu}{\hbar}\,\frac{\mathbf{J}(\mathbf{r})}{\rho(\mathbf{r})} \tag{8}$$

We know that such an equation has a solution only if:

$$\nabla \times \frac{\mathbf{J}(\mathbf{r})}{\rho(\mathbf{r})} = 0 \tag{9}$$

It then has an infinite number of solutions, which differ from each other by a constant.
Since this constant corresponds to a global phase factor, it follows, if condition (9) is
satisfied, that the wave function of the particle is perfectly well-defined by the specification
of $\rho(\mathbf{r})$ and $\mathbf{J}(\mathbf{r})$. If condition (9) is not fulfilled, no wave function exists which
corresponds to the values of $\rho(\mathbf{r})$ and $\mathbf{J}(\mathbf{r})$ under consideration.


2. Application to the stationary states of the hydrogen atom 

a. STRUCTURE OF THE PROBABILITY CURRENT 

When the wave function has the form of (1), where $R_{n,l}(r)$ is a real function
and $Y_l^m(\theta, \varphi)$ is the product of $e^{im\varphi}$ and a real function, we have:

$$\alpha_{n,l,m}(\mathbf{r}) = |R_{n,l}(r)|\;|Y_l^m(\theta, \varphi)|$$

$$\xi_{n,l,m}(\mathbf{r}) = m\varphi \tag{10}$$

Applying formula (7) and using the expression for the gradient in polar coordinates,
we obtain:

$$\mathbf{J}_{n,l,m}(\mathbf{r}) = \frac{\hbar}{\mu}\, m\, \frac{\rho_{n,l,m}(\mathbf{r})}{r \sin\theta}\; \mathbf{e}_\varphi(\mathbf{r}) \tag{11}$$

where $\mathbf{e}_\varphi(\mathbf{r})$ is the unit vector which forms with Oz and $\mathbf{r}$ a right-handed Cartesian
coordinate frame.

* Of course, in order to be a probability density, $\rho(\mathbf{r})$ must be positive everywhere.

Probability current variations in a plane perpendicular to Oz are shown in 
figure 1. 



FIGURE 1

Structure of the probability current associated with
a stationary state $|\varphi_{n,l,m}\rangle$ of the hydrogen atom
(shown in a plane perpendicular to Oz). The index m
refers to the eigenvalue $m\hbar$ of $L_z$. If m > 0, the probability
fluid rotates counterclockwise about Oz; if
m < 0, clockwise. If m = 0, the probability current
is zero at all points of space.


According to (11), the current at each point M is perpendicular to the plane
defined by M and the Oz axis: the probability fluid rotates about Oz. Since $|\mathbf{J}|$ is
not proportional to $r \sin\theta\; \rho(\mathbf{r})$, the system does not rotate as a whole. The eigenvalue
$m\hbar$ of the observable $L_z$ can be interpreted as the classical angular momentum
associated with this rotational motion of the probability fluid. The contribution
of the volume element $d^3r$, situated at the point $\mathbf{r}$, to the angular momentum with
respect to the origin can be written:

$$d\boldsymbol{\mathcal{L}} = \mu\, \mathbf{r} \times \mathbf{J}_{n,l,m}(\mathbf{r})\, d^3r \tag{12}$$

By symmetry, the resultant of all these elementary momenta is directed along Oz;
it is equal to:

$$\mathcal{L}_z = \mu \int d^3r\; \mathbf{e}_z \cdot [\mathbf{r} \times \mathbf{J}_{n,l,m}(\mathbf{r})] \tag{13}$$

Using expression (11) for $\mathbf{J}_{n,l,m}(\mathbf{r})$, we easily obtain:

$$\mathcal{L}_z = \mu \int d^3r\; r\, |\mathbf{J}_{n,l,m}(\mathbf{r})| \sin\theta = m\hbar \int d^3r\; \rho_{n,l,m}(\mathbf{r}) = m\hbar \tag{14}$$
b. EFFECT OF A MAGNETIC FIELD


The results obtained thus far are valid only if the vector potential $\mathbf{A}(\mathbf{r})$ is
zero; let us examine what happens when this is not the case. Assume, for example,
that we place the hydrogen atom in a uniform magnetic field $\mathbf{B}$. This field can be
described by the vector potential:

$$\mathbf{A}(\mathbf{r}) = -\frac{1}{2}\,\mathbf{r} \times \mathbf{B} \tag{15}$$


What is the probability current associated with the ground state ? 

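For simplicity, we shall also assume that the magnetic field $\mathbf{B}$ does not
modify the wave function of the ground state*. The probability current can then be
calculated from the general expression for $\mathbf{J}$ [cf. relation (D-20) of chapter III].
This yields:

$$\mathbf{J}_{n,l,m}(\mathbf{r}) = \frac{1}{2\mu}\left\{\varphi^*_{n,l,m}(\mathbf{r})\left[\frac{\hbar}{i}\nabla - q\mathbf{A}(\mathbf{r})\right]\varphi_{n,l,m}(\mathbf{r}) + \text{c.c.}\right\} = \frac{1}{\mu}\,\rho_{n,l,m}(\mathbf{r})\left[\hbar\nabla\xi_{n,l,m}(\mathbf{r}) - q\mathbf{A}(\mathbf{r})\right] \tag{16}$$

For the ground state and a field $\mathbf{B}$ directed along Oz, we obtain, using (15):

$$\mathbf{J}_{1,0,0}(\mathbf{r}) = \frac{\omega_c}{2}\,\rho_{1,0,0}(\mathbf{r})\; \mathbf{e}_z \times \mathbf{r} \tag{17}$$

where the cyclotron frequency $\omega_c$ is defined by:

$$\omega_c = -\frac{qB}{\mu} \tag{18}$$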


Therefore, the probability current of the ground state is not zero in the
presence of a magnetic field as it is when $\mathbf{B} = 0$. Expression (17) indicates that
the probability fluid rotates as a whole about $\mathbf{B}$ with the angular velocity $\omega_c/2$.
Physically, this result arises from the fact that when the magnetic field $\mathbf{B}$ is turned
on, a transient electric field $\mathbf{E}(t)$ must exist. Under its influence, the electron, while
remaining in the ground state, goes into rotation about the proton, with an angular
velocity depending only on the value of $\mathbf{B}$ (and not on the precise way in which
the field was turned on during the transition period).


comment: 

The particular choice of gauge made in (15) allowed us to retain the same wave
functions we used with only a negligible error in the absence of a field (cf. note at the
bottom of this page). With another gauge, the wave functions would have been different
(cf. complement $H_{III}$) and, in (16), the term containing $\mathbf{A}(\mathbf{r})$ explicitly would not have
been the only one to contribute to the value of $\mathbf{J}(\mathbf{r})$, to first order in $\mathbf{B}$. Nevertheless,
we would have found (17) at the end of the calculation, since the physical result must
not be gauge-dependent.

* Since the Hamiltonian H depends on $\mathbf{B}$, this is obviously not rigorously correct. Nevertheless,
by considering the expression for H [cf. formulas (6) and (7) of complement $D_{VII}$], we can show that,
for the gauge chosen in (15) and $\mathbf{B}$ directed along Oz, the functions $\varphi_{n,l,m}(\mathbf{r})$ are eigenfunctions of H to
within a term of second order in $\mathbf{B}$. By using the perturbation theory of chapter XI, one can show that,
for magnetic fields normally produced in the laboratory, this second-order term is negligible.
Complement $D_{VII}$

THE HYDROGEN ATOM PLACED 
IN A UNIFORM MAGNETIC FIELD. 
PARAMAGNETISM AND DIAMAGNETISM. 
THE ZEEMAN EFFECT 


1. The Hamiltonian of the problem. The paramagnetic term and the diamagnetic term 

a. Expression for the Hamiltonian 

b. Order of magnitude of the various terms 

c. Interpretation of the paramagnetic term 

d. Interpretation of the diamagnetic term 

2. The Zeeman effect 

a. Energy levels of the atom in the presence of the magnetic field 

b. Electric dipole oscillations 

c. Frequency and polarization of emitted radiation 


In chapter VII, we studied the quantum mechanical properties of a free hydrogen 
atom, that is, of the system formed by an electron and a proton exerting an
electrostatic attraction on each other but not interacting with any external field. This
complement is devoted to the study of the new effects which appear when this atom 
is placed in a static magnetic field. We shall confine ourselves to the case in which 
this field is uniform, as it always is, moreover, in practice, since the magnetic 
fields that can be produced in the laboratory vary very little in relative value over 
distances comparable to atomic dimensions. 

We have already studied the behavior of an electron subjected either to an 
electric field alone (cf., for example, chapter VII) or to a magnetic field alone (cf.
complement $E_{VI}$). Here, we shall generalize these discussions by calculating the energy
levels of an electron subjected both to the influence of the internal electric field 
of the atom and to an external magnetic field. Under these conditions, the exact 
solution of the Schrodinger equation may seem to be a very complicated problem. 
However, we shall see that this problem can be simplified considerably by means of 
certain approximations. In the first place, we shall totally neglect the nuclear finite 
mass effect*. Then we shall use the fact that, in practice, the effect of the external 
magnetic field is much smaller than that of the internal electric field of the atom : 
the atomic level shifts due to the magnetic field are much smaller than the energy 
separations in a zero field. 

The discussion presented in this complement will enable us to introduce and 
explain certain effects which are important in atomic physics. We shall see, in 


particular, how atomic paramagnetism and diamagnetism appear in the quantum
mechanical formalism. In addition, we shall be able to predict the modifications
occurring in the optical spectrum emitted by hydrogen atoms when they are placed
in a static magnetic field (the Zeeman effect).

* For the hydrogen atom, such an approximation is justified by the fact that the proton is
considerably heavier than the electron. For muonium (cf. complement $A_{VII}$), the approximation is not as good,
and it becomes totally inapplicable for the case of positronium. We note, moreover, that, in the presence
of a magnetic field, it is not rigorously possible to separate the motion of the center of mass. If
one wished to take the nuclear finite mass effect into account in this complement, it would not suffice
to replace the mass $m_e$ of the electron by the reduced mass $\mu$ of the electron-proton system.


1. The Hamiltonian of the problem. 

The paramagnetic term and the diamagnetic term 


a. EXPRESSION FOR THE HAMILTONIAN 

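Consider a spinless particle, of mass $m_e$ and charge q, subjected simultaneously
to a scalar central potential V(r) and a vector potential $\mathbf{A}(\mathbf{r})$. Its Hamiltonian is:

$$H = \frac{1}{2m_e}\left[\mathbf{P} - q\mathbf{A}(\mathbf{R})\right]^2 + V(R) \tag{1}$$

When the magnetic field $\mathbf{B} = \nabla \times \mathbf{A}(\mathbf{r})$ is uniform, the vector potential $\mathbf{A}$
can be put into the form:

$$\mathbf{A}(\mathbf{r}) = -\frac{1}{2}\,\mathbf{r} \times \mathbf{B} \tag{2}$$

To substitute this expression into (1), we shall calculate the quantity:

$$\left[\mathbf{P} - q\mathbf{A}(\mathbf{R})\right]^2 = \mathbf{P}^2 + \frac{q}{2}\left[\mathbf{P} \cdot (\mathbf{R} \times \mathbf{B}) + (\mathbf{R} \times \mathbf{B}) \cdot \mathbf{P}\right] + \frac{q^2}{4}(\mathbf{R} \times \mathbf{B})^2 \tag{3}$$

Now, $\mathbf{B}$ is actually a constant and not an operator. All observables therefore
commute with $\mathbf{B}$, so we can write, using the rules of vector calculus:

$$\left[\mathbf{P} - q\mathbf{A}(\mathbf{R})\right]^2 = \mathbf{P}^2 + \frac{q}{2}\left[\mathbf{B} \cdot (\mathbf{P} \times \mathbf{R}) - (\mathbf{R} \times \mathbf{P}) \cdot \mathbf{B}\right] + \frac{q^2}{4}\left[\mathbf{R}^2 \mathbf{B}^2 - (\mathbf{R} \cdot \mathbf{B})^2\right] \tag{4}$$

On the right-hand side of this expression, the angular momentum $\mathbf{L}$ of the
particle appears:

$$\mathbf{L} = \mathbf{R} \times \mathbf{P} = -\mathbf{P} \times \mathbf{R} \tag{5}$$

We can therefore write H in the form:

$$H = H_0 + H_1 + H_2 \tag{6}$$

where $H_0$, $H_1$ and $H_2$ are defined by:

$$H_0 = \frac{\mathbf{P}^2}{2m_e} + V(R) \tag{7-a}$$

$$H_1 = -\frac{\mu_B}{\hbar}\,\mathbf{L} \cdot \mathbf{B} \tag{7-b}$$

$$H_2 = \frac{q^2 \mathbf{B}^2}{8 m_e}\,\mathbf{R}_\perp^2 \tag{7-c}$$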
In these relations, $\mu_B$ denotes the Bohr magneton (whose dimensions are those
of a magnetic moment):

$$\mu_B = \frac{q\hbar}{2m_e} \tag{8}$$

and the operator $\mathbf{R}_\perp$ is the projection of $\mathbf{R}$ onto a plane perpendicular to $\mathbf{B}$:

$$\mathbf{R}_\perp^2 = \mathbf{R}^2 - \frac{(\mathbf{R} \cdot \mathbf{B})^2}{\mathbf{B}^2} \tag{9}$$

If we choose a system of orthonormal axes Oxyz such that $\mathbf{B}$ is parallel to Oz, we
have:

$$\mathbf{R}_\perp^2 = X^2 + Y^2 \tag{10}$$
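The algebra leading from (1) to (7) has a classical counterpart that is easy to check numerically: for ordinary vectors, with the gauge (2), $(\mathbf{p} - q\mathbf{A})^2 = \mathbf{p}^2 - q\,\mathbf{L}\cdot\mathbf{B} + \frac{q^2}{4}\mathbf{B}^2 r_\perp^2$. The following Python sketch evaluates both sides for arbitrary sample vectors; the helper names and the numerical values are illustrative, and operator-ordering questions (absent for classical vectors) are not addressed.

```python
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def kinetic_square(r, p, B, q):
    """(p - q A)^2 with the gauge A = -(1/2) r x B of relation (2)."""
    A = tuple(-0.5 * c for c in cross(r, B))
    pi = tuple(pc - q * ac for pc, ac in zip(p, A))
    return dot(pi, pi)


def expanded_form(r, p, B, q):
    """p^2 - q L.B + (q^2 / 4) B^2 r_perp^2, the classical analogue of (4)-(10)."""
    L = cross(r, p)
    r_perp_sq = dot(r, r) - dot(r, B) ** 2 / dot(B, B)
    return dot(p, p) - q * dot(L, B) + 0.25 * q * q * dot(B, B) * r_perp_sq


if __name__ == "__main__":
    r, p, B, q = (0.3, -1.2, 0.7), (1.1, 0.4, -0.5), (0.2, 0.1, 2.0), -1.0
    print(kinetic_square(r, p, B, q), expanded_form(r, p, B, q))
```

Dividing by $2m_e$, the term linear in $\mathbf{B}$ reproduces $H_1$ of (7-b) and the quadratic term reproduces $H_2$ of (7-c).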


comment: 


When the field $\mathbf{B}$ is zero, H becomes equal to $H_0$, which is the sum of the kinetic
energy $\mathbf{P}^2/2m_e$ and the potential energy V(R). Nevertheless, we must not conclude from
this that when $\mathbf{B}$ is not zero, $\mathbf{P}^2/2m_e$ still represents the kinetic energy of the electron.
We have seen (cf. complement $H_{III}$) that the physical meaning of operators acting in
the state space changes when the vector potential is not zero. For example, the momentum
$\mathbf{P}$ no longer represents the mechanical momentum $\boldsymbol{\Pi} = m_e \mathbf{V}$, and the kinetic
energy is then equal to:

$$\frac{\boldsymbol{\Pi}^2}{2m_e} = \frac{1}{2m_e}\left[\mathbf{P} - q\mathbf{A}(\mathbf{R})\right]^2 \tag{11}$$

The meaning of the term $\mathbf{P}^2/2m_e$, taken alone, depends on the gauge chosen. With the
one defined by (2), it can easily be shown to correspond to the “relative” kinetic
energy $\boldsymbol{\Pi}_R^2/2m_e$, where $\boldsymbol{\Pi}_R$ is the mechanical momentum of the particle with respect to
the “Larmor frame” rotating about $\mathbf{B}$ with angular velocity $\omega_L = -qB/2m_e$. The
term $H_2$ then describes the kinetic energy $\boldsymbol{\Pi}_E^2/2m_e$ related to the drag velocity of the
frame. As for $H_1$, it corresponds to the cross term $\boldsymbol{\Pi}_E \cdot \boldsymbol{\Pi}_R/m_e$.


b. ORDER OF MAGNITUDE OF THE VARIOUS TERMS 


In the presence of the magnetic field $\mathbf{B}$, two new terms, $H_1$ and $H_2$, therefore
appear in H. Before examining their physical meaning in greater detail, we shall
calculate the order of magnitude of the energy differences $\Delta E$ (or the frequency
differences $\Delta E/h$) associated with them.

As far as $H_0$ is concerned, we already know the corresponding energy
differences $\Delta E_0$ (cf. chap. VII). The associated frequencies are of the order of:

$$\frac{\Delta E_0}{h} \simeq 10^{14} \text{ to } 10^{15}\ \text{Hz} \tag{12}$$

Also, by using (7-b), we see that $\Delta E_1$ is approximately given by:

$$\frac{\Delta E_1}{h} \simeq \frac{1}{h}\left|\frac{\mu_B}{\hbar}\right| \hbar B = \frac{\omega_L}{2\pi} \tag{13}$$
where $\omega_L$ is the Larmor angular velocity*:

$$\omega_L = -\frac{qB}{2m_e} \tag{14}$$

A simple numerical calculation shows that, for an electron, the Larmor frequency
is such that:

$$\frac{\nu_L}{B} = \frac{\omega_L}{2\pi B} \simeq 1.40 \times 10^{10}\ \text{Hz/tesla} = 1.40\ \text{MHz/gauss} \tag{15}$$

Now, with the fields usually produced in the laboratory (which rarely exceed
100,000 gauss), we have:

$$\frac{\omega_L}{2\pi} \lesssim 10^{11}\ \text{Hz} \tag{16}$$

Comparing (12) and (16), we see that:

$$\Delta E_1 \ll \Delta E_0 \tag{17}$$

Let us show, similarly, that:

$$\Delta E_2 \ll \Delta E_1 \tag{18}$$

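To do so, we shall evaluate the order of magnitude $\Delta E_2$ of the energies associated
with $H_2$. The matrix elements of the operator $\mathbf{R}_\perp^2 = X^2 + Y^2$ are of the same
order of magnitude as $a_0^2$, where $a_0 = \hbar^2/m_e e^2$ characterizes atomic dimensions.
Thus we obtain:

$$\Delta E_2 \simeq \frac{q^2 B^2}{m_e}\, a_0^2 \tag{19}$$

We then find the ratio:

$$\frac{\Delta E_2}{\Delta E_1} \simeq \frac{q^2 B^2}{m_e}\, a_0^2\, \frac{1}{\hbar\omega_L} \simeq \hbar\,\frac{qB}{m_e}\,\frac{m_e a_0^2}{\hbar^2} \tag{20}$$

Now, according to formulas (C-12-a) and (C-12-b) of chapter VII:

$$\Delta E_0 \simeq \frac{\hbar^2}{m_e a_0^2} \tag{21}$$

Relation (20) therefore yields, with (13) taken into account:

$$\frac{\Delta E_2}{\Delta E_1} \simeq \frac{\Delta E_1}{\Delta E_0} \tag{22}$$

which, according to (17), proves (18).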
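These orders of magnitude can be reproduced with a few lines of arithmetic. The Python sketch below uses the constants quoted at the front of the book (charge magnitude, electron mass, $\hbar$, Bohr radius) and only the magnitudes of the quantities involved; the function names are illustrative, and the ratio is the rough estimate (20), not a precise prediction.

```python
import math

Q = 1.602189e-19      # magnitude of the electron charge, coulomb
M_E = 9.10953e-31     # electron mass, kg
HBAR = 1.054589e-34   # joule second
A_0 = 0.5291771e-10   # Bohr radius, m


def larmor_frequency(b_tesla):
    """|omega_L| / (2 pi) in Hz, from omega_L = -qB / (2 m_e), relation (14)."""
    return Q * b_tesla / (4 * math.pi * M_E)


def bohr_magneton():
    """|mu_B| = |q| hbar / (2 m_e) in joule/tesla, relation (8)."""
    return Q * HBAR / (2 * M_E)


def ratio_dia_to_para(b_tesla):
    """Estimate (20): hbar (qB/m_e) (m_e a_0^2 / hbar^2) = |q| B a_0^2 / hbar."""
    return Q * b_tesla * A_0 ** 2 / HBAR


if __name__ == "__main__":
    print("Larmor frequency per tesla (Hz):", larmor_frequency(1.0))
    print("|mu_B| (J/T):", bohr_magneton())
    print("Delta E_2 / Delta E_1 at 10 T:", ratio_dia_to_para(10.0))
```

At 10 tesla (100,000 gauss) the Larmor frequency is about $1.4 \times 10^{11}$ Hz, and the ratio $\Delta E_2/\Delta E_1$ is a few times $10^{-5}$, which illustrates (16) and (18).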


* Note that the Larmor frequency $\omega_L/2\pi$ is half the cyclotron frequency.
Therefore, the effects of the magnetic field always remain, in practice, much
smaller than those due to the internal field of the atom. Moreover, it is generally
sufficient, when we study them, to retain only the term $H_1$, compared to which $H_2$
is negligible ($H_2$ will be taken into account only in the special cases in which the
contribution of $H_1$ is zero)*.


c. INTERPRETATION OF THE PARAMAGNETIC TERM


Consider, first of all, the term $H_1$ given by (7-b). We shall see that it can be
interpreted to be the coupling energy $-\mathbf{M}_1 \cdot \mathbf{B}$ of the field $\mathbf{B}$ and the magnetic
moment $\mathbf{M}_1$ related to the revolution of the electron in its orbit.

For this purpose, we shall begin by calculating the magnetic moment $\boldsymbol{\mathcal{M}}$ classically
associated with a charge q in a circular orbit of radius r (fig. 1). If the speed
of the particle is v, its motion is equivalent to a current:

$$i = q\,\frac{v}{2\pi r} \tag{23}$$

Since the surface S defined by this current is:

$$S = \pi r^2 \tag{24}$$

the magnetic moment $\boldsymbol{\mathcal{M}}$ is given by:

$$|\boldsymbol{\mathcal{M}}| = i \times S = \frac{q}{2}\, r v \tag{25}$$

FIGURE 1

Classically, the motion of an electron in its orbit can be
regarded as a current loop of magnetic moment $\boldsymbol{\mathcal{M}}$.

Introducing the angular momentum $\boldsymbol{\mathcal{L}}$, which, since the velocity is tangential, has
a modulus of:

$$|\boldsymbol{\mathcal{L}}| = m_e r v \tag{26}$$

we can write (25) in the form:

$$\boldsymbol{\mathcal{M}} = \frac{q}{2m_e}\,\boldsymbol{\mathcal{L}} \tag{27}$$

(this is a vector relation since $\boldsymbol{\mathcal{L}}$ and $\boldsymbol{\mathcal{M}}$ are parallel, as both are perpendicular
to the plane of the classical orbit).

The quantum mechanical analogue of (27) is the operator relation:

$$\mathbf{M}_1 = \frac{q}{2m_e}\,\mathbf{L} \tag{28}$$

We can therefore write $H_1$ in the form:

$$H_1 = -\mathbf{M}_1 \cdot \mathbf{B} \tag{29}$$

This confirms the interpretation given above: $H_1$ corresponds to the coupling
between the magnetic field $\mathbf{B}$ and the permanent atomic magnetic moment
($\mathbf{M}_1$ is independent of $\mathbf{B}$). $H_1$ is called the paramagnetic coupling term.

* The Zeeman effect of a three-dimensional harmonic oscillator can be calculated without
approximations (cf. problem 2 of complement $G_{VII}$). This is true because V(R) and $H_2$ then have analogous forms.
This example is interesting since it enables us to analyze the contributions of $H_1$ and $H_2$ in a soluble case.


comments: 

(i) According to (28), the eigenvalues of any component of the magnetic
moment $\mathbf{M}_1$ are of the form:

$$m\,\frac{q\hbar}{2m_e} = m\,\mu_B \tag{30}$$

where m is an integer. $\mu_B$ therefore gives the order of magnitude of the
magnetic moment associated with the orbital moment of the electron. This
is why definition (8) is useful. In the MKSA system:

$$\mu_B \simeq -9.27 \times 10^{-24}\ \text{joule/tesla} \tag{31}$$

(ii) As we shall see in chapter IX, the electron possesses, in addition to the orbital
angular momentum $\mathbf{L}$, a spin angular momentum $\mathbf{S}$. With this observable
is associated a magnetic moment, $\mathbf{M}_S$, proportional to $\mathbf{S}$:

$$\mathbf{M}_S = 2\,\frac{\mu_B}{\hbar}\,\mathbf{S} \tag{32}$$

Although the magnetic effects due to the spin are important, we shall
ignore them for now (we shall return to them in complement $D_{XII}$).

(iii) The classical argument presented above is not completely correct. We have
confused the angular momentum:

$$\boldsymbol{\mathcal{L}} = \mathbf{r} \times \mathbf{p} \tag{33}$$

with the moment of the mechanical momentum:

$$\boldsymbol{\lambda} = \mathbf{r} \times m_e \mathbf{v} = \boldsymbol{\mathcal{L}} - q\, \mathbf{r} \times \mathbf{A}(\mathbf{r}) \tag{34}$$

In fact, the error is small. As we shall see in the next section, it simply
amounts to neglecting $H_2$ relative to $H_1$.
d. INTERPRETATION OF THE DIAMAGNETIC TERM

Consider a zero angular momentum state of the hydrogen atom (for
example, the ground state). The correction supplied by $H_1$ to the energy of this
state is also zero. Thus, to determine the effect of the field $\mathbf{B}$, we must now take into
account the presence of $H_2$. How should the corresponding energy be interpreted?

We have seen (cf. complement $C_{VII}$, §2-b) that, in the presence of a uniform
magnetic field, the probability current associated with the electron is modified.
This current is cylindrically symmetrical with respect to $\mathbf{B}$. It corresponds to a
uniform rotation of the probability fluid, clockwise if q is positive and
counterclockwise if q is negative. With the corresponding electric current is then associated
a magnetic moment $\langle \mathbf{M}_2 \rangle$ antiparallel to $\mathbf{B}$, and, therefore, a positive coupling
energy, which explains the physical origin of the term $H_2$.

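To see this more precisely, we shall return to the classical argument of the 
preceding section, taking into account the fact [cf. comment (iii) of §c] that 
the magnetic moment $\boldsymbol{\mathcal{M}}$ is in fact proportional to $\boldsymbol{\lambda} = \mathbf{r} \times m_e\mathbf{v}$ (and not 
to $\boldsymbol{\mathcal{L}} = \mathbf{r} \times \mathbf{p}$): 

$$\boldsymbol{\mathcal{M}} = \frac{q}{2m_e}\,\boldsymbol{\lambda} = \frac{q}{2m_e}\left[\boldsymbol{\mathcal{L}} - q\,\mathbf{r} \times \mathbf{A}(\mathbf{r})\right] \tag{35}$$

When $\boldsymbol{\mathcal{L}}$ is zero, $\boldsymbol{\mathcal{M}}$ reduces, in gauge (2), to: 

$$\boldsymbol{\mathcal{M}}_2 = \frac{q^2}{4m_e}\,\mathbf{r} \times (\mathbf{r} \times \mathbf{B}) = \frac{q^2}{4m_e}\left[(\mathbf{r} \cdot \mathbf{B})\,\mathbf{r} - r^2\,\mathbf{B}\right] \tag{36}$$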


$\boldsymbol{\mathcal{M}}_2$ is proportional to the value of the magnetic field*. It therefore represents the 
moment induced by $\mathbf{B}$ in the atom. Its coupling energy with $\mathbf{B}$ is: 

$$W_2 = -\int_0^{\mathbf{B}} \boldsymbol{\mathcal{M}}_2(\mathbf{B}') \cdot d\mathbf{B}' = -\frac{1}{2}\,\boldsymbol{\mathcal{M}}_2(\mathbf{B}) \cdot \mathbf{B} = \frac{q^2}{8m_e}\left[r^2B^2 - (\mathbf{r} \cdot \mathbf{B})^2\right] = \frac{q^2}{8m_e}\,r_\perp^2 B^2 \tag{37}$$

as we found in (7-c). Therefore, the interpretation given above has been confirmed: 
$H_2$ describes the coupling between the field $\mathbf{B}$ and the magnetic moment $\mathbf{M}_2$ 
induced in the atom. Since the induced moment, according to Lenz's law, opposes 
the applied field, the coupling energy is positive. $H_2$ is called the diamagnetic 
term of the Hamiltonian. 


* $\boldsymbol{\mathcal{M}}_2$ is not collinear with $\mathbf{B}$. However, it can be shown that, in the ground state of the hydrogen 
atom, the mean value $\langle \mathbf{M}_2 \rangle$ of the operator associated with $\boldsymbol{\mathcal{M}}_2$ is antiparallel to $\mathbf{B}$. This is in agreement 
with the result obtained above from the structure of the probability current. 

comment: 

As we have already pointed out [cf. (18)], atomic diamagnetism is 
a weak phenomenon which is concealed by paramagnetism when both are 
present. As is shown by (37) (and the calculations of §1-b), this result 
is related to the small size of the atomic radius: for magnetic fields of the 
type usually produced, the magnetic flux intercepted by an atom is very 
small. It must not be concluded that we can always neglect $H_2$ relative to $H_1$ 
whatever the physical problem. For example, in the case of a free electron (for 
which the radius of the classical orbit would be infinite in a zero magnetic 
field), we saw in complement E VI that the contribution of the diamagnetic 
term is as important as that of the paramagnetic term. 
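To make the statement about the weakness of atomic diamagnetism concrete, the sketch below compares the order of magnitude of the diamagnetic energy (37), evaluated with $r_\perp$ set equal to the Bohr radius, to the paramagnetic scale $|\mu_B| B$. Taking $r_\perp \simeq a_0$ is an assumption made only for this estimate, and the constants are the rounded values from the book's table.

```python
ELECTRON_CHARGE = -1.602189e-19   # q, coulombs
HBAR = 1.054589e-34               # joule seconds
ELECTRON_MASS = 9.10953e-31       # kilograms
BOHR_RADIUS = 0.5291771e-10       # a_0, metres


def paramagnetic_scale(b_field):
    """Order of magnitude of H_1: |mu_B| * B, in joules."""
    return abs(ELECTRON_CHARGE) * HBAR * b_field / (2.0 * ELECTRON_MASS)


def diamagnetic_scale(b_field, r_perp=BOHR_RADIUS):
    """Order of magnitude of H_2 from (37): q^2 B^2 r_perp^2 / (8 m_e), in joules."""
    return ELECTRON_CHARGE**2 * b_field**2 * r_perp**2 / (8.0 * ELECTRON_MASS)


def diamagnetic_to_paramagnetic_ratio(b_field, r_perp=BOHR_RADIUS):
    """Ratio H_2 / H_1 = |q| B r_perp^2 / (4 hbar); it grows linearly with B."""
    return diamagnetic_scale(b_field, r_perp) / paramagnetic_scale(b_field)


if __name__ == "__main__":
    for b in (1e-4, 1.0, 10.0):   # 1 gauss, 1 tesla, 10 tesla
        print(f"B = {b:g} T: H_2/H_1 ~ {diamagnetic_to_paramagnetic_ratio(b):.2e}")
```

Because the ratio scales as $B\,r_\perp^2$, it stays very small for an atom in laboratory fields but is no longer negligible when the orbit radius is large, which is the point made above for the free electron.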


2. The Zeeman effect 

Now that we have explained the physical significance of the various terms 
appearing in the Hamiltonian, we shall look more closely at their effects on the 
spectrum of the hydrogen atom. More precisely, we shall examine the way in which 
the emission of the optical line called the “resonance line” ($\lambda \simeq 1\,200$ Å) is modified 
when the hydrogen atom is placed in a static magnetic field. We shall see that this 
changes not only the frequency, but also the polarization, of the atomic lines : 
this is what is usually called the “Zeeman effect”. 

Important comment: In reality, because of the existence of electron and proton 
spins, the resonance line of hydrogen includes several neighboring components (fine 
and hyperfine structure; cf. chap. XII). Moreover, the spin degrees of freedom 
profoundly modify the effect of a magnetic field on the various components of 
the resonance line (the Zeeman effect of the hydrogen atom is sometimes called 
“anomalous”). Since we are ignoring the effects of spin here, the following calculations 
do not truly correspond to the real physical situation. However, they can 
easily be generalized to take spins into account (cf. complement $D_{XII}$). Moreover, 
the results we shall obtain (the appearance of several Zeeman components of 
different frequencies and polarizations) remain qualitatively valid. 


a. ENERGY LEVELS OF THE ATOM IN THE PRESENCE OF THE MAGNETIC FIELD 

The resonance line of hydrogen corresponds to an atomic transition between 
the ground state 1s ($n = 1$; $l = m = 0$) and the excited state 2p ($n = 2$; $l = 1$; 
$m = +1, 0, -1$). While the angular momentum is zero in the ground state, it is 
not so in the excited state; in calculating optical line modifications in the presence 
of the magnetic field $\mathbf{B}$, we therefore make a small error by neglecting the effects 
of the diamagnetic term $H_2$, which amounts to taking $H_0 + H_1$ for the Hamiltonian. 
We denote by $|\varphi_{n,l,m}\rangle$ the common eigenstates of $H_0$ (eigenvalue 
$E_n = -E_I/n^2$), $\mathbf{L}^2$ [eigenvalue $l(l+1)\hbar^2$] and $L_z$ (eigenvalue $m\hbar$). The wave 
functions of these states are those which were calculated in chapter VII: 

$$\varphi_{n,l,m}(r, \theta, \varphi) = R_{n,l}(r)\,Y_l^m(\theta, \varphi) \tag{38}$$

We choose the Oz axis parallel to $\mathbf{B}$; it is not difficult to see that the states $|\varphi_{n,l,m}\rangle$ 
are then also eigenvectors of $H_0 + H_1$: 

$$(H_0 + H_1)\,|\varphi_{n,l,m}\rangle = \left(H_0 - \frac{\mu_B}{\hbar}\,B\,L_z\right)|\varphi_{n,l,m}\rangle = (E_n - m\,\mu_B B)\,|\varphi_{n,l,m}\rangle \tag{39}$$

If we neglect the diamagnetic term, the stationary states of the atom placed in the 
field $\mathbf{B}$ are therefore still the $|\varphi_{n,l,m}\rangle$; only the corresponding energies are 
modified. 

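In particular, for the states involved in the resonance line, we see that: 

$$(H_0 + H_1)\,|\varphi_{1,0,0}\rangle = -E_I\,|\varphi_{1,0,0}\rangle \tag{40-a}$$

$$(H_0 + H_1)\,|\varphi_{2,1,m}\rangle = \left[-E_I + \hbar(\Omega + m\,\omega_L)\right]|\varphi_{2,1,m}\rangle \tag{40-b}$$

where: 

$$\Omega = \frac{E_2 - E_1}{\hbar} = \frac{3E_I}{4\hbar} \tag{41}$$

is the angular frequency of the resonance line in a zero field. 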
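Formulas (40) and (41) are easy to evaluate numerically. The sketch below is an illustration only: it assumes the rounded constants from the book's table, takes $\omega_L = -qB/2m_e$ as the Larmor angular frequency, and uses hypothetical function names.

```python
import math

ELECTRON_CHARGE = -1.602189e-19    # q, coulombs
HBAR = 1.054589e-34                # joule seconds
ELECTRON_MASS = 9.10953e-31        # kilograms
SPEED_OF_LIGHT = 2.99792458e8      # metres per second
E_I = 13.60580 * 1.602189e-19      # hydrogen ionization energy, joules


def larmor_angular_frequency(b_field):
    """omega_L = -q B / (2 m_e), in rad/s (positive for the electron)."""
    return -ELECTRON_CHARGE * b_field / (2.0 * ELECTRON_MASS)


def resonance_angular_frequency():
    """Omega = 3 E_I / (4 hbar), the zero-field 1s-2p angular frequency (41)."""
    return 3.0 * E_I / (4.0 * HBAR)


def zeeman_energies_2p(b_field):
    """Energies of |phi_{2,1,m}> from (40-b), keyed by m, in joules."""
    omega = resonance_angular_frequency()
    omega_l = larmor_angular_frequency(b_field)
    return {m: -E_I + HBAR * (omega + m * omega_l) for m in (1, 0, -1)}


if __name__ == "__main__":
    wavelength = 2.0 * math.pi * SPEED_OF_LIGHT / resonance_angular_frequency()
    print(f"zero-field wavelength ~ {wavelength * 1e10:.0f} angstrom")
    nu_l = larmor_angular_frequency(1e-4) / (2.0 * math.pi)
    print(f"Larmor frequency in 1 gauss ~ {nu_l / 1e6:.2f} MHz")
    print(zeeman_energies_2p(1.0))
```

The three 2p levels are equally spaced by $\hbar\omega_L$, while the 1s level is unshifted in this approximation, so the resonance line splits into three components.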


b. ELECTRIC DIPOLE OSCILLATIONS 

α. Matrix elements of the operator associated with the dipole 

Let: 

$$\mathbf{D} = q\,\mathbf{R} \tag{42}$$

be the electric dipole operator of the atom. To calculate the mean value $\langle \mathbf{D} \rangle$ of this 
dipole, we begin by evaluating the matrix elements of $\mathbf{D}$. 

Under reflection through the origin, $\mathbf{D}$ is changed into $-\mathbf{D}$: the electric 
dipole is therefore an odd operator (cf. complement $F_{II}$). Now, the states $|\varphi_{n,l,m}\rangle$ 
also have a well-defined parity: since their angular dependence is given by $Y_l^m(\theta, \varphi)$, 
their parity is $+1$ if $l$ is even and $-1$ if $l$ is odd (cf. complement $A_{VI}$). It follows, 
in particular, that: 

$$\langle \varphi_{1,0,0} |\,\mathbf{D}\,| \varphi_{1,0,0} \rangle = 0$$

$$\langle \varphi_{2,1,m'} |\,\mathbf{D}\,| \varphi_{2,1,m} \rangle = 0$$

for all $m$ and $m'$. 

The non-zero matrix elements of $\mathbf{D}$ are therefore necessarily non-diagonal 
elements. To calculate the matrix elements $\langle \varphi_{2,1,m} |\,\mathbf{D}\,| \varphi_{1,0,0} \rangle$, it is convenient 
β. Calculation of the mean value of the dipole 

The results of §α indicate that, if the system is in a stationary state, the mean 
value of the operator $\mathbf{D}$ is zero. Let us assume, rather, that the state vector of the 
system is initially a linear superposition of the ground state 1s and one of the 2p states: 

$$|\psi_m(0)\rangle = \cos\alpha\,|\varphi_{1,0,0}\rangle + \sin\alpha\,|\varphi_{2,1,m}\rangle \tag{47}$$

with $m = +1$, $0$ or $-1$ ($\alpha$ is a real parameter). We then immediately obtain the 
state vector at time $t$: 

$$|\psi_m(t)\rangle = \cos\alpha\,|\varphi_{1,0,0}\rangle + \sin\alpha\,e^{-i(\Omega + m\omega_L)t}\,|\varphi_{2,1,m}\rangle \tag{48}$$

(we have omitted the global phase factor $e^{iE_I t/\hbar}$, which is of no physical consequence). 
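To calculate the mean value of the electric dipole: 

$$\langle \mathbf{D} \rangle_m(t) = \langle \psi_m(t) |\,\mathbf{D}\,| \psi_m(t) \rangle \tag{49}$$

we shall use results (46) and (48), and cite three cases ($\chi$ denotes the radial integral that enters the matrix elements (46)): 

(i) if $m = 1$, we obtain: 

$$\begin{aligned}
\langle D_x \rangle_1 &= -\frac{q\chi}{\sqrt{6}}\,\sin 2\alpha\,\cos[(\Omega + \omega_L)t] \\
\langle D_y \rangle_1 &= -\frac{q\chi}{\sqrt{6}}\,\sin 2\alpha\,\sin[(\Omega + \omega_L)t] \\
\langle D_z \rangle_1 &= 0
\end{aligned} \tag{50}$$

The vector $\langle \mathbf{D} \rangle_1(t)$ therefore rotates in the xOy plane about the Oz axis, in the 
counterclockwise direction and with the angular velocity $\Omega + \omega_L$. 

(ii) if $m = 0$: 

$$\begin{aligned}
\langle D_x \rangle_0 &= \langle D_y \rangle_0 = 0 \\
\langle D_z \rangle_0 &= \frac{q\chi}{\sqrt{3}}\,\sin 2\alpha\,\cos \Omega t
\end{aligned} \tag{51}$$

The motion of $\langle \mathbf{D} \rangle_0(t)$ is now a linear oscillation along the Oz axis, of angular 
frequency $\Omega$. 

(iii) if $m = -1$: 

$$\begin{aligned}
\langle D_x \rangle_{-1} &= \frac{q\chi}{\sqrt{6}}\,\sin 2\alpha\,\cos[(\Omega - \omega_L)t] \\
\langle D_y \rangle_{-1} &= -\frac{q\chi}{\sqrt{6}}\,\sin 2\alpha\,\sin[(\Omega - \omega_L)t] \\
\langle D_z \rangle_{-1} &= 0
\end{aligned} \tag{52}$$

The vector $\langle \mathbf{D} \rangle_{-1}(t)$ again rotates in the xOy plane about Oz, but this time in 
the clockwise direction and with the angular velocity $\Omega - \omega_L$. 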
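The three motions described in (i), (ii) and (iii) can be reproduced numerically from the superposition (48). The sketch below is only an illustration under stated assumptions: the radial integral is set to 1 and $q$ to $-1$ (arbitrary units), the spherical harmonics use the usual sign convention $Y_1^{\pm 1} = \mp\sqrt{3/8\pi}\,\sin\theta\,e^{\pm i\varphi}$, and the angular integrals are done by a simple midpoint quadrature; the function names are hypothetical.

```python
import numpy as np


def angular_elements(m, n_theta=400, n_phi=64):
    """<Y_1^m| (x/r, y/r, z/r) |Y_0^0> by midpoint quadrature over the sphere."""
    theta = (np.arange(n_theta) + 0.5) * np.pi / n_theta
    phi = (np.arange(n_phi) + 0.5) * 2.0 * np.pi / n_phi
    th, ph = np.meshgrid(theta, phi, indexing="ij")
    y00 = 1.0 / np.sqrt(4.0 * np.pi)
    if m == 0:
        y1m = np.sqrt(3.0 / (4.0 * np.pi)) * np.cos(th) + 0j
    else:  # m = +1 or -1
        y1m = -m * np.sqrt(3.0 / (8.0 * np.pi)) * np.sin(th) * np.exp(1j * m * ph)
    unit = (np.sin(th) * np.cos(ph), np.sin(th) * np.sin(ph), np.cos(th))
    weight = np.sin(th) * (np.pi / n_theta) * (2.0 * np.pi / n_phi)
    return np.array([np.sum(np.conj(y1m) * u * y00 * weight) for u in unit])


def mean_dipole(m, t, alpha, omega, omega_l, q=-1.0, radial_integral=1.0):
    """Mean value (D_x, D_y, D_z) in the state (48) at time t."""
    d_2p_1s = q * radial_integral * angular_elements(m)   # <2p_m| D |1s>
    phase = np.exp(-1j * (omega + m * omega_l) * t)
    # <psi|D|psi> = sin(2 alpha) * Re[ phase * <1s| D |2p_m> ]
    return np.sin(2.0 * alpha) * np.real(phase * np.conj(d_2p_1s))


if __name__ == "__main__":
    for m in (1, 0, -1):
        for t in (0.0, 0.5):
            print(m, t, np.round(mean_dipole(m, t, np.pi / 4, 1.0, 0.1), 4))
```

For $m = +1$ the tip of the vector moves counterclockwise in the xOy plane, for $m = -1$ clockwise, and for $m = 0$ it oscillates along Oz, as stated in (50), (51) and (52).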


c. FREQUENCY AND POLARIZATION OF EMITTED RADIATION 

In the three cases (m = + 1,0 and — 1), the mean value of the electric dipole 
is an oscillating function of time. It is clear that such a dipole radiates electromagnetic 
energy. 

Since the atomic dimensions are negligible compared to the optical wavelength, 
the atom’s radiation at great distances can be treated like that of a dipole. We shall 
assume that the characteristics of the light emitted (or absorbed) by the atom 
during transition between a state $|\varphi_{2,1,m}\rangle$ and the ground state are correctly given 
by the classical calculation of radiation of a dipole* which is equal to the mean 
quantum mechanical value $\langle \mathbf{D} \rangle_m(t)$. 

To state the problem precisely, we shall assume that we want to study the 
radiation emitted by a sample containing a great number of hydrogen atoms, 
which have somehow been excited into a 2p state. In most experiments actually 
performed, the excitation of the atoms is isotropic, and the three states $|\varphi_{2,1,1}\rangle$, 
$|\varphi_{2,1,0}\rangle$ and $|\varphi_{2,1,-1}\rangle$ occur with equal probability. Therefore, we shall begin 
by calculating the radiation diagram for each of the cases of the preceding 
sections. Then we shall obtain the radiation actually emitted by the atomic 
system by taking, for each spatial direction, the sum of the light intensities 
emitted in each case. 

(i) If $m = 1$, the angular frequency of the emitted radiation is $(\Omega + \omega_L)$. 
The optical line frequency is therefore slightly shifted by the magnetic field. In 
accordance with the laws of classical electromagnetism applied to a rotating 
dipole such as $\langle \mathbf{D} \rangle_1(t)$, the radiation emitted in the Oz direction is circularly 
polarized (the corresponding polarization is called $\sigma_+$). However, the radiation 
emitted in a direction of the xOy plane is linearly polarized (parallel to this plane). 
In other directions, the polarization is elliptical. 

(ii) If $m = 0$, we must consider a dipole oscillating linearly along Oz, with 
angular frequency $\Omega$, that is, the same as in a zero field. The wavelength of the 
radiation is therefore not changed by the field $\mathbf{B}$. Its polarization is always linear, 
whatever the propagation direction being considered. For example, for a propagation 
direction situated in the xOy plane, this polarization is parallel to Oz ($\pi$ polarization). 
No radiation is emitted in the Oz direction (an oscillating linear dipole 
does not radiate along its axis). 

(iii) If $m = -1$, the results are analogous to those for $m = 1$. The only 
difference is that the angular frequency of the radiation is $(\Omega - \omega_L)$ instead 
of $(\Omega + \omega_L)$, and the dipole rotates in the opposite direction; this changes, for 
example, the direction of the circular polarization ($\sigma_-$ polarization). 

If we now assume that there are equal numbers of excited atoms in the three 
states $m = +1$, $0$ and $-1$, we see that: 

- in an arbitrary spatial direction, three optical frequencies are emitted: 
$\Omega/2\pi$, $(\Omega \pm \omega_L)/2\pi$. The polarization associated with the first one is linear, and 
that associated with the others is, in general, elliptical; 

- in a direction perpendicular to the field $\mathbf{B}$, the three polarizations are 
linear (cf. fig. 2). The first one is parallel to $\mathbf{B}$, and the other two are perpendicular. 
The intensity of the central line is twice that of each of the shifted lines [cf. formulas 
(50), (51) and (52)]. In a direction parallel to $\mathbf{B}$, only the two shifted frequencies 
$(\Omega \pm \omega_L)/2\pi$ are emitted, and the associated light polarizations are both 
circular but opposite in direction (cf. fig. 3). 
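The intensity ratios quoted above follow from the classical result that a dipole oscillating with complex amplitude $\mathbf{d}$ radiates, in the direction of the unit vector $\mathbf{n}$, a time-averaged intensity proportional to $|\mathbf{d}|^2 - |\mathbf{n} \cdot \mathbf{d}|^2$. The sketch below applies this to the amplitudes read off from (50), (51) and (52); the radial integral and $q$ are set to arbitrary unit values, and the helper names are hypothetical.

```python
import numpy as np


def dipole_amplitude(m, q=-1.0, radial_integral=1.0):
    """Complex amplitude d with <D>_m(t) proportional to Re[d * exp(-i w t)]."""
    scale = q * radial_integral
    if m == 1:
        return -scale / np.sqrt(6.0) * np.array([1.0, 1.0j, 0.0])
    if m == -1:
        return scale / np.sqrt(6.0) * np.array([1.0, -1.0j, 0.0])
    return scale / np.sqrt(3.0) * np.array([0.0, 0.0, 1.0], dtype=complex)


def relative_intensity(m, direction):
    """Time-averaged intensity (arbitrary units) radiated along `direction`."""
    n = np.asarray(direction, dtype=float)
    n = n / np.linalg.norm(n)
    d = dipole_amplitude(m)
    return float(np.vdot(d, d).real - abs(np.dot(n, d)) ** 2)


if __name__ == "__main__":
    for label, n in (("perpendicular to B", (1, 0, 0)), ("parallel to B", (0, 0, 1))):
        print(label, {m: round(relative_intensity(m, n), 4) for m in (1, 0, -1)})
```

Perpendicular to $\mathbf{B}$, the unshifted component comes out twice as intense as each shifted one; along $\mathbf{B}$, the unshifted component vanishes.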


* If we wanted to treat the problem entirely quantum mechanically, we should have to use the 
quantum mechanical theory of radiation. In particular, the return of the atom to the ground state by 
spontaneous emission of a photon could only be understood in the framework of this theory. However, 
the results we shall obtain here semi-classically would remain essentially valid as far as radiation is 
concerned. 
FIGURE 2 

The Zeeman components of the resonance line 
of hydrogen observed in a direction perpendicular 
to the magnetic field $\mathbf{B}$ (ignoring electron 
spin). We obtain a component of unshifted 
frequency $\nu$, polarized parallel to $\mathbf{B}$, and two 
components shifted by $\pm\,\omega_L/2\pi$, polarized 
perpendicularly to $\mathbf{B}$. 


FIGURE 3 

When the observation is carried out along the 
direction of the field $\mathbf{B}$, only two Zeeman 
components are obtained, circularly polarized 
in opposite directions and shifted by $\pm\,\omega_L/2\pi$. 


comment: 

The atom therefore emits $\sigma_+$-polarized radiation in going from the 
state $|\varphi_{2,1,1}\rangle$ to the state $|\varphi_{1,0,0}\rangle$, $\sigma_-$ in going from $|\varphi_{2,1,-1}\rangle$ to $|\varphi_{1,0,0}\rangle$, 
and $\pi$ in going from $|\varphi_{2,1,0}\rangle$ to $|\varphi_{1,0,0}\rangle$. Formulas (46) furnish a simple 
rule for finding these polarizations. Consider the operators $D_x + iD_y$, 
$D_x - iD_y$ and $D_z$; their only non-zero matrix elements between the 2p and 
1s states taken in this order are: 

$$\langle \varphi_{2,1,1} |\,D_x + iD_y\,| \varphi_{1,0,0} \rangle, \quad \langle \varphi_{2,1,-1} |\,D_x - iD_y\,| \varphi_{1,0,0} \rangle \quad \text{and} \quad \langle \varphi_{2,1,0} |\,D_z\,| \varphi_{1,0,0} \rangle$$

To the $\sigma_+$, $\sigma_-$ and $\pi$ polarizations, therefore, correspond the operators 
$D_x + iD_y$, $D_x - iD_y$ and $D_z$, respectively. This is a general rule: there 
is emission of electric dipole radiation when the operator $\mathbf{D}$ has a non-zero 
matrix element between the atom’s initial state and its final state. The polarization 
of this radiation is $\sigma_+$, $\sigma_-$ or $\pi$ depending on whether the non-zero 
matrix element* is that of $D_x + iD_y$, $D_x - iD_y$ or $D_z$. 


References and suggestions for further reading: 

Paramagnetism and diamagnetism: Feynman II (7.2), chaps. 34 and 35; 
Cagnac and Pebay-Peyroula (11.2), chaps. VIII and IX; Kittel (13.2), chap. 14; Slater (1.6), 
chap. 14; Flügge (1.24), §§128 and 160. 

Dipole radiation: Cagnac and Pebay-Peyroula (11.2), Annex III; Panofsky and 
Phillips (7.6), §14-7; Jackson (7.5), §9-2. 

Angular momentum of radiation and selection rules : Cagnac and Pebay-Peyroula 
(11.2), chap. XI. 


* The order of the states in the matrix element must be respected in order not to confuse $\sigma_+$ with $\sigma_-$. 
Complement $E_{VII}$ 

SOME ATOMIC ORBITALS. 
HYBRID ORBITALS 


1. Introduction 

2. Atomic orbitals associated with real wave functions 

a. s orbitals ($l = 0$) 

b. p orbitals ($l = 1$) 

c. Other values of $l$ 

3. sp hybridization 

a. Introduction of sp hybrid orbitals 

b. Properties of sp hybrid orbitals 

c. Example: the structure of acetylene 

4. $sp^2$ hybridization 

a. Introduction of $sp^2$ hybrid orbitals 

b. Properties of $sp^2$ hybrid orbitals 

c. Example: the structure of ethylene 

5. $sp^3$ hybridization 

a. Introduction of $sp^3$ hybrid orbitals 

b. Properties of $sp^3$ hybrid orbitals 

c. Example: the structure of methane 


1. Introduction 

In §C of chapter VII, we determined an orthonormal basis of stationary 
states for the electron of the hydrogen atom. The corresponding wave functions 
are: 

$$\varphi_{n,l,m}(\mathbf{r}) = R_{n,l}(r)\,Y_l^m(\theta, \varphi) \tag{1}$$

and the quantum numbers $n$, $l$, $m$ refer, respectively, to the energy $E_n = -E_I/n^2$, 
the square of the angular momentum $l(l+1)\hbar^2$, and the Oz-component of the angular 
momentum $m\hbar$. 

By linearly superposing stationary states of the same energy, that is, of the 
same quantum number $n$, we can construct new stationary states which no longer 
necessarily correspond to well-defined values of $l$ and $m$. In this complement, 
we intend to study the properties of some of these new stationary states, in 
particular, the angular dependence of the associated wave functions. 

The wave functions (1) are often called atomic orbitals. A linear superposition 
of orbitals of the same $n$ but different $l$ and $m$ is called a hybrid orbital. We shall see 
that a hybrid orbital can extend further in certain spatial directions than the (pure) 
orbitals from which it is constructed. It is this property, important in the formation 
of chemical bonds, which justifies the introduction of hybrid orbitals. 

Although the calculations presented in this complement are rigorously valid 
only for the hydrogen atom, we shall also indicate qualitatively how they explain 
the geometrical structure of the various bonds formed by an atom with several 
valence electrons. 
2. Atomic orbitals associated with real wave functions 

In expression (1), the radial function $R_{n,l}(r)$ is real. However, $Y_l^m(\theta, \varphi)$, except 
for $m = 0$, is a complex function of $\varphi$, since: 

$$Y_l^m(\theta, \varphi) = F_l^m(\theta)\,e^{im\varphi} \tag{2}$$

where $F_l^m(\theta)$ is a real function of $\theta$. 

Atomic orbitals are therefore generally complex functions. By superposing 
the $\varphi_{n,l,m}(\mathbf{r})$ and $\varphi_{n,l,-m}(\mathbf{r})$ orbitals, one can, however, construct real orbitals whose 
advantage is their simple angular dependence, which can be represented graphically 
without having to take the square of the modulus of the wave function (as we did 
in §C-4-c-α of chapter VII). 

a. s ORBITALS ($l = 0$) 

When $l = m = 0$, the wave function $\varphi_{n,0,0}(\mathbf{r})$ is real, and we say we are dealing 
with an “s orbital”. We shall denote the corresponding stationary state by $|\varphi_{ns}\rangle$. 
To represent the angular dependence of the ns orbital, we fix $r$ and measure off in each 
direction of polar angles $\theta$ and $\varphi$ a line segment of length $\varphi_{ns}(r, \theta, \varphi)$. The surface 
obtained by varying $\theta$ and $\varphi$ is a sphere centered at O (fig. 1). 



FIGURE 1 

An s orbital is spherically symmetrical: the wave 
function depends on neither $\theta$ nor $\varphi$. 


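b. p ORBITALS ($l = 1$) 


α. $p_z$, $p_x$, $p_y$ orbitals 

If we use the expression for the three spherical harmonics $Y_1^m(\theta, \varphi)$ given in 
complement $A_{VI}$ [formulas (32)], we obtain, for the three atomic orbitals $\varphi_{n,1,m}(\mathbf{r})$ 
corresponding to $l = 1$: 

$$\varphi_{n,1,1}(\mathbf{r}) = -\sqrt{\frac{3}{8\pi}}\,R_{n,1}(r)\,\sin\theta\,e^{i\varphi} \tag{3-a}$$

$$\varphi_{n,1,0}(\mathbf{r}) = \sqrt{\frac{3}{4\pi}}\,R_{n,1}(r)\,\cos\theta \tag{3-b}$$

$$\varphi_{n,1,-1}(\mathbf{r}) = \sqrt{\frac{3}{8\pi}}\,R_{n,1}(r)\,\sin\theta\,e^{-i\varphi} \tag{3-c}$$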
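We now form the three linear superpositions: 

$$\varphi_{n,1,0}(\mathbf{r}) \tag{4-a}$$

$$-\frac{1}{\sqrt{2}}\left[\varphi_{n,1,1}(\mathbf{r}) - \varphi_{n,1,-1}(\mathbf{r})\right] \tag{4-b}$$

$$\frac{i}{\sqrt{2}}\left[\varphi_{n,1,1}(\mathbf{r}) + \varphi_{n,1,-1}(\mathbf{r})\right] \tag{4-c}$$

It is easy to see that the three preceding wave functions can also be written: 

$$\sqrt{\frac{3}{4\pi}}\,R_{n,1}(r)\,\frac{z}{r} \tag{5-a}$$

$$\sqrt{\frac{3}{4\pi}}\,R_{n,1}(r)\,\frac{x}{r} \tag{5-b}$$

$$\sqrt{\frac{3}{4\pi}}\,R_{n,1}(r)\,\frac{y}{r} \tag{5-c}$$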



FIGURE 2 

Two possible representations of a $p_z$ orbital ($l = 1$, $m = 0$). 

fig. a: angular dependence of this orbital. With $r$ fixed, we display $|\varphi_{n,l=1,m=0}(r, \theta, \varphi)|$ for each 
direction $\theta$, $\varphi$. Thus we obtain two spheres, tangential at O to the xOy plane. The sign indicated on 
each of them is that of the wave function (which is real). 

fig. b: cross sections in the xOz plane of a family of surfaces, each one corresponding to a given value 
for $|\varphi_{n,l=1,m=0}(r, \theta, \varphi)|$ [we have chosen values equal to 0.2, 0.6, 0.9 times the maximum value of $|\varphi|$ 
at points A and B]. These are surfaces of revolution about Oz. The sign indicated is that of the 
wave function (which is real). Unlike the one in figure a, the representation in figure b depends on 
the radial part of the wave function (the one chosen here corresponds to the $n = 2$ state of the hydrogen 
atom). 
These are real functions of $r$, $\theta$, $\varphi$ which, like the $\varphi_{n,l,m}(\mathbf{r})$, are orthonormal and 
form a basis in the subspace $\mathcal{E}_{n,l=1}$. They are called, respectively, “$p_z$, $p_x$, $p_y$ orbitals”. 
We shall write the wave functions (5) $\varphi_{np_z}(\mathbf{r})$, $\varphi_{np_x}(\mathbf{r})$ and $\varphi_{np_y}(\mathbf{r})$. 

Two distinct geometrical representations enable us to visualize the form 
of an orbital $\psi(r, \theta, \varphi)$. First of all, if we are interested in the angular dependence 
of the orbital, we fix $r$ and measure off a line segment of length $|\psi(r, \theta, \varphi)|$ along 
each direction of polar angles $\theta$ and $\varphi$. Thus, the angular dependence of the $2p_z$ orbital 
is that of $z/r = \cos\theta$. As $\varphi$ varies between 0 and $2\pi$, and $\theta$ varies between 0 and $\pi$, 
the end of the line segment of length $|\cos\theta|$ drawn in the direction of polar 
angles $\theta$ and $\varphi$ describes two spheres centered on the Oz axis, tangential at O to 
the xOy plane and mirror images with respect to the xOy plane (fig. 2-a). The sign 
indicated in the figure is that of the wave function, which is real. Another possible 
representation of the orbital $\psi(r, \theta, \varphi)$ is obtained by tracing a family of surfaces, 
each one corresponding to a given value of $|\psi(r, \theta, \varphi)|$ (surfaces of equal probability 
density). This is what is done for the $2p_z$ orbital in figure 2-b (here again, the sign 
indicated is that of the wave function, which is real). In the rest of this complement, 
we shall use one or the other of these two representations. 

The $p_x$ and $p_y$ orbitals can be obtained respectively from the $p_z$ orbitals by 
rotations through angles of $+\pi/2$ and $-\pi/2$ about Oy and Ox (cf. figures 3 and 4, 
which use a geometric representation identical to the one in figure 2-a). 



FIGURE 3 

Angular dependence of a p x orbital (the repre¬ 
sentation adopted is that of figure 2-a). 



FIGURE 4 

Angular dependence of a p y orbital. 


Unlike an s orbital, which is spherically symmetrical, the p z , p x , p y orbitals 
therefore point along the Oz, Ox, Oy axes, respectively. 
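The passage from the complex orbitals (3) to the real forms (5) can be checked numerically on the angular factors alone. The following sketch assumes the sign convention of formulas (3) for $Y_1^{\pm 1}$ and drops the common radial factor $R_{n,1}(r)$; the function names are illustrative.

```python
import numpy as np


def y1(m, theta, phi):
    """Spherical harmonic Y_1^m with the sign convention of formulas (3)."""
    if m == 0:
        return np.sqrt(3.0 / (4.0 * np.pi)) * np.cos(theta) + 0j
    return -m * np.sqrt(3.0 / (8.0 * np.pi)) * np.sin(theta) * np.exp(1j * m * phi)


def real_p_angular(theta, phi):
    """Angular parts of the superpositions (4-a), (4-b), (4-c): p_z, p_x, p_y."""
    p_z = y1(0, theta, phi)
    p_x = -(y1(1, theta, phi) - y1(-1, theta, phi)) / np.sqrt(2.0)
    p_y = 1j * (y1(1, theta, phi) + y1(-1, theta, phi)) / np.sqrt(2.0)
    return p_z, p_x, p_y


def cartesian_p_angular(theta, phi):
    """Angular parts of (5): sqrt(3/4pi) times z/r, x/r, y/r."""
    norm = np.sqrt(3.0 / (4.0 * np.pi))
    return (norm * np.cos(theta),
            norm * np.sin(theta) * np.cos(phi),
            norm * np.sin(theta) * np.sin(phi))


if __name__ == "__main__":
    theta, phi = 0.7, 1.9   # an arbitrary direction
    for lhs, rhs in zip(real_p_angular(theta, phi), cartesian_p_angular(theta, phi)):
        print(f"{lhs.real:+.6f} (imag {lhs.imag:+.1e})  vs  {rhs:+.6f}")
```

The imaginary parts vanish to rounding error, confirming that the three superpositions are real and proportional to $z/r$, $x/r$ and $y/r$.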
β. $p_u$ orbitals 

The choice of the Ox, Oy, Oz axes is obviously arbitrary. By linearly 
superposing the $p_x$, $p_y$ and $p_z$ orbitals, we should therefore be able to construct 
a $p_u$ orbital having the same form but directed along an arbitrary Ou axis. 

Let Ou be such an axis, forming angles $\alpha$, $\beta$, $\gamma$ with Ox, Oy, Oz. We have, 
obviously: 

$$\cos^2\alpha + \cos^2\beta + \cos^2\gamma = 1 \tag{6}$$

Consider the state: 

$$\cos\alpha\,|\varphi_{np_x}\rangle + \cos\beta\,|\varphi_{np_y}\rangle + \cos\gamma\,|\varphi_{np_z}\rangle \tag{7}$$

which, according to (6), is normalized. We can, using formulas (5), put the 
corresponding wave function in the form: 

$$\sqrt{\frac{3}{4\pi}}\,R_{n,1}(r)\,\frac{x\cos\alpha + y\cos\beta + z\cos\gamma}{r} = \sqrt{\frac{3}{4\pi}}\,R_{n,1}(r)\,\frac{u}{r} \tag{8}$$

where: 

$$u = x\cos\alpha + y\cos\beta + z\cos\gamma \tag{9}$$

is the projection of $\mathbf{r}$ onto the Ou axis. Comparison with (5) indicates that the orbital 
so constructed is indeed a $p_u$ orbital. 

Therefore, any real and normalized linear superposition of $p_x$, $p_y$ and $p_z$ 
orbitals: 

$$\lambda\,\varphi_{np_x}(\mathbf{r}) + \mu\,\varphi_{np_y}(\mathbf{r}) + \nu\,\varphi_{np_z}(\mathbf{r}) \tag{10}$$

can be considered to be a $p_u$ orbital directed along the Ou direction, defined by: 

$$\cos\alpha = \lambda, \qquad \cos\beta = \mu, \qquad \cos\gamma = \nu \tag{11}$$
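As an illustration of (10) and (11), the sketch below builds the angular part of a $p_u$ orbital from arbitrary real coefficients and checks that it points along the corresponding Ou direction. The coefficients are normalized inside the function so that (6) holds; the radial factor $R_{n,1}(r)$ is omitted and the function name is hypothetical.

```python
import numpy as np


def p_u_angular(coefficients, theta, phi):
    """Angular part of lambda*p_x + mu*p_y + nu*p_z in the direction (theta, phi)."""
    lam, mu, nu = np.asarray(coefficients, dtype=float) / np.linalg.norm(coefficients)
    x_r = np.sin(theta) * np.cos(phi)
    y_r = np.sin(theta) * np.sin(phi)
    z_r = np.cos(theta)
    # Same form as (8): sqrt(3/4pi) * u / r, with u the projection of r on Ou.
    return np.sqrt(3.0 / (4.0 * np.pi)) * (lam * x_r + mu * y_r + nu * z_r)


if __name__ == "__main__":
    coeffs = (1.0, 1.0, 0.0)                      # Ou along the bisector of Ox and Oy
    print(p_u_angular(coeffs, np.pi / 2, np.pi / 4))       # along Ou: maximum value
    print(p_u_angular(coeffs, np.pi / 2, 5 * np.pi / 4))   # opposite to Ou: minimum
    print(p_u_angular(coeffs, 0.0, 0.0))                   # along Oz: zero
```

The function takes its largest value, $\sqrt{3/4\pi}$, along Ou, the opposite value in the reverse direction, and vanishes in the plane perpendicular to Ou, exactly as a $p_z$ orbital does with respect to Oz.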


γ. Example: structure of the $\mathrm{H_2O}$ and $\mathrm{H_3N}$ molecules 

To a first approximation (cf. complement $A_{XIV}$), in a many-electron atom, each electron 
can be considered to move independently of the others in a central potential $V_c(r)$ which is the 
sum of the electrostatic attractive potential of the nucleus and a “mean potential” due to the 
repulsion of the other electrons. Each electron can therefore be found in a state characterized 
by three quantum numbers, $n$, $l$, $m$. However, since the potential $V_c(r)$ no longer varies exactly 
like $1/r$, the energy no longer depends only on $n$, but also on $l$. We shall see in complement $A_{XIV}$ 
that the energy of the 2s state is slightly lower than that of the 2p state; the 3s state is also lower 
than the 3p state, which is, in turn, lower than the 3d state, etc. 

The existence of spin and Pauli’s principle (which we shall study in chapters IX and XIV) 
imply that the 1s, 2s, ... sub-shells can contain only two electrons; the 2p, 3p, ... sub-shells, six 
electrons, ...; the $nl$ sub-shells, $2(2l + 1)$ electrons (the factor $2l + 1$ arises from the degeneracy 
related to $L_z$, and the factor 2, from the electron spin). 
Thus, for the oxygen atom, which has eight electrons, the 1s and 2s sub-shells are filled 
and contain four electrons in all. The four remaining electrons are in the 2p sub-shell: two 
of them (with opposite spins) can fill one of the three 2p orbitals, for example, $2p_z$; the other two 
are then distributed in the remaining $2p_x$ and $2p_y$ orbitals. These last two electrons are the 
valence electrons: they are “unpaired”, which means that the orbitals in which they can be 
found can accept another electron. The $2p_x$ and $2p_y$ wave functions of the valence electrons of 
oxygen are therefore directed along two perpendicular axes. Now, it can be shown that, the 
greater the overlapping of the wave functions of the two electrons participating in a chemical 
bond, the greater the stability of this bond. The two hydrogen atoms which will bind with the 
oxygen atom to form a water molecule must therefore be centered respectively on the Ox and 
Oy axes. Then the spherical 1s orbital of the valence electron of each hydrogen will maximally 
overlap one of the $2p_x$ and $2p_y$ orbitals of the valence electrons of oxygen. Figure 5 represents 
the shape of the probability clouds associated with the valence electrons of the oxygen and 
hydrogen atoms in the water molecule. The graphical representation used is analogous to the 
one in figure 2-b. We have drawn, for each electron, a surface defined as follows: the probability 
density has the same value at all points of this surface; this value is chosen in such a way that 
the total probability contained inside the surface has a fixed value close to 1 (0.9, for example). 

The preceding argument enables us to understand the form of the $\mathrm{H_2O}$ molecule. The 
angle between the two OH bonds should be close to 90°. Actually, the angle found experimentally 
is 104°. The deviation from the value 90° arises, in part, from the electrostatic repulsion 
between the two protons of the hydrogen atoms, which tends to open up the angle between the 
two OH bonds*. 



FIGURE 5 

Schematic structure of the water molecule 
$\mathrm{H_2O}$. The $2p_x$ and $2p_y$ orbitals yield 
bonds making an angle of approximately 
90° (the real angle is 104° because of the 
electrostatic repulsion between the two 
protons). 


An analogous argument explains the pyramidal form of the $\mathrm{NH_3}$ molecule. The three 
valence electrons of nitrogen occupy the $2p_x$, $2p_y$, $2p_z$ orbitals, directed at right angles to each 
other. Here again, the electrostatic repulsion between the protons of the three hydrogen atoms 
causes the bond angle to go from 90° to 108° (through a slight hybridization of the 2s and 
2p orbitals). 


* The opening up of the angle between the two OH bonds can be described as the result of a slight $sp^3$ 
hybridization of the 2p and 2s orbitals (cf. §5). 

c. OTHER VALUES OF $l$ 

We have confined ourselves thus far to the s and p orbitals. Actually, an 
orthonormal basis of real orbitals can be constructed for every value of $l$. If we 
note that [cf. relation (D-29) of chapter VI]: 

$$\left[Y_l^m(\theta, \varphi)\right]^* = (-1)^m\,Y_l^{-m}(\theta, \varphi) \tag{12}$$

we immediately see that (for $m \neq 0$) the two complex functions $\varphi_{n,l,m}(\mathbf{r})$ and 
$\varphi_{n,l,-m}(\mathbf{r})$ can be replaced by the two functions: 

$$\frac{1}{\sqrt{2}}\left[\varphi_{n,l,m}(\mathbf{r}) + (-1)^m\,\varphi_{n,l,-m}(\mathbf{r})\right] \tag{13-a}$$

$$\frac{i}{\sqrt{2}}\left[\varphi_{n,l,m}(\mathbf{r}) - (-1)^m\,\varphi_{n,l,-m}(\mathbf{r})\right] \tag{13-b}$$

which are real and orthonormal. 

Thus, for $l = 2$ (“d orbitals”), we can construct five real orbitals, for which 
the angular dependence is given by: 

$$\frac{1}{\sqrt{2}}\,(3\cos^2\theta - 1), \quad \sqrt{6}\,\sin\theta\cos\theta\cos\varphi, \quad \sqrt{6}\,\sin\theta\cos\theta\sin\varphi, \quad \sqrt{\frac{3}{2}}\,\sin^2\theta\cos 2\varphi, \quad \sqrt{\frac{3}{2}}\,\sin^2\theta\sin 2\varphi$$

($d_{3z^2 - r^2}$, $d_{zx}$, $d_{zy}$, $d_{x^2 - y^2}$, $d_{xy}$ orbitals). 

The form of these orbitals is a little more complicated than that of the s and p 
orbitals to which we shall confine ourselves here. However, it is possible to apply 
to them arguments of the same type as those below. 
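The orthonormality of the five real d functions can be verified by integrating over the sphere. In the sketch below, the common factor $\sqrt{5/8\pi}$ multiplying the five angular expressions is an assumption chosen so that each function is normalized; the integration uses Gauss-Legendre nodes in $\cos\theta$ (NumPy's `leggauss`) and a uniform grid in $\varphi$.

```python
import numpy as np


def real_d_angular(theta, phi):
    """The five real l = 2 angular functions, stacked along the first axis."""
    s, c = np.sin(theta), np.cos(theta)
    norm = np.sqrt(5.0 / (8.0 * np.pi))   # assumed common normalization factor
    return norm * np.array([
        (3.0 * c**2 - 1.0) / np.sqrt(2.0),        # d_{3z^2 - r^2}
        np.sqrt(6.0) * s * c * np.cos(phi),       # d_{zx}
        np.sqrt(6.0) * s * c * np.sin(phi),       # d_{zy}
        np.sqrt(1.5) * s**2 * np.cos(2.0 * phi),  # d_{x^2 - y^2}
        np.sqrt(1.5) * s**2 * np.sin(2.0 * phi),  # d_{xy}
    ])


def gram_matrix(n_theta=16, n_phi=32):
    """Matrix of overlap integrals of the five functions over the unit sphere."""
    x, w = np.polynomial.legendre.leggauss(n_theta)   # nodes in cos(theta)
    theta = np.arccos(x)
    phi = 2.0 * np.pi * np.arange(n_phi) / n_phi
    th, ph = np.meshgrid(theta, phi, indexing="ij")
    weights = np.outer(w, np.full(n_phi, 2.0 * np.pi / n_phi))
    f = real_d_angular(th, ph)                        # shape (5, n_theta, n_phi)
    return np.einsum("iab,jab,ab->ij", f, f, weights)


if __name__ == "__main__":
    print(np.round(gram_matrix(), 10))
```

The overlap matrix comes out as the 5 × 5 identity to rounding error, which confirms that the relative coefficients $1/\sqrt{2}$, $\sqrt{6}$ and $\sqrt{3/2}$ are mutually consistent.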


3. sp hybridization 

a. INTRODUCTION OF sp HYBRID ORBITALS 

Returning to the hydrogen atom, we shall consider the subspace $\mathcal{E}_{ns} \oplus \mathcal{E}_{np}$, 
subtended by the four real orbitals $\varphi_{ns}(\mathbf{r})$, $\varphi_{np_x}(\mathbf{r})$, $\varphi_{np_y}(\mathbf{r})$ and $\varphi_{np_z}(\mathbf{r})$ (which 
correspond to the same energy). We shall show that, by linearly superposing ns 
and np orbitals, we can construct other real orbitals, which form an orthonormal 
basis in $\mathcal{E}_{ns} \oplus \mathcal{E}_{np}$ and possess some interesting properties. 

We shall begin by linearly superposing the $\varphi_{ns}(\mathbf{r})$ and $\varphi_{np_z}(\mathbf{r})$ orbitals alone, 
without using $\varphi_{np_x}(\mathbf{r})$ and $\varphi_{np_y}(\mathbf{r})$. We shall therefore replace the two functions 
$\varphi_{ns}(\mathbf{r})$ and $\varphi_{np_z}(\mathbf{r})$ by the two real orthonormal linear combinations: 

$$\cos\alpha\;\varphi_{ns}(\mathbf{r}) + \sin\alpha\;\varphi_{np_z}(\mathbf{r}) \tag{14-a}$$

$$\sin\alpha\;\varphi_{ns}(\mathbf{r}) - \cos\alpha\;\varphi_{np_z}(\mathbf{r}) \tag{14-b}$$
In addition, we shall require the two orbitals (14-a) and (14-b) to have the 
same geometrical form. Since this form depends only on the relative amounts of 
the s and p orbitals in the linear superposition, we see immediately that we must 
have $\sin\alpha = \cos\alpha$, that is, $\alpha = \pi/4$. The two new orbitals we are introducing are 
therefore of the form: 

$$\varphi_{n,s,p_z}(\mathbf{r}) = \frac{1}{\sqrt{2}}\left[\varphi_{ns}(\mathbf{r}) + \varphi_{np_z}(\mathbf{r})\right] \tag{15-a}$$

$$\varphi'_{n,s,p_z}(\mathbf{r}) = \frac{1}{\sqrt{2}}\left[\varphi_{ns}(\mathbf{r}) - \varphi_{np_z}(\mathbf{r})\right] \tag{15-b}$$

and correspond to what is called “sp hybridization”. Thus we have constructed 
a new orthonormal basis of $\mathcal{E}_{ns} \oplus \mathcal{E}_{np}$, composed of $\varphi_{n,s,p_z}(\mathbf{r})$, $\varphi'_{n,s,p_z}(\mathbf{r})$, $\varphi_{np_x}(\mathbf{r})$ 
and $\varphi_{np_y}(\mathbf{r})$. 

b. PROPERTIES OF sp HYBRID ORBITALS 

To study the angular dependence of the $\varphi_{n,s,p_z}(\mathbf{r})$ and $\varphi'_{n,s,p_z}(\mathbf{r})$ hybrid 
orbitals, we shall choose a given value $r_0$ of $r$ and set: 

$$\lambda = \frac{1}{\sqrt{4\pi}}\,R_{n,0}(r_0), \qquad \mu = \sqrt{\frac{3}{4\pi}}\,R_{n,1}(r_0) \tag{16}$$

The angular dependence of the two hybrid orbitals is then given by the functions: 

$$\frac{1}{\sqrt{2}}\,(\lambda + \mu\cos\theta), \qquad \frac{1}{\sqrt{2}}\,(\lambda - \mu\cos\theta) \tag{17}$$

which we shall represent, using the same method as in §2 (cf. fig. 2-a), by 
measuring off, along each direction of polar angles $\theta$ and $\varphi$, a line segment of 
length $\frac{1}{\sqrt{2}}\,|\lambda + \mu\cos\theta|$ or $\frac{1}{\sqrt{2}}\,|\lambda - \mu\cos\theta|$ and indicating by a plus or minus 
sign whether the wave function is positive or negative. Figure 6 represents the 
cross sections in the xOz plane of the surfaces so obtained, which have cylindrical 
symmetry with respect to Oz (we have assumed $\mu > \lambda > 0$). The orbital $\varphi_{n,s,p_z}(\mathbf{r})$ 
can be transformed into the $\varphi'_{n,s,p_z}(\mathbf{r})$ by a reflection through the point O. It can 
be seen that the $\varphi_{n,s,p_z}(\mathbf{r})$ orbital has no simple symmetry with respect to the 
point O. This asymmetry is due to the fact that the $\varphi_{np_z}(\mathbf{r})$ and $\varphi_{ns}(\mathbf{r})$ orbitals 
of which it is formed (and which are shown in figure 6-c) have opposite parities. 
In the region where $z > 0$, $\varphi_{ns}(\mathbf{r})$ and $\varphi_{np_z}(\mathbf{r})$ have the same sign and add, while 
in the region where $z < 0$, $\varphi_{ns}(\mathbf{r})$ and $\varphi_{np_z}(\mathbf{r})$ have opposite signs and subtract. 
The conclusions are reversed for $\varphi'_{n,s,p_z}(\mathbf{r})$. 
FIGURE 6 

Angular dependence of the $\varphi_{n,s,p_z}(\mathbf{r})$ (fig. a) and $\varphi'_{n,s,p_z}(\mathbf{r})$ (fig. b) hybrid orbitals, obtained from the 
$\varphi_{ns}(\mathbf{r})$ and $\varphi_{np_z}(\mathbf{r})$ orbitals, which have opposite parities (fig. c). A hybrid orbital can extend further 
in certain directions than the pure orbitals from which it is obtained. 


The $\varphi_{n,s,p_z}(\mathbf{r})$ orbital therefore extends further in the positive direction of 
the Oz axis than in the negative direction since, for fixed $r$, the values it takes 
on are greater (in absolute value) for $\theta = 0$ than for $\theta = \pi$. In general, for large 
values of $r$, $\lambda$ and $\mu$ are such that the values of the $\varphi_{n,s,p_z}(\mathbf{r})$ orbital in the positive 
direction of the Oz axis are larger than those taken on separately by the $\varphi_{ns}(\mathbf{r})$ 
and $\varphi_{np_z}(\mathbf{r})$ orbitals [the same conclusions are valid for the $\varphi'_{n,s,p_z}(\mathbf{r})$ orbital and 
the negative direction of the Oz axis]. 
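The way a hybrid orbital reaches further along Oz than either of its constituents can be seen from the angular functions $(\lambda \pm \mu\cos\theta)/\sqrt{2}$ alone. In the sketch below, $\lambda$ and $\mu$ are free positive parameters standing for the values of the s and $p_z$ orbitals at the chosen radius; the numbers $\lambda = 1.0$, $\mu = 1.5$ are purely illustrative and are not taken from any hydrogen radial function.

```python
import numpy as np


def sp_hybrid_angular(theta, lam, mu):
    """Angular dependence of the two sp hybrids, (lam +/- mu cos(theta)) / sqrt(2)."""
    plus = (lam + mu * np.cos(theta)) / np.sqrt(2.0)
    minus = (lam - mu * np.cos(theta)) / np.sqrt(2.0)
    return plus, minus


def mixing_matrix(alpha):
    """Coefficients of (14-a) and (14-b) on the (ns, np_z) basis."""
    return np.array([[np.cos(alpha), np.sin(alpha)],
                     [np.sin(alpha), -np.cos(alpha)]])


if __name__ == "__main__":
    lam, mu = 1.0, 1.5                      # illustrative values with mu > lam > 0
    forward, _ = sp_hybrid_angular(0.0, lam, mu)
    backward, _ = sp_hybrid_angular(np.pi, lam, mu)
    print("hybrid along +Oz:", forward, " along -Oz:", backward)
    print("pure s:", lam, " pure p_z along +Oz:", mu)
    print("orthonormal mixing:", np.allclose(mixing_matrix(0.3) @ mixing_matrix(0.3).T, np.eye(2)))
```

With these values the hybrid is larger along $+Oz$ than both the s and the $p_z$ functions; this holds whenever the smaller of $\lambda$ and $\mu$ exceeds $(\sqrt{2} - 1)$ times the larger. The second hybrid is the mirror image of the first, $\theta \to \pi - \theta$.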

This property plays an important role in the study of the chemical bond. 
To understand this qualitatively, assume that, in a particular atom A, one of the 
valence electrons can be either in the ns orbital or in one of the np orbitals. Then 
suppose that another atom B is in the neighborhood of the first one, and we call 
Oz the axis joining A and B. The $\varphi_{n,s,p_z}(\mathbf{r})$ orbital of A will overlap more of the 
orbitals of the valence electrons of B than the $\varphi_{ns}(\mathbf{r})$ or $\varphi_{np_z}(\mathbf{r})$ orbitals. Thus we see 
that hybridization of the orbitals of A can lead to a greater stability of the chemical 
bond, since this stability increases, as we have already pointed out, with the 
overlapping of the electronic orbitals of A and B involved in the bond. 


c. EXAMPLE: THE STRUCTURE OF ACETYLENE 

The carbon atom has six electrons. When this atom is free, two of these electrons are in
the 1s sub-shell, two in the 2s sub-shell, and two in the 2p sub-shell. Only these last two are
unpaired, and we should therefore expect carbon to be bivalent. This is indeed what is observed
in some of its compounds. However, carbon is usually present in a quadrivalent form. This
arises from the fact that when a carbon atom is bound to other atoms, one of its 2s electrons
can leave this sub-shell and place itself in the third 2p orbital, which is unoccupied in the free
carbon atom. There are then four unpaired electrons, whose wave functions are the result
of a hybridization of the four orbitals $2s$, $2p_x$, $2p_y$ and $2p_z$.

Thus, in the acetylene molecule C$_2$H$_2$, the four valence electrons of each carbon atom
are distributed as follows: two electrons are in the $\varphi_{2,s,p_z}(\mathbf{r})$ and $\varphi'_{2,s,p_z}(\mathbf{r})$ hybrid orbitals we
have just introduced, and the other two are in the $\varphi_{2p_x}(\mathbf{r})$ and $\varphi_{2p_y}(\mathbf{r})$ orbitals studied in §2-b.
According to figures 6-a and 6-b, the two electrons of each carbon atom which occupy
the $\varphi_{2,s,p_z}(\mathbf{r})$ and $\varphi'_{2,s,p_z}(\mathbf{r})$ hybrid orbitals participate in bonds separated by an angle of 180°:
the first one with the other carbon atom, and the second one with one of the two hydrogen
atoms (whose valence electrons occupy 1s orbitals). Thus we understand why the C$_2$H$_2$ molecule
is linear (cf. fig. 7, where we have used the same type of graphical representation as in figure 5).

As for the $2p_x$ orbitals centered on each of the carbon atoms, they present a partial
lateral overlapping, as do the two $2p_y$ orbitals, as is shown by the solid lines in figure 7. They
contribute to the reinforcement of the chemical stability of the molecule. The two carbon atoms
thus form a triple bond between them. One bond is produced by the $\varphi_{2,s,p_z}(\mathbf{r})$ and $\varphi'_{2,s,p_z}(\mathbf{r})$ hybrid
orbitals, each centered on one of the two atoms and cylindrically symmetrical with respect to the
Oz axis ($\sigma$ bond). Two bonds are produced by the $\varphi_{2p_x}(\mathbf{r})$ and $\varphi_{2p_y}(\mathbf{r})$ orbitals, which are
symmetrical with respect to the xOz and yOz planes ($\pi$ bonds).





FIGURE 7 

Schematic structure of the acetylene molecule
C$_2$H$_2$. For each carbon atom, two electrons are
in the $sp_z$ hybrid orbitals (cf. fig. 6) and contribute
to the C-H and C-C bonds ($\sigma$ bonds). In
addition, two electrons are in the $p_x$ and $p_y$ orbitals
and form additional bonds between the two
carbon atoms ($\pi$ bonds, weaker than $\sigma$ bonds),
shown by the vertical lines in the figure. The C-C
bond is therefore a “triple bond”.

comment:

As we have already pointed out, the 2p sub-shell, in a many-electron atom, has
an energy greater than that of the 2s sub-shell. The movement of an electron from the
2s sub-shell to the 2p sub-shell is therefore not energetically favorable. However, the
energy needed for this excitation is fully compensated by the increase in stability due
to the hybrid orbitals involved in the C-H and C-C bonds.


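4. $sp^2$ hybridization


a. INTRODUCTION OF $sp^2$ HYBRID ORBITALS


We shall now return to the four orbitals $\varphi_{ns}(\mathbf{r})$, $\varphi_{np_x}(\mathbf{r})$, $\varphi_{np_y}(\mathbf{r})$ and $\varphi_{np_z}(\mathbf{r})$,
and replace the first three by the three following real combinations:

$$\varphi_{n,s,p_x,p_y}(\mathbf{r}) = a\,\varphi_{ns}(\mathbf{r}) + b\,\varphi_{np_x}(\mathbf{r}) + c\,\varphi_{np_y}(\mathbf{r}) \tag{18-a}$$

$$\varphi'_{n,s,p_x,p_y}(\mathbf{r}) = a'\,\varphi_{ns}(\mathbf{r}) + b'\,\varphi_{np_x}(\mathbf{r}) + c'\,\varphi_{np_y}(\mathbf{r}) \tag{18-b}$$

$$\varphi''_{n,s,p_x,p_y}(\mathbf{r}) = a''\,\varphi_{ns}(\mathbf{r}) + b''\,\varphi_{np_x}(\mathbf{r}) + c''\,\varphi_{np_y}(\mathbf{r}) \tag{18-c}$$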


We require the three wave functions (18) to be equivalent, that is, to be transformable
into each other under rotation about Oz. Consequently, the proportion of the
$\varphi_{ns}(\mathbf{r})$ orbital must be the same in each of them:

$$a = a' = a'' \tag{19}$$

It is always possible to choose the axes so as to make the first orbital (18-a)
symmetrical about the xOz plane. We can therefore choose:

$$c = 0 \tag{20}$$

By taking the three orbitals (18) to be normalized and orthogonal, we obtain
six relations which enable us to determine* the six coefficients $a$, $b$, $b'$, $b''$, $c'$, $c''$.
A simple calculation yields:


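$$\varphi_{n,s,p_x,p_y}(\mathbf{r}) = \frac{1}{\sqrt{3}}\,\varphi_{ns}(\mathbf{r}) + \sqrt{\frac{2}{3}}\,\varphi_{np_x}(\mathbf{r}) \tag{21-a}$$

$$\varphi'_{n,s,p_x,p_y}(\mathbf{r}) = \frac{1}{\sqrt{3}}\,\varphi_{ns}(\mathbf{r}) - \frac{1}{\sqrt{6}}\,\varphi_{np_x}(\mathbf{r}) + \frac{1}{\sqrt{2}}\,\varphi_{np_y}(\mathbf{r}) \tag{21-b}$$

$$\varphi''_{n,s,p_x,p_y}(\mathbf{r}) = \frac{1}{\sqrt{3}}\,\varphi_{ns}(\mathbf{r}) - \frac{1}{\sqrt{6}}\,\varphi_{np_x}(\mathbf{r}) - \frac{1}{\sqrt{2}}\,\varphi_{np_y}(\mathbf{r}) \tag{21-c}$$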


We have thus produced what is called “$sp^2$ hybridization”. The three hybrid
orbitals (21) and the $\varphi_{np_z}(\mathbf{r})$ orbital form a new orthonormal basis in the space
$\mathcal{E}_{ns} \oplus \mathcal{E}_{np}$.


* Actually, the signs of $a$, $b$ and $c'$ can be chosen arbitrarily.

b. PROPERTIES OF THE $sp^2$ HYBRID ORBITALS

We shall use the same graphical representation as in figure 6. 

The $\varphi_{n,s,p_x,p_y}(\mathbf{r})$ orbital has cylindrical symmetry with respect to Ox.
Figure 8-a represents the cross section in the xOy plane of the surface which 
describes its angular dependence for fixed r. The form of the curve obtained 
is completely analogous to that of figure 6-a : the orbital points along the positive 
direction of the Ox axis. 



FIGURE 8 

Angular dependence of the three orthogonal $sp^2$ orbitals. The $\varphi_{n,s,p_x,p_y}$, $\varphi'_{n,s,p_x,p_y}$ and $\varphi''_{n,s,p_x,p_y}$ orbitals
can be transformed into each other by rotations through 120° about Oz.


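By using expression (4-b) for $\varphi_{np_x}(\mathbf{r})$, we can easily obtain the action on
$|\varphi_{np_x}\rangle$ of the operator $e^{-i\alpha L_z/\hbar}$ which performs a rotation through an angle $\alpha$ about Oz:

$$e^{-i\alpha L_z/\hbar}\,|\varphi_{np_x}\rangle = \cos\alpha\,|\varphi_{np_x}\rangle + \sin\alpha\,|\varphi_{np_y}\rangle \tag{22}$$

Also, we obviously have:

$$e^{-i\alpha L_z/\hbar}\,|\varphi_{ns}\rangle = |\varphi_{ns}\rangle \tag{23}$$

Formulas (21) then indicate that:

$$|\varphi'_{n,s,p_x,p_y}\rangle = e^{-2i\pi L_z/3\hbar}\,|\varphi_{n,s,p_x,p_y}\rangle \tag{24-a}$$

$$|\varphi''_{n,s,p_x,p_y}\rangle = e^{2i\pi L_z/3\hbar}\,|\varphi_{n,s,p_x,p_y}\rangle \tag{24-b}$$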
The two orbitals (21-b) and (21-c) can therefore be obtained from the orbital (21-a)
by rotations through angles $2\pi/3$ and $-2\pi/3$ about Oz. Figures (8-b) and (8-c)
give the cross sections in the xOy plane of the surfaces which describe their angular
dependence.
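The orthonormality of the hybrids (21) and the rotation property (24) can be checked numerically. The following is only a minimal sketch: each hybrid is represented by its coefficient vector on $(\varphi_{ns}, \varphi_{np_x}, \varphi_{np_y})$, and the helper names `SP2`, `dot` and `rotate_about_z` are introduced here for illustration. The rule used for $\varphi_{np_y}$ under rotation is the standard companion of (22), which the text does not write out.

```python
import math

# Rows: coefficients on (phi_ns, phi_npx, phi_npy) of the hybrids (21-a), (21-b), (21-c).
SP2 = [
    [1 / math.sqrt(3), math.sqrt(2 / 3), 0.0],
    [1 / math.sqrt(3), -1 / math.sqrt(6), 1 / math.sqrt(2)],
    [1 / math.sqrt(3), -1 / math.sqrt(6), -1 / math.sqrt(2)],
]


def dot(u, v):
    """Scalar product of two real coefficient vectors."""
    return sum(a * b for a, b in zip(u, v))


def rotate_about_z(coeffs, alpha):
    """Rotate a hybrid through alpha about Oz.

    The s orbital is unchanged (23); p_x -> cos(alpha) p_x + sin(alpha) p_y (22);
    p_y -> -sin(alpha) p_x + cos(alpha) p_y (assumed companion rule).
    """
    s, px, py = coeffs
    return [
        s,
        px * math.cos(alpha) - py * math.sin(alpha),
        px * math.sin(alpha) + py * math.cos(alpha),
    ]


# Gram matrix of the three hybrids: should be the identity.
gram = [[dot(u, v) for v in SP2] for u in SP2]

# Rotating (21-a) by +2*pi/3 and -2*pi/3 should give (21-b) and (21-c).
rot_plus = rotate_about_z(SP2[0], 2 * math.pi / 3)
rot_minus = rotate_about_z(SP2[0], -2 * math.pi / 3)
```

Working with coefficient vectors is enough here because $\varphi_{ns}$, $\varphi_{np_x}$ and $\varphi_{np_y}$ are themselves orthonormal, so scalar products of hybrids reduce to dot products of their coefficients.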


c. EXAMPLE: THE STRUCTURE OF ETHYLENE 

As in the acetylene molecule, each of the two carbon atoms of the ethylene molecule C$_2$H$_4$
has four valence electrons (one electron in the 2s sub-shell and three electrons in the 2p sub-shell).

Three of these four electrons occupy $sp^2$ hybrid orbitals of the type of those just
considered. They are the ones which, for each carbon atom, form the bonds with the neighboring
carbon atom and the two hydrogen atoms of the CH$_2$ group. Thus we see why the three bonds,
C-C, C-H, C-H, originating from one carbon atom are coplanar and form angles of 120°
with each other (cf. fig. 9, in which we have used the same graphical representation as in
figures 5 and 7). The remaining electron of each carbon atom occupies the $2p_z$ orbital. The
$2p_z$ orbitals of the two carbons present a partial lateral overlapping, shown by the solid lines
in figure 9.






FIGURE 9 

Schematic structure of the ethylene molecule C$_2$H$_4$. The two carbon atoms form a double bond
with each other: one $\sigma$ bond due to $sp^2$ orbitals of the type of those shown in figure 8 (the other two $sp^2$
hybrid orbitals at 120° with this one form the C-H bonds), and one $\pi$ bond, due to the overlapping
of the $p_z$ orbitals.



The two carbon atoms of the ethylene molecule are therefore connected by a double
bond: one bond formed by two hybrid orbitals of the $sp^2$ type, cylindrically symmetrical with
respect to the Ox axis joining the two carbon atoms ($\sigma$ bond), and one bond formed by
two $2p_z$ orbitals, symmetrical about the xOz plane ($\pi$ bond). It is the latter bond which blocks
the rotation of one CH$_2$ group with respect to the other one. If one of the CH$_2$ groups were
to rotate with respect to the other one about the axis joining the two carbons, the axes of the
two orbitals $2p_z$ and $2p_{z'}$ (fig. 9) would no longer be parallel. This would diminish their lateral
overlapping and, consequently, the stability of the system. Thus we see why the six atoms of the
ethylene molecule are in the same plane.

5. $sp^3$ hybridization


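a. INTRODUCTION OF $sp^3$ HYBRID ORBITALS


We shall now superpose the four orbitals $\varphi_{ns}(\mathbf{r})$, $\varphi_{np_x}(\mathbf{r})$, $\varphi_{np_y}(\mathbf{r})$, $\varphi_{np_z}(\mathbf{r})$,
to form the four hybrid orbitals:

$$\varphi_{n,s,p_x,p_y,p_z}(\mathbf{r}) = a\,\varphi_{ns}(\mathbf{r}) + b\,\varphi_{np_x}(\mathbf{r}) + c\,\varphi_{np_y}(\mathbf{r}) + d\,\varphi_{np_z}(\mathbf{r}) \tag{25-a}$$

$$\varphi'_{n,s,p_x,p_y,p_z}(\mathbf{r}) = a'\,\varphi_{ns}(\mathbf{r}) + b'\,\varphi_{np_x}(\mathbf{r}) + c'\,\varphi_{np_y}(\mathbf{r}) + d'\,\varphi_{np_z}(\mathbf{r}) \tag{25-b}$$

$$\varphi''_{n,s,p_x,p_y,p_z}(\mathbf{r}) = a''\,\varphi_{ns}(\mathbf{r}) + b''\,\varphi_{np_x}(\mathbf{r}) + c''\,\varphi_{np_y}(\mathbf{r}) + d''\,\varphi_{np_z}(\mathbf{r}) \tag{25-c}$$

$$\varphi'''_{n,s,p_x,p_y,p_z}(\mathbf{r}) = a'''\,\varphi_{ns}(\mathbf{r}) + b'''\,\varphi_{np_x}(\mathbf{r}) + c'''\,\varphi_{np_y}(\mathbf{r}) + d'''\,\varphi_{np_z}(\mathbf{r}) \tag{25-d}$$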


We again require the four orbitals to have the same geometrical form, which
means that:

$$a = a' = a'' = a''' \tag{26}$$


We can arbitrarily choose the symmetry axis of one of the orbitals, then the plane
containing this axis and that of a second orbital. This reduces the number of free
parameters to 10; we can find them by taking the four orbitals (25) to be orthonormal.

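We shall content ourselves here with giving a possible set of such hybrid
orbitals, defined by:

$$\begin{aligned}
a &= b = c = d = \tfrac{1}{2} \\
a' &= -b' = -c' = d' = \tfrac{1}{2} \\
a'' &= -b'' = c'' = -d'' = \tfrac{1}{2} \\
a''' &= b''' = -c''' = -d''' = \tfrac{1}{2}
\end{aligned} \tag{27}$$

and which can immediately be shown to be orthonormal and of the same geometrical
form. All the other possible sets can be obtained from this one by rotation.

We have thus produced what is called “$sp^3$ hybridization”. The four
orbitals (25) corresponding to the coefficients (27) form a new orthonormal basis
in the space $\mathcal{E}_{ns} \oplus \mathcal{E}_{np}$.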


b. PROPERTIES OF sp 3 HYBRID ORBITALS 


The four orbitals constructed in §5-a are analogous in form to those
studied in §§3 and 4. They point respectively in the directions of the vectors
whose components are:

$$(1, 1, 1), \qquad (-1, -1, 1), \qquad (-1, 1, -1), \qquad (1, -1, -1) \tag{28}$$

The axes of the four $sp^3$ orbitals are therefore arranged like the straight lines
joining the center of a regular tetrahedron to the four corners of this tetrahedron.
The angle between any two of these straight lines is equal to 109°28'.
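The tetrahedral angle quoted above follows directly from the direction vectors (28). The sketch below recomputes it and also checks orthonormality of one possible coefficient set; the coefficients used (equal s weight 1/2 and p components ±1/2 along each direction) are an assumption chosen to be consistent with (26) and (28), and the names `DIRECTIONS`, `SP3` and `angle_deg` are illustrative only.

```python
import math

# Direction vectors (28) of the four sp3 hybrids.
DIRECTIONS = [(1, 1, 1), (-1, -1, 1), (-1, 1, -1), (1, -1, -1)]

# Assumed coefficients on (phi_ns, phi_npx, phi_npy, phi_npz): s weight 1/2,
# p part along the corresponding direction with amplitude 1/2 per component.
SP3 = [[0.5] + [0.5 * c for c in d] for d in DIRECTIONS]


def dot(u, v):
    """Scalar product of two real vectors."""
    return sum(a * b for a, b in zip(u, v))


def angle_deg(u, v):
    """Angle in degrees between two vectors."""
    return math.degrees(math.acos(dot(u, v) / math.sqrt(dot(u, u) * dot(v, v))))


# Angles between every pair of hybrid axes.
angles = [
    angle_deg(DIRECTIONS[i], DIRECTIONS[j])
    for i in range(4)
    for j in range(i + 1, 4)
]

# Convert the first angle to degrees and arc minutes.
degrees_part = int(angles[0])
minutes_part = (angles[0] - degrees_part) * 60

# Gram matrix of the assumed hybrids: should be the identity.
gram = [[dot(u, v) for v in SP3] for u in SP3]
```

All six pairwise angles are equal to $\arccos(-1/3)$, which is what makes the four axes point to the corners of a regular tetrahedron.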


c. EXAMPLE: THE STRUCTURE OF METHANE 

In the methane molecule CH$_4$, the four valence electrons of the carbon atom each occupy
one of the four $sp^3$ hybrid orbitals studied above. This immediately explains why the four
hydrogen atoms form the corners of a regular tetrahedron centered on the carbon atom (fig. 10).



FIGURE 10

Schematic structure of the methane
molecule. The $sp^3$ orbitals produce
bonds arranged like the straight lines
joining the center of a tetrahedron to
its four corners (angles of 109°28').


In the ethane molecule C$_2$H$_6$, one of the hydrogen atoms of methane is replaced by
a CH$_3$ group. The two carbon atoms are then connected by a single bond, formed by
two $sp^3$ hybrid orbitals, cylindrically symmetrical with respect to the straight line joining the
two carbon atoms. The absence of a double bond permits the practically free rotation of
one CH$_3$ group with respect to the other one.


References and suggestions for further reading: 

Various geometrical representations of orbitals: Levine (12.3), §6.6; Karplus 
and Porter (12.1), §3.10. 

Hybrid orbitals: Karplus and Porter (12.1), §6.3; Alonso and Finn III (1.4),
§5-5; Eyring et al. (12.5), chap. XII, §12 b; Coulson (12.6), chap. VIII; Pauling (12.2),
chap. III, §§13 and 14.
VIBRATIONAL-ROTATIONAL LEVELS
OF DIATOMIC MOLECULES


1. Introduction

2. Approximate solution of the radial equation

a. The zero angular momentum states ($l = 0$)

b. General case ($l$ any positive integer)

c. The vibrational-rotational spectrum

3. Evaluation of some corrections

a. More precise study of the form of the effective potential $V_{\text{eff}}(r)$
b. Energy levels and wave functions of the stationary states
c. Interpretation of the various corrections


1. Introduction 

In this complement, we shall use the results of chapter VII to study quantum
mechanically the stationary states of the system formed by the two nuclei of a
diatomic molecule. We shall simultaneously take into account all the degrees of
freedom of the system: vibration of the two nuclei about their equilibrium
position and rotation of the system about the center of mass. We shall show
that the results obtained in complements A$_V$ and C$_{VI}$, in which one degree of
freedom at a time was considered, are valid to a first approximation. In addition,
a certain number of corrections due to the “centrifugal distortion” of the
molecule and to the vibration-rotation coupling will be calculated and interpreted.

We saw in §1-a of complement A$_V$ (the Born-Oppenheimer approximation)
that the potential energy $V(r)$ of interaction between the two nuclei depends only
on the distance $r$ between them and has the form shown in figure 1: $V(r)$ is



FIGURE 1 

Variation of the interaction potential
energy $V(r)$ between the nuclei
of a diatomic molecule as a function
of the distance $r$ between
them. $V(r)$ takes on its minimum
value $-V_0$ at $r = r_e$. The first
vibrational states are represented
by the horizontal lines in the
potential well.

attractive at large distances, repulsive at short distances, and has a minimum
at $r = r_e$ of depth $V_0$. Let $m_1$ and $m_2$ be the masses of the two nuclei. Since $V(r)$
depends only on $r$, we can, according to §B of chapter VII, study separately
the motion of the center of mass (a free particle of mass $M = m_1 + m_2$) and the
relative motion in the center of mass reference frame, which is equivalent to that
of a fictitious particle of mass:

$$\mu = \frac{m_1 m_2}{m_1 + m_2} \tag{1}$$

placed in the potential $V(r)$ of figure 1.

If we are interested only in the relative motion, the stationary states of the
system, according to the results of §A of chapter VII, are described by the wave
functions:

$$\varphi_{v,l,m}(r, \theta, \varphi) = \frac{1}{r}\, u_{v,l}(r)\, Y_l^m(\theta, \varphi) \tag{2}$$

where the corresponding energies $E_{v,l}$ and the radial functions $u_{v,l}(r)$ are given
by the equation:

$$\left[ -\frac{\hbar^2}{2\mu} \frac{d^2}{dr^2} + V(r) + \frac{l(l+1)\hbar^2}{2\mu r^2} \right] u_{v,l}(r) = E_{v,l}\, u_{v,l}(r) \tag{3}$$

comment: 

Rigorously speaking, we implicitly assume in all of this complement
(as in A$_V$ and C$_{VI}$) that the projection of the total orbital angular momentum
of the electrons onto the internuclear axis is zero, as is their total spin.
The total angular momentum of the molecule then arises only from the rotation 
of the two nuclei. Such a situation is found in virtually all diatomic molecules 
in their ground states. In the general case, terms will also appear in the 
nuclear interaction energy which do not depend exclusively on the distance r. 


2. Approximate solution of the radial equation 

The radial equation has the same form as the eigenvalue equation of the
Hamiltonian of a one-dimensional problem in which a particle of mass $\mu$ is placed
in the effective potential:

$$V_{\text{eff}}(r) = V(r) + \frac{l(l+1)\hbar^2}{2\mu r^2} \tag{4}$$


a. THE ZERO ANGULAR MOMENTUM STATES ($l = 0$)

For $l = 0$, the “centrifugal potential” $l(l+1)\hbar^2/2\mu r^2$ is zero, and $V_{\text{eff}}(r)$
is then the same as $V(r)$. In the neighborhood of the minimum at $r = r_e$, $V(r)$
can be expanded in powers of $r - r_e$:

$$V(r) = -V_0 + f\,(r - r_e)^2 - g\,(r - r_e)^3 + \ldots \tag{5}$$

$f$ and $g$ are positive since $r = r_e$ is a minimum and the potential increases faster
for $r < r_e$ than for $r > r_e$.

We begin by neglecting the term in $(r - r_e)^3$ and terms of higher order.
The potential is then purely parabolic, and we know the eigenstates and eigenvalues
of the Hamiltonian. If we set:

$$\omega = \sqrt{\frac{2f}{\mu}} \tag{6}$$

we obtain levels whose energy is:

$$E_{v,0} = -V_0 + \left(v + \tfrac{1}{2}\right)\hbar\omega, \qquad v = 0, 1, 2, \ldots \tag{7}$$

with the associated wave functions (cf. chap. V and complement B$_V$):

$$u_v(r) = \left(\frac{\beta^2}{\pi}\right)^{1/4} \frac{1}{\sqrt{2^v\, v!}}\; e^{-\beta^2 (r - r_e)^2/2}\; H_v[\beta(r - r_e)] \tag{8}$$

with:

$$\beta = \sqrt{\frac{\mu\omega}{\hbar}} \tag{9}$$

($H_v$ is a Hermite polynomial). In figure 1, we have represented the first two
energy levels by horizontal lines. The length of the lines gives an idea of the
extension $(\Delta r)_v$ of the wave functions corresponding to these states. Recall
[chap. V, formula (D-5-a)] that:

$$(\Delta r)_v \simeq \sqrt{\left(v + \tfrac{1}{2}\right)\frac{\hbar}{\mu\omega}} \tag{10}$$


For the preceding calculation to be valid, it is necessary, in a region of
width $(\Delta r)_v$ about $r = r_e$, for the term in $(r - r_e)^3$ of (5) to be always negligible
compared to the term in $(r - r_e)^2$. We must therefore have:

$$f \gg g\,(\Delta r)_v \simeq g\,(\Delta r)_0 \sqrt{v + \tfrac{1}{2}} \tag{11}$$

where $(\Delta r)_0$ is the extension of the ground state:

$$(\Delta r)_0 \simeq \sqrt{\frac{\hbar}{\mu\omega}} \tag{12}$$

This implies, in particular, that:

$$f \gg g\,(\Delta r)_0 \tag{13}$$

Condition (13) is always satisfied in practice. We shall confine ourselves in
what follows to quantum numbers $v$ which are small enough for (11) to be
satisfied as well.

comment: 

Expansion (5) is obviously not valid at $r = 0$, where $V(r)$ is infinite.
The preceding argument implicitly assumes that:

$$(\Delta r)_v \ll r_e \tag{14}$$

In this case, the wave functions (8) are practically zero at the origin, and
almost identical to the exact solutions of the radial equation (3), which must
be rigorously zero at $r = 0$ (cf. §A-2-c of chapter VII).


b. GENERAL CASE ($l$ ANY POSITIVE INTEGER)

α. Evaluation of the effect of the centrifugal potential

At $r = r_e$, the centrifugal potential is equal to:

$$\frac{l(l+1)\hbar^2}{2\mu r_e^2} = Bh\, l(l+1) \tag{15}$$

where:

$$B = \frac{\hbar}{4\pi\mu r_e^2} \tag{16}$$

is the rotational constant introduced in complement C$_{VI}$. We have already pointed
out in that complement (§4-a-β) that the energy $2Bh$ (the distance between two
adjacent lines of the pure rotational spectrum) is always very much smaller than
$\hbar\omega$ (the vibrational quantum):

$$2Bh \ll \hbar\omega \tag{17}$$


We shall confine ourselves here to rotational quantum numbers $l$ sufficiently
small that:

$$Bh\, l(l+1) \ll \hbar\omega \tag{18}$$
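To get a feeling for the orders of magnitude in (16) and (17), the sketch below evaluates the reduced mass, the rotational constant $B$ and the ratio $2Bh/\hbar\omega$ for an HCl-like molecule. The numerical inputs (isotopic masses, $r_e \approx 1.27$ Å, vibrational wavenumber near 2990 cm$^{-1}$) are approximate values assumed purely for illustration, and the function names are not from the text.

```python
import math

HBAR = 1.054571817e-34    # reduced Planck constant, J s
C_CM = 2.99792458e10      # speed of light, cm/s
AMU = 1.66053906660e-27   # atomic mass unit, kg


def reduced_mass(m1, m2):
    """Reduced mass mu = m1 m2 / (m1 + m2), formula (1)."""
    return m1 * m2 / (m1 + m2)


def rotational_constant_hz(mu, r_e):
    """Rotational constant B = hbar / (4 pi mu r_e^2), formula (16), in Hz."""
    return HBAR / (4 * math.pi * mu * r_e**2)


# Illustrative, approximate inputs for an HCl-like molecule (assumptions).
m_h = 1.00783 * AMU
m_cl = 34.96885 * AMU
r_e = 1.2746e-10                  # equilibrium distance, m
nu_vib = 2990.9 * C_CM            # omega / 2 pi in Hz, from a wavenumber in cm^-1

mu = reduced_mass(m_h, m_cl)
B = rotational_constant_hz(mu, r_e)
B_wavenumber = B / C_CM           # B expressed in cm^-1

# 2Bh / (hbar omega) = 2B / (omega / 2 pi); condition (17) requires this to be << 1.
ratio = 2 * B / nu_vib
```

With these inputs $B$ comes out near 10 cm$^{-1}$ and the ratio is below one percent, so condition (17) is comfortably satisfied; condition (18) then restricts $l(l+1)$ to values well below the inverse of half this ratio.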

In a domain of small width $\Delta r$ about $r = r_e$, the variation of the centrifugal
potential is of the order of:

$$\frac{l(l+1)\hbar^2}{\mu r_e^3}\,\Delta r = 2Bh\, l(l+1)\,\frac{\Delta r}{r_e} \tag{19}$$

That of the potential $V(r)$ is approximately:

$$f\,(\Delta r)^2 = \frac{1}{2}\,\mu\omega^2 (\Delta r)^2 = \frac{\hbar\omega}{2}\,\frac{(\Delta r)^2}{(\Delta r)_0^2} \tag{20}$$

where we have used (12). We know from §2-a that the extension $\Delta r$ of the wave
functions we shall be considering is negligible with respect to $r_e$, but certainly
at least of the order of $(\Delta r)_0$. Consequently, in the region of space in which the
wave functions have significant amplitudes, the variation (19) of the centrifugal
potential is, according to (18), much smaller than that of $V(r)$ found in (20).
We can then, to a first approximation, replace the centrifugal potential in
equation (4) by its value (15) at $r = r_e$, which gives for the effective potential:

$$V_{\text{eff}}(r) \simeq V(r) + Bh\, l(l+1) \tag{21}$$


β. Energy levels and stationary wave functions

By using (21) and neglecting terms of order greater than two in expansion (5),
we can put the radial equation (3) in the form:

$$\left[ -\frac{\hbar^2}{2\mu} \frac{d^2}{dr^2} + \frac{1}{2}\,\mu\omega^2 (r - r_e)^2 \right] u_{v,l}(r) = \left[ E_{v,l} + V_0 - Bh\, l(l+1) \right] u_{v,l}(r) \tag{22}$$


which is completely analogous to the eigenvalue equation of a one-dimensional 
harmonic oscillator. 

From this, we deduce immediately that the term in brackets on the
right-hand side must be equal to $(v + 1/2)\hbar\omega$, where $v = 0, 1, 2, \ldots$; this yields
the possible energies $E_{v,l}$ of the molecule:

$$E_{v,l} = -V_0 + \left(v + \tfrac{1}{2}\right)\hbar\omega + Bh\, l(l+1) \tag{23}$$

with:

$$v = 0, 1, 2, \ldots \qquad l = 0, 1, 2, \ldots$$

As for the radial functions, they do not depend on $l$, since the differential operator
appearing on the left-hand side of (22) does not depend on $l$. Consequently, we
have:

$$u_{v,l}(r) = u_v(r) \tag{24}$$

where $u_v(r)$ was given in (8). Expression (2) for the wave functions of the
stationary states can then be written, in this approximation:

$$\varphi_{v,l,m}(r, \theta, \varphi) = \frac{1}{r}\, u_v(r)\, Y_l^m(\theta, \varphi) \tag{25}$$

Thus we see that the energies of the stationary states are the sums of the
energies calculated in complements A$_V$ and C$_{VI}$, in which only one degree of
freedom at a time (vibration or rotation) was taken into account. In addition,
the wave functions are the products of the wave functions found in these two
complements, to within a factor of $1/r$.

Figure 2 shows the first two vibrational levels $v = 0$ and $v = 1$, with their
rotational structure due to the term $Bh\, l(l+1)$.
FIGURE 2

Diagram showing the first two
vibrational levels ($v = 0$ and $v = 1$)
of a diatomic molecule and their rotational
structure ($l = 0, 1, 2, \ldots$).
Within the limits of the approximations,
this rotational structure is
the same for the various vibrational
levels. For a heteropolar molecule,
the transitions represented by the
vertical arrows in the figure yield
the lines of the vibrational-rotational
spectrum of the molecule.
These lines fall in the infrared.
These transitions obey the selection
rule $\Delta l = l' - l = \pm 1$.


c. THE VIBRATIONAL-ROTATIONAL SPECTRUM 

We shall confine ourselves to the study of the infrared absorption or
emission spectrum, thereby assuming the molecule to be heteropolar (calculations
analogous to those presented in §1-c-β of A$_V$ and §4-b of C$_{VI}$ could be performed
when dealing with homopolar molecules and the Raman effect).

α. Selection rules

Recall that the dipole moment $D(r)$ of the molecule is directed along the
straight line joining the two nuclei and can be expanded in powers of $r - r_e$
about $r_e$:

$$D(r) = d_0 + d_1 (r - r_e) + \ldots \tag{26}$$

The projection of this dipole moment onto Oz is equal to $D(r)\cos\theta$ (where $\theta$ is
the angle between the axis of the molecule and Oz).

We want to determine the frequency spectrum of electromagnetic waves
polarized along Oz that the molecule can absorb or emit as a consequence
of the variation of its electric dipole. As we have done several times before, we shall
look for the Bohr frequencies that can appear in the time evolution of the mean
value of $D(r)\cos\theta$. All we must do, then, is find for what values of $v'$, $l'$, $m'$ and
$v$, $l$, $m$ the matrix element:

$$\langle \varphi_{v',l',m'} |\, D(r)\cos\theta \,| \varphi_{v,l,m} \rangle = \int r^2\, dr\, d\Omega\; \varphi^*_{v',l',m'}(r, \theta, \varphi)\, D(r)\cos\theta\; \varphi_{v,l,m}(r, \theta, \varphi) \tag{27}$$
is different from zero. Using expression (25) for the wave functions, we put
this matrix element in the form:

$$\left[ \int_0^\infty dr\; u^*_{v'}(r)\, D(r)\, u_v(r) \right] \times \left[ \int d\Omega\; Y_{l'}^{m'*}(\theta, \varphi)\, \cos\theta\; Y_l^m(\theta, \varphi) \right] \tag{28}$$

We thus obtain a product of two integrals which have already been treated
in complements A$_V$ and C$_{VI}$. The second integral is different from zero only if:

$$l' - l = +1,\ -1 \tag{29}$$

As for the first one, if we confine ourselves to the terms in $d_0$ and $d_1$ of (26), it is
different from zero only if:

$$v' - v = 0,\ +1,\ -1 \tag{30}$$


The set of lines corresponding to $v' - v = 0$ constitutes the pure rotational
spectrum studied in complement C$_{VI}$ (its intensity is proportional to $d_0^2$). As for
the lines $v' - v = \pm 1$, $l' - l = \pm 1$, of intensity proportional to $d_1^2$, they constitute
the vibrational-rotational spectrum which we shall now briefly describe.


comment: 

The selection rule $l' - l = \pm 1$ arises from the angular dependence of the wave
functions. It is therefore independent of the approximation used to solve the radial
equation (3), while (30) is valid only in the harmonic approximation.
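The angular selection rule (29) can be checked numerically in the simplest case $m = m' = 0$, where $Y_l^0$ is proportional to the Legendre polynomial $P_l(\cos\theta)$ and the angular integral of (28) reduces, up to normalization, to $\int_{-1}^{1} P_{l'}(x)\, x\, P_l(x)\, dx$. The sketch below is a minimal illustration using Simpson's rule; the function names and the choice of integration scheme are assumptions made here, not part of the text.

```python
def legendre(l, x):
    """Legendre polynomial P_l(x), computed with the Bonnet recursion."""
    p_prev, p = 1.0, x
    if l == 0:
        return p_prev
    for k in range(1, l):
        p_prev, p = p, ((2 * k + 1) * x * p - k * p_prev) / (k + 1)
    return p


def angular_integral(l_prime, l, n=2000):
    """Simpson estimate of the integral over [-1, 1] of P_l'(x) * x * P_l(x)."""
    h = 2.0 / n
    total = 0.0
    for i in range(n + 1):
        x = -1.0 + i * h
        # Simpson weights: 1 at the ends, 4 on odd nodes, 2 on even interior nodes.
        weight = 1 if i in (0, n) else (4 if i % 2 else 2)
        total += weight * legendre(l_prime, x) * x * legendre(l, x)
    return total * h / 3


# Table of integrals for l, l' = 0..4; only the entries with |l' - l| = 1 survive.
table = {(lp, l): angular_integral(lp, l) for lp in range(5) for l in range(5)}
```

For $l' = l + 1$ the exact value of this integral is $2(l+1)/[(2l+1)(2l+3)]$, which follows from the recursion relation of the Legendre polynomials; all other entries vanish by orthogonality.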

β. Form of the spectrum

Let $v'$ be the larger of the two vibrational quantum numbers under consideration
($v' = v + 1$). The vibrational-rotational lines can be separated into two
groups:

- the lines $v' = v + 1,\ l' = l + 1 \leftrightarrow v,\ l$, of frequencies:

$$\frac{\omega}{2\pi} + B(l+1)(l+2) - Bl(l+1) = \frac{\omega}{2\pi} + 2B(l+1) \tag{31}$$

with $l = 0, 1, 2, \ldots$

(these lines correspond to the transitions indicated by the arrows on the right-hand
side of figure 2).

- the lines $v' = v + 1,\ l' = l - 1 \leftrightarrow v,\ l$, of frequencies:

$$\frac{\omega}{2\pi} + Bl'(l'+1) - B(l'+1)(l'+2) = \frac{\omega}{2\pi} - 2B(l'+1) \tag{32}$$

with $l' = 0, 1, 2, \ldots$

(transitions indicated by the arrows on the left-hand side of figure 2).

The vibrational-rotational spectrum therefore has the form shown in figure 3.
It contains two groups of equidistant lines, symmetrical with respect to the
vibrational frequency $\omega/2\pi$. All these lines together constitute a “band”. The
group of lines corresponding to frequencies (31) is called the “R branch”, and
the one corresponding to frequencies (32), the “P branch”. In each branch, the
distance between two adjacent lines is $2B$. The central interval separating the two
branches is of width $4B$: there is no line at the pure vibrational frequency
$\omega/2\pi$ (there is often said to be a “missing line” in the spectrum).
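The structure of the band described above can be reproduced in a few lines. The sketch below generates the first lines of each branch from (31) and (32) and inspects the gaps between neighboring lines; the numerical values of the band center and of $B$ are illustrative assumptions (expressed in the same arbitrary frequency unit), and the function names are not from the text.

```python
def r_branch(nu0, B, l_values):
    """Lines (31): lower level l, upper level l' = l + 1, at nu0 + 2B(l + 1)."""
    return [nu0 + 2 * B * (l + 1) for l in l_values]


def p_branch(nu0, B, l_prime_values):
    """Lines (32): upper level l', lower level l = l' + 1, at nu0 - 2B(l' + 1)."""
    return [nu0 - 2 * B * (lp + 1) for lp in l_prime_values]


# Illustrative values (assumptions): band center and rotational constant,
# both in the same unit.
nu0, B = 2990.9, 10.59

R = r_branch(nu0, B, range(4))
P = p_branch(nu0, B, range(4))

# Merge both branches and compute the gaps between adjacent lines.
lines = sorted(P + R)
gaps = [b - a for a, b in zip(lines, lines[1:])]
```

Every gap is $2B$ except the central one, which is $4B$: this is the “missing line” at the pure vibrational frequency.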


[Figure 3: line spectrum. P branch, lines labeled $l' \leftrightarrow l$: 3↔4, 2↔3, 1↔2, 0↔1, below $\omega/2\pi$; R branch, lines labeled 1↔0, 2↔1, 3↔2, 4↔3, above $\omega/2\pi$.]


FIGURE 3

The vibrational-rotational spectrum for a heteropolar molecule. Since transitions between levels of
figure 2 with the same value of $l$ are forbidden by the selection rules, no line has the pure vibrational
frequency $\omega/2\pi$. Transitions in which the molecule passes from the level $(v', l')$ to the level $(v = v' - 1,\ l = l' - 1)$
correspond to frequencies $\omega/2\pi + 2B(l+1)$ (lines of the “R branch”). Transitions in
which the molecule passes from the level $(v', l')$ to the level $(v = v' - 1,\ l = l' + 1)$ correspond
to frequencies $\omega/2\pi - 2B(l'+1)$ (lines of the “P branch”). The different lines are labeled $l' \leftrightarrow l$
in the figure.


comment: 

The “pure vibrational” spectrum, studied in A$_V$ and composed of
a single line at $\omega/2\pi$, therefore does not exist in practice. It is only when
one uses a spectroscopic device with low resolution that one can ignore
the rotational structure of the vibrational-rotational line and treat the band
of figure 3 like a single line centered at $\omega/2\pi$ (recall that $\omega/2\pi \gg 2B$).


3. Evaluation of some corrections 


The calculations of the preceding section are based on the approximation
which consists of replacing the centrifugal potential by its value at $r = r_e$ in
the radial equation. The effective potential $V_{\text{eff}}(r)$ can then be obtained from $V(r)$
by a simple vertical translation [formula (21)].

In this section, we shall study the corrections which must be performed
on the results of §2 in order to take into account the slow variation of the
centrifugal potential about $r = r_e$. To do so, we shall use its expansion in powers
of $(r - r_e)$:

$$\frac{l(l+1)\hbar^2}{2\mu r^2} = \frac{l(l+1)\hbar^2}{2\mu r_e^2} - \frac{l(l+1)\hbar^2}{\mu r_e^3}\,(r - r_e) + \frac{3\,l(l+1)\hbar^2}{2\mu r_e^4}\,(r - r_e)^2 + \ldots \tag{33}$$
a. MORE PRECISE STUDY OF THE FORM
OF THE EFFECTIVE POTENTIAL $V_{\text{eff}}(r)$

If we use (5) and (33), the expansion of the effective potential (4) in the
neighborhood of $r = r_e$ can be written:

$$\begin{aligned}
V_{\text{eff}}(r) = {} & -V_0 + f\,(r - r_e)^2 - g\,(r - r_e)^3 + \ldots \\
& + \frac{l(l+1)\hbar^2}{2\mu r_e^2} - \frac{l(l+1)\hbar^2}{\mu r_e^3}\,(r - r_e) + \frac{3\,l(l+1)\hbar^2}{2\mu r_e^4}\,(r - r_e)^2 + \ldots
\end{aligned} \tag{34}$$

We shall see that the variation of the centrifugal potential in the neighborhood
of $r = r_e$ produces, for $l$ different from zero, the following effects:

(i) The position $\tilde{r}_e$ of the minimum of $V_{\text{eff}}(r)$ does not coincide exactly
with $r_e$.

(ii) The value $V_{\text{eff}}(\tilde{r}_e)$ of this minimum is slightly different from
$-V_0 + Bh\, l(l+1)$.

(iii) The curvature of $V_{\text{eff}}(r)$ at $r = \tilde{r}_e$ [which fixes, as in formula (6), the
angular frequency of the equivalent harmonic oscillator] is no longer strictly
given by the coefficient $f$.

We shall evaluate these various effects by using expansion (34). As far as the
first two are concerned, we can neglect terms of order higher than 2 in $V(r)$, and
those of order higher than 1 in the centrifugal potential, since the distance
$\tilde{r}_e - r_e$ which we shall find is very small [it will even be small relative to $(\Delta r)_0$].
In fact, we shall be able to verify a posteriori that:

$$g\,(\tilde{r}_e - r_e) \ll f \tag{35-a}$$

$$\frac{3\,l(l+1)\hbar^2}{2\mu r_e^4}\,(\tilde{r}_e - r_e) \ll \frac{l(l+1)\hbar^2}{\mu r_e^3} \tag{35-b}$$


α. Position and value of the minimum of $V_{\text{eff}}(r)$

If, in expansion (34), we keep only the first two terms of $V(r)$ and the first
two terms of the centrifugal potential, $\tilde{r}_e$ is given by:

$$2f\,(\tilde{r}_e - r_e) \simeq \frac{l(l+1)\hbar^2}{\mu r_e^3} \tag{36}$$

that is:

$$\tilde{r}_e - r_e \simeq \frac{l(l+1)\hbar^2}{2\mu f r_e^3} = \frac{Bh\, l(l+1)}{f\, r_e} \tag{37}$$

According to (6) and (12), we have:

$$\frac{\tilde{r}_e - r_e}{(\Delta r)_0} \simeq \frac{2Bh\, l(l+1)}{\hbar\omega}\,\frac{(\Delta r)_0}{r_e} \tag{38}$$

which, with (13) and (14) taken into account, proves (35-a) and (35-b).

Substituting this value of $\tilde{r}_e$ into the expansion of $V_{\text{eff}}(r)$, we find:

$$V_{\text{eff}}(\tilde{r}_e) \simeq -V_0 + Bh\, l(l+1) - Gh\,[l(l+1)]^2 \tag{39}$$

with:

$$G = \frac{\hbar^3}{8\pi\mu^2 r_e^6 f} \tag{40}$$

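β. Curvature of $V_{\text{eff}}(r)$ at its minimum

In the neighborhood of $r = \tilde{r}_e$, we can therefore write $V_{\text{eff}}(r)$ in the form:

$$V_{\text{eff}}(r) = V_{\text{eff}}(\tilde{r}_e) + f'\,(r - \tilde{r}_e)^2 - g'\,(r - \tilde{r}_e)^3 + \ldots \tag{41}$$

The coefficient $f'$ is related to the curvature of $V_{\text{eff}}(r)$ at $r = \tilde{r}_e$:

$$f' = \frac{1}{2} \left[ \frac{d^2 V_{\text{eff}}}{dr^2} \right]_{r = \tilde{r}_e} \tag{42}$$

To evaluate the difference between $f'$ and $f$, we must take into account the
term in $(r - r_e)^3$ of $V(r)$ in expansion (34) and, consequently, also the term in
$(r - r_e)^2$ of the centrifugal potential. A simple calculation then yields, using (37):

$$2f' \simeq 2f + \frac{3\,l(l+1)\hbar^2}{\mu r_e^4} - \frac{3g\, l(l+1)\hbar^2}{\mu f r_e^3} \tag{43}$$

The angular frequency $\omega$ defined by (6) must therefore be replaced by:

$$\omega' = \sqrt{\frac{2f'}{\mu}} \tag{44}$$

Expanding the square root, we easily find:

$$\omega' = \omega - 2\pi\alpha_e\, l(l+1) \tag{45}$$

with:

$$\alpha_e = \frac{3\hbar^2\omega}{8\pi\mu r_e^3 f} \left[ \frac{g}{f} - \frac{1}{r_e} \right] \tag{46}$$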

We could carry out an analogous calculation to determine $g'$. Actually,
since the term in $(r - \tilde{r}_e)^3$ of (41) adds only a small correction to the results
obtained using the first two terms, we shall neglect the variation of
$\dfrac{d^3}{dr^3} V_{\text{eff}}(r)$ when we go from $r_e$ to $\tilde{r}_e$, and take $g' \simeq g$.

In conclusion, in the neighborhood of its minimum, we can write $V_{\text{eff}}(r)$ in
the form:

$$V_{\text{eff}}(r) \simeq V_{\text{eff}}(\tilde{r}_e) + \frac{1}{2}\,\mu\omega'^2 (r - \tilde{r}_e)^2 - g\,(r - \tilde{r}_e)^3 \tag{47}$$

where $\tilde{r}_e$, $V_{\text{eff}}(\tilde{r}_e)$ and $\omega'$ are given by (37), (39) and (45).
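The three corrections summarized above can be tested against a concrete potential. The sketch below uses a Morse potential, which is an assumption made here for illustration (the text keeps $V(r)$ general), in dimensionless units with $\hbar = \mu = 1$ and hypothetical parameters. For this potential the coefficients of expansion (5) are $f = V_0 a^2$ and $g = V_0 a^3$. The minimum of $V_{\text{eff}}$ is located by Newton iteration and compared with the first-order predictions for the shift of the minimum, the correction $-Gh[l(l+1)]^2$ and the frequency shift $-2\pi\alpha_e l(l+1)$, using $Gh = \hbar^4/(4\mu^2 f r_e^6)$, the value that follows from substituting (37) into (34).

```python
import math

# Dimensionless units and hypothetical Morse parameters (assumptions for illustration).
HBAR = MU = 1.0
V0, A, R_E = 50.0, 1.0, 10.0
F = V0 * A**2            # coefficient f of (r - r_e)^2 in expansion (5)
G_CUBIC = V0 * A**3      # coefficient g of (r - r_e)^3 in expansion (5)
L = 3
LL = L * (L + 1)
C = LL * HBAR**2 / MU    # l(l+1) hbar^2 / mu


def v_eff(r):
    """Morse potential plus centrifugal potential, as in (4)."""
    e = math.exp(-A * (r - R_E))
    return V0 * ((1 - e) ** 2 - 1) + C / (2 * r**2)


def d1(r):
    """First derivative of v_eff."""
    e = math.exp(-A * (r - R_E))
    return 2 * V0 * A * (1 - e) * e - C / r**3


def d2(r):
    """Second derivative of v_eff."""
    e = math.exp(-A * (r - R_E))
    return 2 * V0 * A**2 * (2 * e * e - e) + 3 * C / r**4


# Newton iteration for the minimum of v_eff, starting from r_e.
r_min = R_E
for _ in range(50):
    r_min -= d1(r_min) / d2(r_min)

omega = math.sqrt(2 * F / MU)                    # formula (6)
shift_pred = C / (2 * F * R_E**3)                # first-order shift of the minimum, (37)
Bh = HBAR**2 / (2 * MU * R_E**2)                 # h times the rotational constant (16)
Gh = HBAR**4 / (4 * MU**2 * F * R_E**6)          # h times G, from (37) inserted in (34)
alpha_e = (3 * HBAR**2 * omega / (8 * math.pi * MU * R_E**3 * F)) * (
    G_CUBIC / F - 1 / R_E
)

shift_num = r_min - R_E
depth_num = v_eff(r_min) - (-V0 + Bh * LL)       # to compare with -Gh [l(l+1)]^2
omega_num = math.sqrt(d2(r_min) / MU)            # omega' from the exact curvature
omega_pred = omega - 2 * math.pi * alpha_e * LL  # formula (45)
```

With these parameters the numerical and first-order values agree to much better than one percent, which illustrates that the neglected higher-order terms are indeed small when the displacement of the minimum is small compared with $r_e$ and with $f/g$.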
b. ENERGY LEVELS AND WAVE FUNCTIONS
OF THE STATIONARY STATES


With expression (47) for $V_{\text{eff}}(r)$, the radial equation becomes:

$$\left[ -\frac{\hbar^2}{2\mu} \frac{d^2}{dr^2} + \frac{1}{2}\,\mu\omega'^2 (r - \tilde{r}_e)^2 - g\,(r - \tilde{r}_e)^3 \right] u_{v,l}(r) = \left[ E_{v,l} - V_{\text{eff}}(\tilde{r}_e) \right] u_{v,l}(r) \tag{48}$$

If, as in §2, we neglect the term in $g\,(r - \tilde{r}_e)^3$, we recognize the eigenvalue
equation of a one-dimensional harmonic oscillator of angular frequency $\omega'$ whose
equilibrium position is $r = \tilde{r}_e$. From this, we deduce that the only possible values
of the term in brackets on the right-hand side are $(v + 1/2)\hbar\omega'$, with $v = 0, 1, 2, \ldots$
According to (39), we therefore have:

$$E_{v,l} = -V_0 + \left(v + \tfrac{1}{2}\right)\hbar\omega' + Bh\, l(l+1) - Gh\,[l(l+1)]^2 \tag{49}$$


As for the wave functions of the stationary states, they have the same
form as in (25). All we need to do in expression (8) for the radial function is replace
$r_e$ by $\tilde{r}_e$ and $\beta$ by:

$$\beta' = \sqrt{\frac{\mu\omega'}{\hbar}} \tag{50}$$

We have taken into account the term in $g\,(r - r_e)^3$ in the calculation of the
new angular frequency $\omega'$. For the calculation to be consistent, it is then necessary
to evaluate the corrections in the eigenvalues and eigenfunctions of the radial
equation due to the presence of this term on the left-hand side of (48). We shall
do this in complement A$_{XI}$, using perturbation theory. Here, we shall content
ourselves with stating the result concerning the eigenvalues: we must add to
expression (49) for the energy the term:

$$-\xi\,\hbar\omega' \left(v + \tfrac{1}{2}\right)^2 \tag{51}$$

where:

$$\xi = \frac{15}{4}\,\frac{g^2 \hbar}{\mu^3 \omega'^5} \tag{52}$$

is a dimensionless quantity much smaller than 1 (hence $\omega'$ may be replaced by
$\omega$ in this corrective term).


C. INTERPRETATION OF THE VARIOUS CORRECTIONS 

a. Centrifugal distortion of the molecule 

The discussion of §3-a-α shows that the distance between the two nuclei
increases when the molecule rotates. According to (37), this increase in distance
becomes larger when $l(l+1)$ becomes larger, that is, when the molecule rotates faster.
This is easy to understand: in classical terms, one would say that the "centrifugal
force" tends to separate the two nuclei until it is balanced by the restoring force
$2f(\tilde r_e - r_e)$ due to the potential $V(r)$.

The molecule is therefore not really a "rigid rotator". The variation $\tilde r_e - r_e$
of the average distance between the nuclei produces an increase in the moment
of inertia of the molecule and, consequently, a decrease (at constant angular
momentum) in the rotational energy. This decrease is only partially compensated
by the increase in the potential energy $V(\tilde r_e) - V(r_e)$. This is the physical origin
of the energy correction $-G h\, l^2(l+1)^2$ which appears in (49). This correction,
whose sign is negative, grows in magnitude much faster with $l$ than the rotational energy
$B h\, l(l+1)$. This can be seen experimentally: the lines of the pure rotational
spectrum are not rigorously equidistant; the separation of the lines decreases when
$l$ increases.
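The decrease in line separation can be checked numerically. The sketch below is only an illustration: it assumes that the rotational part of the energy is $E_l/h = B\,l(l+1) - G\,l^2(l+1)^2$, as in (49), so that the line $l \to l+1$ has frequency $2B(l+1) - 4G(l+1)^3$. The function names and the numerical values of `B_DEMO` and `G_DEMO` are hypothetical and do not describe any real molecule.

```python
def rotational_line(l, B, G):
    """Frequency of the pure rotational line l -> l + 1.

    Assumes E_l / h = B l(l+1) - G l^2 (l+1)^2, which gives
    nu_l = (E_{l+1} - E_l) / h = 2 B (l+1) - 4 G (l+1)^3.
    """
    return 2.0 * B * (l + 1) - 4.0 * G * (l + 1) ** 3


def line_spacings(l_max, B, G):
    """Separations between adjacent lines, for the lines l = 0 .. l_max."""
    lines = [rotational_line(l, B, G) for l in range(l_max + 1)]
    return [upper - lower for lower, upper in zip(lines, lines[1:])]


# Hypothetical constants in arbitrary frequency units (illustration only).
B_DEMO = 10.0
G_DEMO = 1.0e-4

if __name__ == "__main__":
    for l, gap in enumerate(line_spacings(5, B_DEMO, G_DEMO)):
        print(f"lines {l} and {l + 1}: separation = {gap:.4f}")
```

With `G = 0` every separation equals `2 * B` (rigid rotator); with `G > 0` each separation is smaller than the previous one.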


β. Vibrational-rotational coupling

We shall group the second and third terms of (49) and replace $\omega'$ by its
expression (45). We obtain:

$$\left(v + \frac{1}{2}\right)\hbar\omega' + B h\, l(l+1) = \left(v + \frac{1}{2}\right)\hbar\omega + B h\, l(l+1) - \alpha_e h\, l(l+1)\left(v + \frac{1}{2}\right) \tag{53}$$

The first two terms on the right-hand side of (53) are the vibrational and
rotational energies calculated in complements $\mathrm{A_V}$ and $\mathrm{C_{VI}}$. The third term, which
depends on the two quantum numbers $v$ and $l$, represents the effects of the
coupling of the vibrational and rotational degrees of freedom.

We can rewrite (53) in the form:

$$\left(v + \frac{1}{2}\right)\hbar\omega + B_v h\, l(l+1) \tag{54}$$

with:

$$B_v = B - \alpha_e\left(v + \frac{1}{2}\right) \tag{55}$$

It looks as if each vibrational level had an effective rotational constant $B_v$,
depending on $v$, associated with it.

To explain this coupling of the vibration and rotation of the molecule, we
shall argue in classical terms. The rotational constant $B$ is proportional to $1/r_e^2$
[formula (16)]. When the molecule vibrates, $r$ varies, and, consequently, so does $B$.
Since the vibrational frequencies are much higher than the rotational ones, we can
define an effective rotational constant of the molecule in a given vibrational
state: this will be the average of $B$ taken over a time interval which is much
longer than the vibrational period. We must therefore take the time average of
$1/r^2$ in the vibrational state under consideration.

In this way, we can interpret the two terms of opposite sign which appear in
expression (46) for $\alpha_e$. The first of these terms, which is proportional to $g$, is due
to the anharmonicity of the potential $V(r)$, which increases with the amplitude of
vibration (that is, in fact, with $v$). Given the asymmetric form of $V(r)$ (fig. 1),
the molecule "spends more time" in the region $r > r_e$ than in the region $r < r_e$.
It follows that the average value of $1/r^2$ is less than $1/r_e^2$: the anharmonicity
decreases the effective rotational constant. This can be seen in formulas (55) and
(46). Actually, even if the vibrational motion were perfectly symmetrical with
respect to $r_e$ (that is, if $g$ were zero), the average value of $1/r^2$ would not be equal
to $1/r_e^2$, since:

$$\left\langle \frac{1}{r^2} \right\rangle \neq \frac{1}{\langle r \rangle^2} \tag{56}$$

This is the origin of the second term of expression (46): when the average of
$1/r^2$ is taken, small values of $r$ are favored, so that $\langle 1/r^2 \rangle$ is greater than
$1/\langle r \rangle^2$; hence the sign of this second correction.
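The inequality between the two averages can be illustrated with a purely classical sketch. It assumes a sinusoidal motion $r(t) = r_e + a\cos t$, symmetric about $r_e$, and averages over one period. The function name `averages_over_vibration` and the numerical values are hypothetical, chosen only for illustration.

```python
import math


def averages_over_vibration(r_e, amplitude, n_steps=10_000):
    """Time averages of r and of 1/r^2 over one period of r(t) = r_e + a cos(t).

    The motion is classical and symmetric about r_e (an assumption of this
    sketch); the averages use a midpoint rule on n_steps equal time steps.
    """
    sum_r = 0.0
    sum_inv_r2 = 0.0
    for k in range(n_steps):
        t = 2.0 * math.pi * (k + 0.5) / n_steps
        r = r_e + amplitude * math.cos(t)
        sum_r += r
        sum_inv_r2 += 1.0 / r ** 2
    return sum_r / n_steps, sum_inv_r2 / n_steps


if __name__ == "__main__":
    mean_r, mean_inv_r2 = averages_over_vibration(r_e=1.0, amplitude=0.1)
    print("<r>       =", mean_r)
    print("<1/r^2>   =", mean_inv_r2)
    print("1/<r>^2   =", 1.0 / mean_r ** 2)
```

For this motion the average of $r$ is $r_e$, while the average of $1/r^2$ is $r_e/(r_e^2 - a^2)^{3/2}$, which exceeds $1/r_e^2$ for any nonzero amplitude.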

The overall sign of $\alpha_e$ results from the competition between the two
preceding effects. In general, it is the anharmonicity term which dominates,
so that $\alpha_e$ is positive and $B_v$ is less than $B$.


Comments:

(i) Vibrational-rotational coupling exists even in the vibrational ground state $v = 0$:

$$B_0 = B - \frac{\alpha_e}{2} \tag{57}$$

This is another manifestation of the finite extension $(\Delta r)_0$ of the wave function of
the $v = 0$ state.

(ii) Experimentally, vibrational-rotational coupling appears in the following way: if $\alpha_e$ is
positive, the rotational structure is slightly more compact in the higher vibrational
state $v'$ than in the lower vibrational state $v = v' - 1$. It is easy to show that the P
and R branches of figure 3 are affected differently. Adjacent lines are no longer completely
equidistant and are, on the average, closer together in the R branch than in
the P branch.
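The different behavior of the two branches can be sketched numerically. The example assumes that each vibrational level has rotational energies $B_v h\, l(l+1)$ with $B_v = B - \alpha_e(v + 1/2)$, and that lines are labeled by the quantum number $l$ of the lower level, with R lines for $l \to l+1$ and P lines for $l \to l-1$ (a common convention, taken here as an assumption about figure 3). The band-origin frequency `nu0`, the function names and the numerical values are hypothetical.

```python
def effective_constant(B, alpha_e, v):
    """Effective rotational constant B_v = B - alpha_e (v + 1/2)."""
    return B - alpha_e * (v + 0.5)


def branch_lines(nu0, B, alpha_e, v_lower, l_max):
    """R and P line frequencies for the transition v_lower -> v_lower + 1.

    Lines are labeled by the rotational quantum number l of the lower level:
    R lines correspond to l -> l + 1 and P lines to l -> l - 1.
    """
    b_low = effective_constant(B, alpha_e, v_lower)
    b_up = effective_constant(B, alpha_e, v_lower + 1)
    r_branch = [nu0 + b_up * (l + 1) * (l + 2) - b_low * l * (l + 1)
                for l in range(l_max + 1)]
    p_branch = [nu0 + b_up * (l - 1) * l - b_low * l * (l + 1)
                for l in range(1, l_max + 1)]
    return r_branch, p_branch


def spacings(lines):
    """Absolute separations between adjacent lines of one branch."""
    return [abs(b - a) for a, b in zip(lines, lines[1:])]


if __name__ == "__main__":
    # Hypothetical values in arbitrary frequency units (illustration only).
    r_lines, p_lines = branch_lines(nu0=1000.0, B=10.0, alpha_e=0.1,
                                    v_lower=0, l_max=5)
    print("R-branch separations:", spacings(r_lines))
    print("P-branch separations:", spacings(p_lines))
```

With `alpha_e = 0` all separations equal `2 * B`; with `alpha_e > 0` the R-branch separations fall below that value and shrink with `l`, while the P-branch separations exceed it and grow.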


To sum up, the energy of a vibrational-rotational level of a diatomic
molecule, labeled by the quantum numbers $v$ and $l$, is given by:

$$E_{v,l} = -V_0 + \left(v + \frac{1}{2}\right)\hbar\omega + \left[B - \alpha_e\left(v + \frac{1}{2}\right)\right] h\, l(l+1) - G h\, l^2(l+1)^2 - \xi\left(v + \frac{1}{2}\right)^2\hbar\omega - \frac{7}{60}\,\xi\hbar\omega \tag{58}$$

$V_0$: dissociation energy of the molecule;
$\omega/2\pi$: vibrational frequency;
$B$: rotational constant given by (16);
$G$, $\alpha_e$, $\xi$: dimensionless constants given by (40), (46) and (52).
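The summary formula can be written as a small function. This is a minimal sketch under several assumptions: the terms are those of (49) with the vibration-rotation coupling (53) and the anharmonic correction $-\xi\hbar\omega[(v + 1/2)^2 + 7/60]$ added; `B`, `G` and `alpha_e` are entered in the same frequency units, so that each is multiplied by $h$; and `V0` is an energy in joules. The function and argument names are hypothetical.

```python
import math

H_PLANCK = 6.62607015e-34            # Planck's constant h, joule second (SI value)
HBAR = H_PLANCK / (2.0 * math.pi)    # reduced constant h / 2 pi


def level_energy(v, l, V0, omega, B, G, alpha_e, xi):
    """Energy (joules) of the vibrational-rotational level (v, l).

    Assumed conventions: omega is an angular frequency in rad/s; B, G and
    alpha_e are frequencies in Hz; xi is dimensionless; V0 is in joules.
    """
    x = v + 0.5
    rot = l * (l + 1)
    vibration = x * HBAR * omega
    rotation = (B - alpha_e * x) * H_PLANCK * rot      # uses B_v = B - alpha_e (v + 1/2)
    distortion = -G * H_PLANCK * rot ** 2              # centrifugal distortion
    anharmonic = -xi * HBAR * omega * (x ** 2 + 7.0 / 60.0)
    return -V0 + vibration + rotation + distortion + anharmonic


if __name__ == "__main__":
    # Hypothetical constants, for illustration only.
    params = dict(V0=0.0, omega=1.0e14, B=1.0e10, G=1.0e4, alpha_e=1.0e8, xi=1.0e-2)
    for v in (0, 1):
        for l in (0, 1, 2):
            print(v, l, level_energy(v, l, **params))
```

Setting `G`, `alpha_e` and `xi` to zero recovers the harmonic-oscillator plus rigid-rotator energies.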
References and suggestions for further reading:

Molecular spectra: Eisberg and Resnick (1.3), chap. 12; Pauling and Wilson (1.9),
chap. X; Karplus and Porter (12.1), chap. 7; Herzberg (12.4), Vol. I, chap. III,
§§2 b and 2 c; Landau and Lifshitz (1.19), chaps. XI and XIII.

Nuclear vibration and rotation: Valentin (16.1), §VII-2.


Complement $\mathrm{G_{VII}}$
EXERCISES 


1. Particle in a cylindrically symmetrical potential 

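Let $\rho$, $\varphi$, $z$ be the cylindrical coordinates of a spinless particle
($x = \rho\cos\varphi$, $y = \rho\sin\varphi$; $\rho \geq 0$, $0 \leq \varphi < 2\pi$). Assume that the potential energy
of this particle depends only on $\rho$, and not on $\varphi$ and $z$. Recall that:

$$\frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} = \frac{\partial^2}{\partial\rho^2} + \frac{1}{\rho}\frac{\partial}{\partial\rho} + \frac{1}{\rho^2}\frac{\partial^2}{\partial\varphi^2}$$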

a. Write, in cylindrical coordinates, the differential operator associated with
the Hamiltonian. Show that $H$ commutes with $L_z$ and $P_z$. Show from this that
the wave functions associated with the stationary states of the particle can be
chosen in the form:

$$\phi_{n,m,k}(\rho, \varphi, z) = f_{n,m}(\rho)\, e^{im\varphi}\, e^{ikz}$$

where the values that can be taken on by the indices $m$ and $k$ are to be specified.

b. Write, in cylindrical coordinates, the eigenvalue equation of the Hamiltonian
$H$ of the particle. Derive from it the differential equation which yields $f_{n,m}(\rho)$.

c. Let $\Sigma_y$ be the operator whose action, in the $\{ | \mathbf{r} \rangle \}$ representation, is to
change $y$ to $-y$ (reflection with respect to the $xOz$ plane). Does $\Sigma_y$ commute with $H$?
Show that $\Sigma_y$ anticommutes with $L_z$, and show from this that $\Sigma_y | \phi_{n,m,k} \rangle$ is an
eigenvector of $L_z$. What is the corresponding eigenvalue? What can be concluded
concerning the degeneracy of the energy levels of the particle? Could this result
be predicted directly from the differential equation established in (b)?


2. Three-dimensional harmonic oscillator 
in a uniform magnetic field 

N.B. The object of this exercise is to study a simple physical system for
which the effect of a uniform magnetic field can be calculated exactly. Thus, it is
possible in this case to compare precisely the relative importance of the "paramagnetic"
and "diamagnetic" terms, and to study in detail the modification of the
wave function of the ground state due to the effect of the diamagnetic term.
(The reader may wish to refer to complements $\mathrm{D_{VI}}$ and $\mathrm{B_{VII}}$.)

Consider a particle of mass $\mu$, whose Hamiltonian is:

$$H_0 = \frac{\mathbf{P}^2}{2\mu} + \frac{1}{2}\mu\omega_0^2\mathbf{R}^2$$

(an isotropic three-dimensional harmonic oscillator), where $\omega_0$ is a given positive
constant.

a. Find the energy levels of the particle and their degrees of degeneracy.
Is it possible to construct a basis of eigenstates common to $H_0$, $\mathbf{L}^2$, $L_z$?

b. Now, assume that the particle, which has a charge $q$, is placed in a uniform
magnetic field $\mathbf{B}$ parallel to $Oz$. We set $\omega_L = -qB/2\mu$. The Hamiltonian $H$ of the
particle is then, if we choose the gauge $\mathbf{A} = -\frac{1}{2}\,\mathbf{r} \times \mathbf{B}$:

$$H = H_0 + H_1(\omega_L)$$

where $H_1$ is the sum of an operator which is linearly dependent on $\omega_L$ (the
paramagnetic term) and an operator which is quadratically dependent on $\omega_L$
(the diamagnetic term). Show that the new stationary states of the system and
their degrees of degeneracy can be determined exactly.

c. Show that if $\omega_L$ is much smaller than $\omega_0$, the effect of the diamagnetic
term is negligible compared to that of the paramagnetic term.

d. We now consider the first excited state of the oscillator, that is, the
states whose energies approach $5\hbar\omega_0/2$ when $\omega_L \to 0$. To first order in $\omega_L/\omega_0$,
what are the energy levels in the presence of the field $\mathbf{B}$ and their degrees of
degeneracy (the Zeeman effect for a three-dimensional harmonic oscillator)?
Same questions for the second excited state.

e. Now consider the ground state. How does its energy vary as a function
of $\omega_L$ (the diamagnetic effect on the ground state)? Calculate the magnetic
susceptibility $\chi$ of this state. Is the ground state, in the presence of the field $\mathbf{B}$,
an eigenvector of $\mathbf{L}^2$? of $L_z$? of $L_x$? Give the form of its wave function and the
corresponding probability current. Show that the effect of the field $\mathbf{B}$ is to
compress the wave function about $Oz$ (in a ratio $[1 + (\omega_L/\omega_0)^2]^{1/4}$) and to
induce a current.